GENERAL FORMULATION AND EVALUATION OF AGGLOMERATIVE CLUSTERING METHODS WITH METRIC AND NONMETRIC DISTANCES

Citation
J. Boberg et T. Salakoski, GENERAL FORMULATION AND EVALUATION OF AGGLOMERATIVE CLUSTERING METHODS WITH METRIC AND NONMETRIC DISTANCES, Pattern recognition, 26(9), 1993, pp. 1395-1406
Citations number
47
Categorie Soggetti
Computer Sciences, Special Topics","Engineering, Eletrical & Electronic","Computer Applications & Cybernetics
Journal title
ISSN journal
00313203
Volume
26
Issue
9
Year of publication
1993
Pages
1395 - 1406
Database
ISI
SICI code
0031-3203(1993)26:9<1395:GFAEOA>2.0.ZU;2-0
Abstract
Agglomerative clustering methods with stopping criteria are generalize d. Clustering-related concepts are rigorously formulated with special consideration on metricity of object space. A new definition of combin atoriality is given, and a stronger proposition of monotonicity is pro ven. Specializations of the general method are applied to non-attribut ive non-metric and attributive pseudometric representations of biosequ ences. The furthest neighbor method is shown suitable for non-metric u se. In metric object space, four inter-clusteral distance functions, i ncluding a new truly context sensitive method, are compared using a me thod-independent goodness criterion. For biosequence clustering, the n ew method overcomes the UPGMA, UPGMC, and furthest neighbor methods.