Data clustering and noise undressing of correlation matrices - art. no. 061101

Citation
L. Giada et M. Marsili, Data clustering and noise undressing of correlation matrices - art. no. 061101, PHYS REV E, 6306(6), 2001, pp. 1101
Citations number
14
Categorie Soggetti
Physics
Journal title
PHYSICAL REVIEW E
ISSN journal
1063651X → ACNP
Volume
6306
Issue
6
Year of publication
2001
Part
1
Database
ISI
SICI code
1063-651X(200106)6306:6<1101:DCANUO>2.0.ZU;2-Q
Abstract
We discuss an approach to data clustering. We find that maximum likelihood leads naturally to an Hamiltonian of Potts variables that depends on the co rrelation matrix and whose low temperature behavior describes the correlati on structure of the data. For random, uncorrelated data sets no correlation structure emerges. On the other hand, for data sets with a built-in cluste r structure, the method is able to detect and recover efficiently that stru cture. Finally we apply the method to financial time series, where the low- temperature behavior reveals a nontrivial clustering.