ITA
ENG

The Russian gene pool: Gene geography of surnames

Authors

Balanovsky, OP Buzhilova, AP Balanovskaya, EV

Citation

Op. Balanovsky et al., The Russian gene pool: Gene geography of surnames, RUSS J GEN, 37(7), 2001, pp. 807-822

Citations number

Categorie Soggetti

Molecular Biology & Genetics

Journal title

RUSSIAN JOURNAL OF GENETICS

ISSN journal

10227954 → ACNP

Volume

Issue

Year of publication

2001

Pages

807 - 822

Database

ISI

SICI code

1022-7954(200107)37:7<807:TRGPGG>2.0.ZU;2-B

Abstract

Surnames are traditionally used in population genetics as "quasi-genetic" m arkers (i.e., analogs of genes) when studying the structure of the gene poo l and the factors of its microevolution. In this study, spatial variation o f Russian surnames was analyzed with the use of computer-based gene geograp hy. Gene geography of surnames was demonstrated to be promising for populat ion studies on the total Russian gene pool. Frequencies of surnames were st udied in 64 sel'sovets (rural communities; a total of 33 thousand persons) of 52 raions (districts) of 22 oblasts (regions) of the European part of Ru ssia. For each of 75 widespread surnames, an electronic map of its frequenc y was constructed. Summary maps of principal components were drawn based on all maps of individual surnames. The first 5 of 75 principal components ac counted for half of the total variance, which indicates high resolving powe r of surnames. The map of the first principal component exhibits a trend di rected from the northwestern to the eastern regions of the area studied. Th e trend of the second component was directed from the southwestern to the n orthern regions of the area studied, i.e., it was close to latitudinal. Thi s trend almost coincided with the latitudinal trend of principal components for three sets of data (genetic, anthropological, and dermatoglyphical). T herefore, the latitudinal trend may be considered the main direction of var iation of the Russian gene pool. The similarity between the main scenarios for the genetic and quasi-genetic markers demonstrates the effectiveness of the use of surnames for analysis of the Russian gene pool. In view of the dispute between R. Sokal and L.L. Cavalli-Sforza about the effects of false correlations, the maps of principal components of Russian surnames were co nstructed by two methods: through analysis of maps and through direct analy sis of original data on the frequencies of surnames. An almost complete coi ncidence of these maps (correlation coefficient rho = 0.96) indicates that, taking into account the reliability of the data, the resultant maps of pri ncipal components have no errors of false correlations.