Rf. Schipper et al., Validation of large data sets, an essential prerequisite for data-analysis - an analytical survey of the Bone Marrow Donors Worldwide, Tissue Antigens, 47(3), 1996, pp. 169-178
Large data sets like the Bone Marrow Donors Worldwide (BMDW) data set
can be used for population genetic analyses. The qualities of such
data sets are unique. To be able to use the BMDW data for analyses,
several problems with the data, such as limited size and selective DR
typing, have to be solved, and the quality of the registry data
subsets has to be examined. We describe these problems and methods to
overcome them. We also give an overview of the qualities of the
different registry subsets. Sixteen of the twenty-nine examined
subsets contain data that can be used for population genetic analysis.
We will deal with these analyses in the future. Additionally, we
present a method to calculate the minimum number of individuals
required for reliable haplotype frequency estimation.
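
The abstract does not describe how the minimum number of individuals is calculated. As a rough illustration only, and not necessarily the authors' method, the sketch below assumes a simple binomial sampling model: each individual contributes two haplotypes, and a haplotype frequency counts as reliably estimated when the confidence half-width is at most a chosen fraction of the frequency itself. The function name `min_individuals` and the parameters `rel_error` and `z` are hypothetical.

```python
import math


def min_individuals(freq, rel_error=0.25, z=1.96):
    """Smallest number of diploid individuals n such that the confidence
    half-width of an estimated haplotype frequency is at most
    rel_error * freq.

    Assumption (not taken from the paper): haplotype counts follow a
    binomial distribution over 2n sampled chromosomes, so the half-width
    is z * sqrt(freq * (1 - freq) / (2 * n)).
    """
    if not 0.0 < freq < 1.0:
        raise ValueError("freq must lie strictly between 0 and 1")
    if rel_error <= 0.0 or z <= 0.0:
        raise ValueError("rel_error and z must be positive")
    # Solve z * sqrt(freq * (1 - freq) / (2 * n)) <= rel_error * freq for n.
    n = z ** 2 * (1.0 - freq) / (2.0 * freq * rel_error ** 2)
    return math.ceil(n)
```

Under this model, rarer haplotypes need far larger samples: `min_individuals(0.01)` is roughly ten times `min_individuals(0.1)`, which suggests why small registry subsets may be unsuitable for frequency estimation.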