We report on a strategy for the identification of candidate genes for multi
ple malformation syndromes using expression data available in public databa
ses. The basis for this pilot study was the assumption that, for a multiple
malformation syndrome, the expression pattern of the causative gene should
at least cover the organs or tissues affected by the syndrome. Twenty malf
ormation syndromes were selected from the OMIM and defined by three to five
main symptoms. These key symptoms were translated into anatomical terms th
at were used to query the Gene eXpression Database (GXD). The searches cove
red 65% of the database and yielded an average of 16 candidate genes per sy
ndrome. Of these, 23% were ubiquitously expressed or housekeeping genes. Fu
rther database evaluation of these potential candidate genes was based on p
ositional information and on information from mouse knockouts, In a first e
xperiment, the correct gene was identified as a candidate in four of seven
syndromes for which the causative gene is already known. In addition, this
strategy identified new candidate genes for disorders for which the genetic
basis is unknown. We identified candidate genes for the Walker-Warburg, DO
OR, C, scalp-ear-nipple and oculocerebral hypopigmentation syndromes. Our r
esults suggest that it may ultimately be feasible to identify disease genes
by probing gene expression databases with simple syndrome descriptions.