We have searched the human genome for genes encoding new proteins that may
be involved in three nuclear gene expression processes: transcription, pre-
messenger RNA splicing and polyadenylation. A plethora of potential new fac
tors are implicated by sequence in nuclear gene expression, revealing a sub
stantial but selective increase in complexity compared with Drosophila mela
nogaster and Caenorhabditis elegans. Although the raw genomic information h
as limitations, its availability offers new experimental approaches for stu
dying gene expression.