HUGE is a database for human large proteins newly identified in the Kazusa
cDNA project, the aim of which is to predict the primary structure of prote
ins from the sequences of human large cDNAs (>4 kb). In particular, cDNA cl
ones capable of coding for large proteins (>50 kDa) are the current targets
of the project. HUGE contains >1100 cDNA sequences and detailed informatio
n obtained through analysis of the sequences of cDNAs and the predicted pro
teins, Besides an increase in the number of cDNA entries, the amount of exp
erimental data for expression profiling has been largely increased and data
on chromosomal locations have been newly added, All of the protein-coding
regions were examined by GeneMark analysis, and the results of a motif/doma
in search of each predicted protein sequence against the Pfam database have
been newly added. HUGE is available through the WWW at http://www.kazusa.o
r.jp/huge.