HIERARCHICAL LOGISTIC-REGRESSION MODELS FOR IMPUTATION OF UNRESOLVED ENUMERATION STATUS IN UNDERCOUNT ESTIMATION

Citation
Tr. Belin et al., HIERARCHICAL LOGISTIC-REGRESSION MODELS FOR IMPUTATION OF UNRESOLVED ENUMERATION STATUS IN UNDERCOUNT ESTIMATION, Journal of the American Statistical Association, 88(423), 1993, pp. 1149-1159
Citations number
27
Categorie Soggetti
Statistic & Probability","Statistic & Probability
Volume
88
Issue
423
Year of publication
1993
Pages
1149 - 1159
Database
ISI
SICI code
Abstract
In the process of collecting Post-Enumeration Survey (PES) data to eva luate census coverage, it is inevitable that there will be some indivi duals whose enumeration status (outcome in the census-PES match) remai ns unresolved even after extensive field follow-up operations. Earlier work developed a logistic regression framework for imputing the proba bility that unresolved individuals were enumerated in the census, so t hat the probability of having been enumerated is allowed to depend on covariates. The covariates may include demographic characteristics, ge ographic information, and census codes that summarize information on t he characteristics of the match (e.g., the before-follow-up match code assigned by clerks to describe the type of match between PES and cens us records). In the production of 1990 undercount estimates, the basic logistic regression model was expanded into a mixed hierarchical mode l to allow for the presence of group-specific effects, where groups ar e characterized by common before-follow-up match code. Parameter estim ates for individual match-code groups thus ''borrow strength'' across groups by making use of observed relationships between group-specific parameter estimates in the various groups and the characteristics of t he groups. This allows predictions to be made for groups for which the re are few or no resolved cases to which to fit the model. The model w as fitted by an approximate expectation-conditional-maximization (ECM) algorithm, using a large-sample approximation to the posterior distri butions of group parameters. Uncertainty in estimation of model parame ters was evaluated using a resampling procedure and became part of the evaluation of total error in PES estimates of population. Results fro m fitting the model in the 1990 Census and PES are described.