Title | On the asymptotic distribution of Pearson's $X^2$ in cross-validation samples |
Publication Type | Journal Article |
Year of Publication | 2006 |
Authors | Joe, H, Maydeu-Olivares, A |
Journal | Psychometrika |
Volume | 71 |
Pagination | 587-592 |
Date Published | SEP |
Type of Article | Article |
ISSN | 0033-3123 |
Keywords | contingency tables, goodness-of-fit, item response theory modeling, latent class analysis, quadratic form statistics |
Abstract | In categorical data analysis, two-sample cross-validation is used not only for model selection but also to obtain a realistic impression of the overall predictive effectiveness of the model. The latter is of particular importance in the case of highly parametrized models capable of capturing every idiosyncracy of the calibrating sample. We show that for maximum likelihood estimators or other asymptotically efficient estimators Pearson's X-2 is not asymptotically chi-square in the two-sample cross-validation framework due to extra variability induced by using different samples for estimation and goodness-off-it testing. We propose an alternative test statistic, X-xval(2) , obtained as a modification of X-2 which is asymptotically chi-square with C-1 degrees of freedom in cross-validation samples. Stochastically, X-xval(2) <= X-2. Furthermore, the use of X-2 instead of X-xval(2) with a chi(2)(C-1) reference distribution may provide an unduly poor impression of fit of the model in the cross-validation sample. |
DOI | 10.1007/s11336-005-1284-z |