On the asymptotic distribution of Pearson's $X^2$ in cross-validation samples

Title	On the asymptotic distribution of Pearson's $X^2$ in cross-validation samples
Publication Type	Journal Article
Year of Publication	2006
Authors	Joe, H, Maydeu-Olivares, A
Journal	Psychometrika
Volume	71
Pagination	587-592
Date Published	SEP
Type of Article	Article
ISSN	0033-3123
Keywords	contingency tables, goodness-of-fit, item response theory modeling, latent class analysis, quadratic form statistics
Abstract	In categorical data analysis, two-sample cross-validation is used not only for model selection but also to obtain a realistic impression of the overall predictive effectiveness of the model. The latter is of particular importance in the case of highly parametrized models capable of capturing every idiosyncrasy of the calibrating sample. We show that, for maximum likelihood estimators or other asymptotically efficient estimators, Pearson's $X^2$ is not asymptotically chi-square in the two-sample cross-validation framework, due to extra variability induced by using different samples for estimation and goodness-of-fit testing. We propose an alternative test statistic, $X^2_{\mathrm{xval}}$, obtained as a modification of $X^2$, which is asymptotically chi-square with $C-1$ degrees of freedom in cross-validation samples. Stochastically, $X^2_{\mathrm{xval}} \le X^2$. Furthermore, the use of $X^2$ instead of $X^2_{\mathrm{xval}}$ with a $\chi^2_{C-1}$ reference distribution may provide an unduly poor impression of fit of the model in the cross-validation sample.
DOI	10.1007/s11336-005-1284-z
