Sequential category aggregation and partitioning approaches for multi-way contingency tables based on survey and census data

التفاصيل البيبلوغرافية
العنوان: Sequential category aggregation and partitioning approaches for multi-way contingency tables based on survey and census data
المؤلفون: Jackson, L. Fraser, Gray, Alistair G., Fienberg, Stephen E.
المصدر: Annals of Applied Statistics 2008, Vol. 2, No. 3, 955-981
سنة النشر: 2008
المجموعة: Statistics
مصطلحات موضوعية: Statistics - Applications
الوصف: Large contingency tables arise in many contexts but especially in the collection of survey and census data by government statistical agencies. Because the vast majority of the variables in this context have a large number of categories, agencies and users need a systematic way of constructing tables which are summaries of such contingency tables. We propose such an approach in this paper by finding members of a class of restricted log-linear models which maximize the likelihood of the data and use this to find a parsimonious means of representing the table. In contrast with more standard approaches for model search in hierarchical log-linear models (HLLM), our procedure systematically reduces the number of categories of the variables. Through a series of examples, we illustrate the extent to which it can preserve the interaction structure found with HLLMs and be used as a data simplification procedure prior to HLL modeling. A feature of the procedure is that it can easily be applied to many tables with millions of cells, providing a new way of summarizing large data sets in many disciplines. The focus is on information and description rather than statistical testing. The procedure may treat each variable in the table in different ways, preserving full detail, treating it as fully nominal, or preserving ordinality.
Comment: Published in at http://dx.doi.org/10.1214/08-AOAS175Test the Annals of Applied Statistics (http://www.imstat.org/aoasTest/) by the Institute of Mathematical Statistics (http://www.imstat.orgTest)
نوع الوثيقة: Working Paper
DOI: 10.1214/08-AOAS175
الوصول الحر: http://arxiv.org/abs/0811.1686Test
رقم الانضمام: edsarx.0811.1686
قاعدة البيانات: arXiv