Goodness-of-Fit for the Clustered Binomial Models

  • Received : 2015.06.15
  • Accepted : 2015.08.16
  • Published : 2015.08.31

Abstract

We sometimes encounter the violation of independence between binary responses within subjects in clustered data that is commonly observed in cluster randomized trials. Under this situation the standard statistical methods that assume binomial distribution are inappropriate to use. In cluster randomized trials we have a target for comparing homogeneity of proportions between several treatment groups. The standard Pearson chi-squared statistic applied to the clustered data has a tendency to inflate type I error. Many researchers studied various kinds of adjusted chi-squared statistics in analyzing clustered binary data by incorporating the variance inflation and the intra-class correlation. In this paper we suggest the likelihood ratio test and the Wald test based on generalized linear mixed model. We explain the testing procedure via practical examples and also compare the performances through a Monte Carlo study.

Keywords

Acknowledgement

Supported by : Research Institute for Basic Sciences, Pusan National University

References

  1. Agresti, A. (2002). Categorical Data Analysis, 2nd ed., Wiley.
  2. Choi, D. Y., Jeong, K. M. (2014). Cumulative sums of residuals in GLMM and its implementation, Communications for Statistical Applications and Methods, 21(5), 423-433. https://doi.org/10.5351/CSAM.2014.21.5.423
  3. Cochran, W. G. (1977). Sampling Techniques, 3rd ed., Wiley.
  4. Donner, A. (1989). Statistical methods in ophthalmology: An adjusted chi-squared approach, Biometrics, 45, 605-611. https://doi.org/10.2307/2531501
  5. Eldridge, S. M., Ukoumunne, O. C., Carlin, J. B. (2009). The intra-cluster correlation coefficient in cluster randomized trials: A review of definitions, International Statistical Review, 77(3), 378-394. https://doi.org/10.1111/j.1751-5823.2009.00092.x
  6. Jeong, K. M., Lee, H. Y. (2013). Modeling overdispersion for clustered binomial data, Journal of the Korean Data Analysis Society, 15(5A), 2343-2356.
  7. Jeong, K. M., Lee, H. Y. (2014). The effects of extra-variation to the estimation of small-area proportions, Journal of the Korean Data Analysis Society, 16(6A), 2877-2887.
  8. Lee, D. H. (2013). Simulated maximum likelihood estimator for the zero inflated Poisson lognormal regression model, Journal of the Korean Data Analysis Society, 12(3B), 1347-1360. (in Korean).
  9. Lee, J. E., Lee, H. Y., Jeong, K. M. (2015). Power comparison of tests for extra-variation of counts data, Journal of the Korean Data Analysis Society, 17(3A), 1165-1174.
  10. Paul, S. R. (1982). Analysis of proportions of affected foetuses in teratological experiments, Biometrics, 38, 361-370. https://doi.org/10.2307/2530450
  11. Rao, J. N. K., Scott, A. J. (1992). A simple method for the analysis of clustered binary data, Biometrics, 48, 577-585. https://doi.org/10.2307/2532311
  12. Reed, J. F. (2004). Adjusted chi-square statistics : Application to clustered binary data in primary care, Analysis of Family Medicine, 2(3), 201-203. https://doi.org/10.1370/afm.41
  13. Ridout, M. S., Demetrio, C. G. B., Firth, D. (1999). Estimating intraclass correlation for binary data, Biometrics, 55(1), 137-148. https://doi.org/10.1111/j.0006-341X.1999.00137.x
  14. Sutradhar, S. C., Neerchal, N. K., Morel, J. G. (2007). A goodness-of-fit test for overdispersed binomial models, Journal of Statistical Planning and Inference, 138, 1459-1471.
  15. Turner, R. M., Omar, R. Z., Thompson, S. G. (2001). Bayesian methods of analysis for cluster randomized trials with binary outcome data, Statistics in Medicine, 20(3), 453-472. https://doi.org/10.1002/1097-0258(20010215)20:3<453::AID-SIM803>3.0.CO;2-L
  16. Williams, D. A. (1975). The analysis of binary responses from toxicological experiments involving reproduction and teratogenicity, Biometrics, 31, 949-952. https://doi.org/10.2307/2529820