Two Useful Discrete Distributions to Model Overdispersed Count Data

  • Authors: Josmar Mazucheli, Wesley Bertoli, Ricardo Oliveira
  • Source: Revista Colombiana de Estadística, ISSN-e 2389-8976, ISSN 0120-1751, Vol. 43, No. 1, 2020, pp. 21-48
  • Language: English
  • DOI: 10.15446/rce.v43n1.77052
  • Parallel titles:
    • Dos distribuciones discretas útiles para modelar datos de recuento sobredispersos
  • Abstract
    • English

      Methods for obtaining discrete analogs of continuous distributions have been widely considered in recent years. In general, the discretization process provides probability mass functions that can be competitive with the traditional model used in the analysis of count data, the Poisson distribution. The discretization procedure also avoids the use of a continuous distribution in the analysis of strictly discrete data. In this paper, we seek to introduce two discrete analogs of the Shanker distribution, using the infinite series method and the method based on the survival function, as alternatives for modeling overdispersed datasets. Despite the difference between the discretization methods, the resulting distributions are interchangeable. However, the distribution generated by the infinite series method has simpler mathematical expressions for the shape, the generating functions, and the central moments. Maximum likelihood theory is considered for estimation and asymptotic inference. A simulation study is carried out to evaluate some frequentist properties of the developed methodology. The usefulness of the proposed models is evaluated using real datasets from the literature.
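
The two discretization methods named in the abstract can be illustrated with a minimal Python sketch. This record does not reproduce the paper's formulas, so the sketch rests on assumptions: the continuous Shanker density is taken as f(x; θ) = θ²/(θ² + 1) · (θ + x) · e^(−θx) for x > 0, following Shanker (2015), and the two methods are taken in their standard forms, namely P(X = x) = f(x) / Σ_j f(j) for the infinite series method and P(X = x) = S(x) − S(x + 1) for the survival function method. All function names are hypothetical, and the authors' own parameterization may differ.

```python
import math


def shanker_pdf(x, theta):
    """Continuous Shanker density (assumed form, after Shanker 2015)."""
    return theta**2 / (theta**2 + 1) * (theta + x) * math.exp(-theta * x)


def shanker_sf(x, theta):
    """Survival function S(x) = P(X > x) implied by the density above."""
    return (theta**2 + 1 + theta * x) / (theta**2 + 1) * math.exp(-theta * x)


def pmf_infinite_series(x, theta):
    """Infinite series method: f(x) divided by the sum of f(j), j = 0, 1, 2, ..."""
    q = math.exp(-theta)
    # Closed form of sum_{j>=0} f(j), from the geometric-series identities
    # sum q^j = 1/(1-q) and sum j*q^j = q/(1-q)^2.
    norm = theta**2 / (theta**2 + 1) * (theta / (1 - q) + q / (1 - q) ** 2)
    return shanker_pdf(x, theta) / norm


def pmf_survival(x, theta):
    """Survival function method: P(X = x) = S(x) - S(x + 1)."""
    return shanker_sf(x, theta) - shanker_sf(x + 1, theta)


def mean_and_variance(pmf, theta, upper=500):
    """Numerical mean and variance over x = 0, ..., upper - 1 (truncated sum)."""
    probs = [pmf(x, theta) for x in range(upper)]
    mean = sum(x * p for x, p in enumerate(probs))
    second = sum(x * x * p for x, p in enumerate(probs))
    return mean, second - mean**2


# Compare mean and variance of both analogs at an illustrative theta.
for name, pmf in [("infinite series", pmf_infinite_series),
                  ("survival function", pmf_survival)]:
    m, v = mean_and_variance(pmf, theta=0.5)
    print(f"{name}: mean={m:.3f}, variance={v:.3f}")
```

Under these assumptions, a variance larger than the mean at a given θ indicates overdispersion relative to the Poisson distribution, whose variance equals its mean. The truncation point `upper` is an illustrative choice and would need to grow for small θ, where the tail is heavier.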

The article metadata were obtained from SciELO Colombia.
