| Peer-Reviewed

Incorporating Survey Weights into Binary and Multinomial Logistic Regression Models

Received: 29 July 2015     Accepted: 4 August 2015     Published: 19 November 2015
Views:       Downloads:
Abstract

Since sampling weights are not simply equal to the reciprocal of selection probabilities its always challenging to incorporate survey weights into likelihood-based analysis. These weights are always adjusted for various characteristics. In cases where logistic regression model is used to predict categorical outcomes with survey data, the sampling weights should be considered if the sampling design does not give each individual an equal chance of being selected in the sample. The weights are rescaled to sum to an equivalent sample size since original weights have small variances. The new weights are called the adjusted weights. Quasi-likelihood maximization is the method that is used to make estimation with the adjusted weights but the other new method that can be created is correct likelihood for logistic regression which included the adjusted weights. Adjusted weights are further used to adjust for both covariates and intercepts when the correct likelihood method was used. We also looked at the differences and similarities between the two methods. Analysis: Both binary logistic regression model and multinomial logistic regression model were used in parameter estimation and we applied the methods to body mass index data from Nairobi Hospital, which is in Nairobi County where a sample of 265 was used. R-software Version 3.0.2 was used in the analysis. Conclusion: The results from the study showed that there were some similarities and differences between the quasi-likelihood and correct likelihood methods in parameter estimates, standard errors and statistical p-values.

Published in Science Journal of Applied Mathematics and Statistics (Volume 3, Issue 6)
DOI 10.11648/j.sjams.20150306.13
Page(s) 243-249
Creative Commons

This is an Open Access article, distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution and reproduction in any medium or format, provided the original work is properly cited.

Copyright

Copyright © The Author(s), 2015. Published by Science Publishing Group

Keywords

Binary Logistic Regression, Multinomial Logistic Regression, Adjusted Weights, Correct Likelihood, Quasi-Likelihood, Nairobi

References
[1] Andersson J.C, Verkuilen, J., and Peyton, B. L. (2010).Modeling Polytomous Item Responses using Simultaneously Estimated Multinomial Logistic Regression Model. Journal of Educational and Behavioral Statistics, 422.
[2] Balgobin Nandram and Jai Won Choi. (2010). A Bayesian Analysis of Body Mass Index Data from Small Domains Under Nonignorable Nonresponse and Selection. Journal of American Statistical Association, 105, 120-135.
[3] Courvoisier, D.S., C. Combescure, T. Agoritsas, A. Gayet-Ageron and T.V. Perneger (2011) “Performance of logistic regression modeling: beyond the number of events per variable, the role of data structure.” Journal of Clinical Epidemiology 64: 993-1000.
[4] Einicke, G.A.; Falco, G.; Dunn, M.T.; Reid, D.C. (May 2012). "Iterative Smoother-Based Variance Estimation". IEEE Signal Processing Letters 19 (5)
[5] Gelman, Andrew. (2007) Struggles with Survey Weighting and Regression Modeling. Statistical Science, 22, 153-164.
[6] Grilli, L., and Pratesi, M. (2004). Weighted Estimation in Multinomial Ordinal and Binary Models in the Presence of Informative Sampling Designs. Survey Methodology, 30, 93-103.
[7] Hosmer D.W and Lemeshow S, (2000), Applied Logistic Regression 2nd Ed. John Wiley & Sons, Inc. Canada. PP1-17
[8] Michael, H., Kutner, C J., and Nachtsheim, J. N. (2004).Applied Linear Regression Models. McGraw-Hill/Irwin
[9] Nandram, B., and Choi, J. W. (2002), A Hierarchical Bayesian Nonresponse Model for Binary Data With Uncertainty About Ignorability, Journal of the American Statistical Association, 97, 381-388.
[10] Pfeffermann, D., Skinner, C. J., Holmes, D. J., Goldstein, H., and Rasbash, J. (1998).Weighting for Unequal Selection Probabilities in Multinomial Models. Journal of the Royal Statistical Society, 60, 23-40.
[11] Potthoff R, Woodbury, M. A., and Manton, K. G. (1992). Equivalent Sample Size and Equivalent Degrees of Freedom Refinements for Inference using Survey Weights under Super population Models. Journal of the American Statistical Association, 87, 383-396.
[12] Swinburn BA, Sacks G, Hall KD, et al. The global obesity pandemic: shaped by global drivers and local environments. Lancet. 2011; 378(9793)
[13] Wang YC, McPherson K, Marsh T, Gortmaker SL, Brown M. Health and economic burden of the projected obesity trends in the USA and the UK. Lancet. 2011; 378(9793).
Cite This Article
  • APA Style

    Kennedy Sakaya Barasa, Chris Muchwanju. (2015). Incorporating Survey Weights into Binary and Multinomial Logistic Regression Models. Science Journal of Applied Mathematics and Statistics, 3(6), 243-249. https://doi.org/10.11648/j.sjams.20150306.13

    Copy | Download

    ACS Style

    Kennedy Sakaya Barasa; Chris Muchwanju. Incorporating Survey Weights into Binary and Multinomial Logistic Regression Models. Sci. J. Appl. Math. Stat. 2015, 3(6), 243-249. doi: 10.11648/j.sjams.20150306.13

    Copy | Download

    AMA Style

    Kennedy Sakaya Barasa, Chris Muchwanju. Incorporating Survey Weights into Binary and Multinomial Logistic Regression Models. Sci J Appl Math Stat. 2015;3(6):243-249. doi: 10.11648/j.sjams.20150306.13

    Copy | Download

  • @article{10.11648/j.sjams.20150306.13,
      author = {Kennedy Sakaya Barasa and Chris Muchwanju},
      title = {Incorporating Survey Weights into Binary and Multinomial Logistic Regression Models},
      journal = {Science Journal of Applied Mathematics and Statistics},
      volume = {3},
      number = {6},
      pages = {243-249},
      doi = {10.11648/j.sjams.20150306.13},
      url = {https://doi.org/10.11648/j.sjams.20150306.13},
      eprint = {https://article.sciencepublishinggroup.com/pdf/10.11648.j.sjams.20150306.13},
      abstract = {Since sampling weights are not simply equal to the reciprocal of selection probabilities its always challenging to incorporate survey weights into likelihood-based analysis. These weights are always adjusted for various characteristics. In cases where logistic regression model is used to predict categorical outcomes with survey data, the sampling weights should be considered if the sampling design does not give each individual an equal chance of being selected in the sample. The weights are rescaled to sum to an equivalent sample size since original weights have small variances. The new weights are called the adjusted weights. Quasi-likelihood maximization is the method that is used to make estimation with the adjusted weights but the other new method that can be created is correct likelihood for logistic regression which included the adjusted weights. Adjusted weights are further used to adjust for both covariates and intercepts when the correct likelihood method was used. We also looked at the differences and similarities between the two methods. Analysis: Both binary logistic regression model and multinomial logistic regression model were used in parameter estimation and we applied the methods to body mass index data from Nairobi Hospital, which is in Nairobi County where a sample of 265 was used. R-software Version 3.0.2 was used in the analysis. Conclusion: The results from the study showed that there were some similarities and differences between the quasi-likelihood and correct likelihood methods in parameter estimates, standard errors and statistical p-values.},
     year = {2015}
    }
    

    Copy | Download

  • TY  - JOUR
    T1  - Incorporating Survey Weights into Binary and Multinomial Logistic Regression Models
    AU  - Kennedy Sakaya Barasa
    AU  - Chris Muchwanju
    Y1  - 2015/11/19
    PY  - 2015
    N1  - https://doi.org/10.11648/j.sjams.20150306.13
    DO  - 10.11648/j.sjams.20150306.13
    T2  - Science Journal of Applied Mathematics and Statistics
    JF  - Science Journal of Applied Mathematics and Statistics
    JO  - Science Journal of Applied Mathematics and Statistics
    SP  - 243
    EP  - 249
    PB  - Science Publishing Group
    SN  - 2376-9513
    UR  - https://doi.org/10.11648/j.sjams.20150306.13
    AB  - Since sampling weights are not simply equal to the reciprocal of selection probabilities its always challenging to incorporate survey weights into likelihood-based analysis. These weights are always adjusted for various characteristics. In cases where logistic regression model is used to predict categorical outcomes with survey data, the sampling weights should be considered if the sampling design does not give each individual an equal chance of being selected in the sample. The weights are rescaled to sum to an equivalent sample size since original weights have small variances. The new weights are called the adjusted weights. Quasi-likelihood maximization is the method that is used to make estimation with the adjusted weights but the other new method that can be created is correct likelihood for logistic regression which included the adjusted weights. Adjusted weights are further used to adjust for both covariates and intercepts when the correct likelihood method was used. We also looked at the differences and similarities between the two methods. Analysis: Both binary logistic regression model and multinomial logistic regression model were used in parameter estimation and we applied the methods to body mass index data from Nairobi Hospital, which is in Nairobi County where a sample of 265 was used. R-software Version 3.0.2 was used in the analysis. Conclusion: The results from the study showed that there were some similarities and differences between the quasi-likelihood and correct likelihood methods in parameter estimates, standard errors and statistical p-values.
    VL  - 3
    IS  - 6
    ER  - 

    Copy | Download

Author Information
  • Department of Statistics and Actuarial Sciences, Jomo Kenyatta University of Science and Technology, Nairobi, Kenya

  • Department of Statistics and Actuarial Sciences, Jomo Kenyatta University of Science and Technology, Nairobi, Kenya

  • Sections