Covariate adjustment of spirometric and smoking phenotypes: The potential of neural network models.

Journal: PloS one
PMID:

Abstract

To increase power and minimize bias in statistical analyses, quantitative outcomes are often adjusted for precision and confounding variables using standard regression approaches. The outcome is modeled as a linear function of the precision variables and confounders; however, for many complex phenotypes, the assumptions of the linear regression models are not always met. As an alternative, we used neural networks for the modeling of complex phenotypes and covariate adjustments. We compared the prediction accuracy of the neural network models to that of classical approaches based on linear regression. Using data from the UK Biobank, COPDGene study, and Childhood Asthma Management Program (CAMP), we examined the features of neural networks in this context and compared them with traditional regression approaches for prediction of three outcomes: forced expiratory volume in one second (FEV1), age at smoking cessation, and log transformation of age at smoking cessation (due to age at smoking cessation being right-skewed). We used mean squared error to compare neural network and regression models, and found the models performed similarly unless the observed distribution of the phenotype was skewed, in which case the neural network had smaller mean squared error. Our results suggest neural network models have an advantage over standard regression approaches when the phenotypic distribution is skewed. However, when the distribution is not skewed, the approaches performed similarly. Our findings are relevant to studies that analyze phenotypes that are skewed by nature or where the phenotype of interest is skewed as a result of the ascertainment condition.

Authors

  • Kirsten Voorhies
    Department of Population Medicine, PRecisiOn Medicine Translational Research (PROMoTeR) Center, Harvard Pilgrim Health Care and Harvard Medical School, Boston, MA, United States of America.
  • Ruofan Bie
    Department of Biostatistics, T.H. Chan School of Public Health, Harvard University, Boston, MA, United States of America.
  • John E Hokanson
    Department of Epidemiology, University of Colorado, Denver, Aurora, CO.
  • Scott T Weiss
    From Research Information Systems and Computing (V.M.C., V.G., S.M.), Partners Healthcare; Boston Children's Hospital Informatics Program (D.D., S.F., G.S.); Harvard Medical School (D.D., S.Y., A.C., M.A.-E.-B., N.A.S., S.M., S.T.W., R.D.); Department of Medicine (S.Y., S.T.W.), Department of Neurosurgery (A.C., M.A.-E.-B., R.D.), Division of Rheumatology, Immunology and Allergy (N.A.S.), and Channing Division of Network Medicine (S.T.W., R.D.), Brigham and Women's Hospital, Boston, MA; Center for Statistical Science (S.Y.), Tsinghua University, Beijing, China; Department of Neurology (S.M.), Massachusetts General Hospital; and Biostatistics (T.C.), Harvard School of Public Health, Boston, MA.
  • Ann Chen Wu
    Department of Population Medicine, PRecisiOn Medicine Translational Research (PROMoTeR) Center, Harvard Pilgrim Health Care and Harvard Medical School, Boston, MA, United States of America.
  • Julian Hecker
    Channing Division of Network Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, United States of America.
  • Georg Hahn
    Department of Biostatistics, T.H. Chan School of Public Health, Harvard University, Boston, MA, United States of America.
  • Dawn L Demeo
    Channing Division of Network Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, United States of America.
  • Edwin Silverman
    Channing Division of Network Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, United States of America.
  • Michael H Cho
    Channing Division of Network Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA; Pulmonary and Critical Care Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA.
  • Christoph Lange
    Clinical Infectious Diseases, Research Center Borstel, Borstel, Germany.
  • Sharon M Lutz
    Department of Population Medicine, PRecisiOn Medicine Translational Research (PROMoTeR) Center, Harvard Pilgrim Health Care and Harvard Medical School, Boston, MA, United States of America.