Relative importance of symptoms, cognition, and other multilevel variables for psychiatric disease classifications by machine learning.

Journal: Psychiatry research
PMID:

Abstract

This study used machine-learning algorithms to make unbiased estimates of the relative importance of various multilevel data for classifying cases with schizophrenia (n = 60), schizoaffective disorder (n = 19), bipolar disorder (n = 20), unipolar depression (n = 14), and healthy controls (n = 51) into psychiatric diagnostic categories. The Random Forest machine learning algorithm, which showed best efficacy (92.9% SD: 0.06), was used to generate variable importance ranking of positive, negative, and general psychopathology symptoms, cognitive indexes, global assessment of function (GAF), and parental ages at birth for sorting participants into diagnostic categories. Symptoms were ranked most influential for separating cases from healthy controls, followed by cognition and maternal age. To separate schizophrenia/schizoaffective disorder from bipolar/unipolar depression, GAF was most influential, followed by cognition and paternal age. For classifying schizophrenia from all other psychiatric disorders, low GAF and paternal age were similarly important, followed by cognition, psychopathology and maternal age. Controls misclassified as schizophrenia cases showed lower nonverbal abilities, mild negative and general psychopathology symptoms, and younger maternal or older paternal age. The importance of symptoms for classification of cases and lower GAF for diagnosing schizophrenia, notably more important and distinct from cognition and symptoms, concurs with current practices. The high importance of parental ages is noteworthy and merits further study.

Authors

  • Julie Walsh-Messinger
    Department of Psychology, University of Dayton, Dayton, OH, United States; Department of Psychiatry, Wright State University Boonshoft School of Medicine, Dayton, OH, United States. Electronic address: jmessinger1@udayton.edu.
  • Haoran Jiang
    Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, NY, United States.
  • Hyejoo Lee
    Korea Institute of Science and Technology, Seoul, Republic of Korea.
  • Karen Rothman
    Department of Psychology, University of Miami, Coral Gables, FL, United States.
  • Hongshik Ahn
    Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, NY, United States.
  • Dolores Malaspina
    Icahn Medical School at Mount Sinai, New York, NY, United States.