Performance of machine learning classification models of autism using resting-state fMRI is contingent on sample heterogeneity.

Journal: Neural computing & applications
Published Date:

Abstract

Autism spectrum disorders (ASDs) are heterogeneous neurodevelopmental conditions. In fMRI studies, including most machine learning studies seeking to distinguish ASD from typical developing (TD) samples, cohorts differing in gender and symptom severity composition are often treated statistically as one "ASD group". Using resting-state functional connectivity (FC) data, we implemented random forest to build diagnostic classifiers in 4 ASD samples including a total of 656 participants (N = 306, N = 350, ages 6-18). Groups were manipulated to titrate heterogeneity of gender and symptom severity and partially overlapped. Each sample differed on inclusionary criteria: (1) all genders, unrestricted severity range; (2) only male participants, unrestricted severity; (3) all genders, higher severity only; (4) only male participants, higher severity. Each set consisted of 200 participants per group (ASD, TD; matched on age and head motion), 160 for training and 40 for validation. FMRI time series from 237 regions of interest (ROIs) were Pearson correlated in a 237×237 FC matrix and classifiers were built using random forest in training samples. Classification accuracies in validation samples were 62.5%, 65%, 70% and 73.75%, respectively for samples 1-4. Connectivity within cingulo-opercular task control (COTC) network, and between COTC ROIs and default mode and dorsal attention network contributed overall most informative features, but features differed across sets. Findings suggest that diagnostic classifiers vary depending on ASD sample composition. Specifically, greater homogeneity of samples regarding gender and symptom severity enhances classifier performance. However, given the true heterogeneity of ASDs, performance metrics alone may not adequately reflect classifier utility.

Authors

  • Maya A Reiter
    Brain Development Imaging Lab (BDIL), Psychology, San Diego State University (SDSU), 6363 Alvarado Ct. Suite 200, San Diego, CA 92120, USA.
  • Afrooz Jahedi
    Computational Science, San Diego State University/ Claremont Graduate University's Joint Doctoral Program, San Diego, CA, USA.
  • A R Jac Fredo
    Computational Science, San Diego State University/ Claremont Graduate University's Joint Doctoral Program, San Diego, CA, USA.
  • Inna Fishman
    Brain Development Imaging Lab (BDIL), Psychology, San Diego State University (SDSU), 6363 Alvarado Ct. Suite 200, San Diego, CA 92120, USA.
  • Barbara Bailey
    Department of Mathematics and Statistics, San Diego State University, San Diego, California.
  • Ralph-Axel Müller
    Brain Development Imaging Lab (BDIL), Psychology, San Diego State University (SDSU), 6363 Alvarado Ct. Suite 200, San Diego, CA 92120, USA.

Keywords

No keywords available for this article.