A comparative machine learning study of schizophrenia biomarkers derived from functional connectivity.

Journal: Scientific reports
PMID:

Abstract

Functional connectivity holds promise as a biomarker of schizophrenia. Yet, the high dimensionality of predictive models trained on functional connectomes, combined with small sample sizes in clinical research, increases the risk of overfitting. Recently, low-dimensional representations of the connectome such as macroscale cortical gradients and gradient dispersion have been proposed, with studies noting consistent gradient and dispersion differences in psychiatric conditions. However, it is unknown which of these derived measures has the highest predictive capacity and how they compare to raw functional connectivity specifically in the case of schizophrenia. Our study evaluates which connectome features derived from resting state functional MRI - functional connectivity, gradients, or gradient dispersion - best identify schizophrenia. To this end, we leveraged data of 936 individuals from three large open-access datasets: COBRE, LA5c, and SRPBS-1600. We developed a pipeline which allows us to aggregate over a million different features and assess their predictive potential in a single, computationally efficient experiment. We selected top 1% of features with the largest permutation feature importance and trained 13 classifiers on them using 10-fold cross-validation. Our findings indicate that functional connectivity outperforms its low-dimensional derivatives such as cortical gradients and gradient dispersion in identifying schizophrenia (Mann-Whitney test conducted on test accuracy: connectivity vs. 1st gradient: U = 142, p < 0.003; connectivity vs. neighborhood dispersion: U = 141, p = 0.004). Additionally, we demonstrated that the edges which contribute the most to classification performance are the ones connecting primary sensory regions. Functional connectivity within the primary sensory regions showed the highest discrimination capabilities between subjects with schizophrenia and neurotypical controls. These findings along with the feature selection pipeline proposed here will facilitate future inquiries into the prediction of schizophrenia subtypes and transdiagnostic phenomena.

Authors

  • Victoria Shevchenko
    Cognitive Neuroanatomy Lab, INCC UMR 8002, CNRS, Université Paris Cité, Paris, France. victoria.shevchenko@inria.fr.
  • R Austin Benn
    Centro Nacional de Investigaciones Cardiovasculares (CNIC), Madrid, Spain.
  • Robert Scholz
    Cognitive Neuroanatomy Lab, INCC UMR 8002, CNRS, Université Paris Cité, Paris, France.
  • Wei Wei
    Dept. Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA, USA.
  • Carla Pallavicini
    Fundación para la lucha contra las enfermedades neurológicas de la infancia (FLENI), Montañeses 2325, C1428, AQK, Buenos Aires, Argentina; Departamento de Física, Universidad de Buenos Aires and Instituto de Física de Buenos Aires (IFIBA - CONICET), Pabellón I, Ciudad Universitaria (1428), Buenos Aires, Argentina. Electronic address: carlap@df.uba.ar.
  • Ulysse Klatzmann
    Cognitive Neuroanatomy Lab, INCC UMR 8002, CNRS, Université Paris Cité, Paris, France.
  • Francesco Alberti
    Cognitive Neuroanatomy Lab, INCC UMR 8002, CNRS, Université Paris Cité, Paris, France.
  • Theodore D Satterthwaite
    Department of Psychiatry, University of Pennsylvania School of Medicine, Philadelphia, PA, USA.
  • Demian Wassermann
    Université Côte d'Azur, Inria, Sophia-Antipolis, France; Parietal, CEA, Inria, Saclay, Île-de-France.
  • Pierre-Louis Bazin
    Full Brain Picture Analytics, Leiden, The Netherlands.
  • Daniel S Margulies
    Cognitive Neuroanatomy Lab, INCC UMR 8002, CNRS, Université Paris Cité, Paris, France. daniel.margulies@cnrs.fr.