Immune-based Machine learning Prediction of Diagnosis and Illness State in schizophrenia and bipolar Disorder: How data bias and overfitting were avoided.

Journal: Brain, behavior, and immunity
PMID:

Abstract

In a letter critiquing our manuscript, Takefuji highlights general pitfalls in machine learning, without directly engaging with our study. The comments provide generic advice rather than a specific critique of our methods or findings. Despite raising important topics, the concerns reflect standard risks in machine learning, which we were aware of and explicitly addressed in our analyses. We applied rigorous methods, including nested cross-validation, stratified sampling, and comprehensive performance metrics, to mitigate overfitting, class imbalance, and potential biases. Traditional statistical methods, such as ANCOVA and Spearman correlations, were employed and supplemented our machine learning analysis to validate findings. Concerns about collinearity, causality, and data preprocessing were acknowledged and addressed as detailed in the manuscript and supplementary materials. Although the critique underscores critical issues in machine learning, it does not identify specific missteps in our study. We conclude that our analyses align with best practices and sufficiently address the potential pitfalls discussed in the commentary.

Authors

  • Katrien Skorobogatov
    Scientific Initiative for Neuropsychiatric and Psychopharmacological Studies (SINAPS), University Psychiatric Hospital Campus Duffel (UPCD), Rooienberg 19, 2570 Duffel, Belgium; Collaborative Antwerp Psychiatric Research Institute (CAPRI), University of Antwerp, Campus Drie Eiken, S.003, Universiteitsplein 1, 2610 Wilrijk, Belgium. Electronic address: katrien.skorobogatov@uantwerpen.be.
  • Pieter Meysman
    Adrem Data Lab, Department of Computer Science, University of Antwerp, Antwerp, Belgium; Antwerp Unit for Data Analysis and Computation in Immunology and Sequencing (AUDACIS), University of Antwerp, Antwerp, Belgium; Biomedical Informatics Research Network Antwerp (Biomina), University of Antwerp, Antwerp, Belgium.
  • Manuel Morrens
    Scientific Initiative for Neuropsychiatric and Psychopharmacological Studies (SINAPS), University Psychiatric Hospital Campus Duffel (UPCD), Rooienberg 19, 2570 Duffel, Belgium; Collaborative Antwerp Psychiatric Research Institute (CAPRI), University of Antwerp, Campus Drie Eiken, S.003, Universiteitsplein 1, 2610 Wilrijk, Belgium.
  • Marion Leboyer
    Université Paris Est Créteil (UPEC), Inserm U955, IMRB Translational Neuropsychiatry Laboratory, AP-HP, Hôpitaux Universitaires H Mondor, DMU IMPACT, FHU ADAPT, Fondation FondaMental, Créteil, France.
  • Livia De Picker
    Scientific Initiative for Neuropsychiatric and Psychopharmacological Studies (SINAPS), University Psychiatric Hospital Campus Duffel (UPCD), Rooienberg 19, 2570 Duffel, Belgium; Collaborative Antwerp Psychiatric Research Institute (CAPRI), University of Antwerp, Campus Drie Eiken, S.003, Universiteitsplein 1, 2610 Wilrijk, Belgium.