Combining heterogeneous data sources for neuroimaging based diagnosis: re-weighting and selecting what is important.

Journal: NeuroImage
Published Date:

Abstract

Combining neuroimaging and clinical information for diagnosis, as for example behavioral tasks and genetics characteristics, is potentially beneficial but presents challenges in terms of finding the best data representation for the different sources of information. Their simple combination usually does not provide an improvement if compared with using the best source alone. In this paper, we proposed a framework based on a recent multiple kernel learning algorithm called EasyMKL and we investigated the benefits of this approach for diagnosing two different mental health diseases. The well known Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset tackling the Alzheimer Disease (AD) patients versus healthy controls classification task, and a second dataset tackling the task of classifying an heterogeneous group of depressed patients versus healthy controls. We used EasyMKL to combine a huge amount of basic kernels alongside a feature selection methodology, pursuing an optimal and sparse solution to facilitate interpretability. Our results show that the proposed approach, called EasyMKLFS, outperforms baselines (e.g. SVM and SimpleMKL), state-of-the-art random forests (RF) and feature selection (FS) methods.

Authors

  • Michele Donini
    Computational Statistics and Machine Learning (CSML), Istituto Italiano di Tecnologia, Genova, Italy. Electronic address: donini.michele@gmail.com.
  • João M Monteiro
    Department of Computer Science, University College London, London, UK; Max Planck University College London Centre for Computational Psychiatry and Ageing Research, University College London, London, UK. Electronic address: joao.monteiro@ucl.ac.uk.
  • Massimiliano Pontil
    Computational Statistics and Machine Learning (CSML), Istituto Italiano di Tecnologia, Genova, Italy; Department of Computer Science, University College London, United Kingdom.
  • Tim Hahn
  • Andreas J Fallgatter
    Department of Psychiatry and Psychotherapy, University Hospital Tuebingen, Germany.
  • John Shawe-Taylor
    Department of Computer Science, University College London, London, United Kingdom.
  • Janaina Mourão-Miranda
    Centre for Medical Image Computing, Department of Computer Science, University College London, London, United Kingdom; Max Planck University College London Centre for Computational Psychiatry and Ageing Research, University College London, London, United Kingdom.