Comparison of Feature Selection Techniques in Machine Learning for Anatomical Brain MRI in Dementia.

Journal: Neuroinformatics
Published Date:

Abstract

We present a comparative split-half resampling analysis of various data driven feature selection and classification methods for the whole brain voxel-based classification analysis of anatomical magnetic resonance images. We compared support vector machines (SVMs), with or without filter based feature selection, several embedded feature selection methods and stability selection. While comparisons of the accuracy of various classification methods have been reported previously, the variability of the out-of-training sample classification accuracy and the set of selected features due to independent training and test sets have not been previously addressed in a brain imaging context. We studied two classification problems: 1) Alzheimer's disease (AD) vs. normal control (NC) and 2) mild cognitive impairment (MCI) vs. NC classification. In AD vs. NC classification, the variability in the test accuracy due to the subject sample did not vary between different methods and exceeded the variability due to different classifiers. In MCI vs. NC classification, particularly with a large training set, embedded feature selection methods outperformed SVM-based ones with the difference in the test accuracy exceeding the test accuracy variability due to the subject sample. The filter and embedded methods produced divergent feature patterns for MCI vs. NC classification that suggests the utility of the embedded feature selection for this problem when linked with the good generalization performance. The stability of the feature sets was strongly correlated with the number of features selected, weakly correlated with the stability of classification accuracy, and uncorrelated with the average classification accuracy.

Authors

  • Jussi Tohka
    Department of Bioengineering and Aerospace Engineering, Universidad Carlos III de Madrid, Leganes, Spain; Instituto de Investigación Sanitaria Gregorio Marañon, Madrid, Spain; University of Eastern Finland, AI Virtanen Institute for Molecular Sciences, Kuopio, Finland.
  • Elaheh Moradi
    Institute of Biosciences and Medical Technology, University of Tampere, Tampere, Finland.
  • Heikki Huttunen
    Department of Signal Processing, Tampere University of Technology, P.O. Box 553, 33101, Tampere, Finland.