Automatic selection of spoken language biomarkers for dementia detection.

Journal: Neural networks : the official journal of the International Neural Network Society
Published Date:

Abstract

This paper analyzes diverse features extracted from spoken language to select the most discriminative ones for dementia detection. We present a two-step feature selection (FS) approach: Step 1 utilizes filter methods to pre-screen features, and Step 2 uses a novel feature ranking (FR) method, referred to as dual dropout ranking (DDR), to rank the screened features and select spoken language biomarkers. The proposed DDR is based on a dual-net architecture that separates FS and dementia detection into two neural networks (namely, the operator and selector). The operator is trained on features obtained from the selector to reduce classification or regression loss. The selector is optimized to predict the operator's performance based on automatic regularization. Results show that the approach significantly reduces feature dimensionality while identifying small feature subsets that achieve comparable or superior performance compared with the full, default feature set. The Python codes are available at https://github.com/kexquan/dual-dropout-ranking.

Authors

  • Xiaoquan Ke
    Department of Electronic and Information Engineering, The Hong Kong Polytechnic University, Hong Kong Special Administrative Region. Electronic address: xiaoquan.ke@connect.polyu.hk.
  • Man Wai Mak
    Department of Electronic and Information Engineering, The Hong Kong Polytechnic University, Hong Kong Special Administrative Region. Electronic address: enmwmak@polyu.edu.hk.
  • Helen M Meng
    Department of Systems Engineering and Engineering Management, The Chinese University of Hong Kong, Hong Kong Special Administrative Region. Electronic address: hmmeng@cuhk.edu.hk.