Interpretable Classification of Bacterial Raman Spectra With Knockoff Wavelets.

Journal: IEEE journal of biomedical and health informatics

Published Date: Feb 4, 2022

Abstract

Deep neural networks and other machine learning models are widely applied to biomedical signal data because they can detect complex patterns and compute accurate predictions. However, the difficulty of interpreting such models is a limitation, especially for applications involving high-stakes decision, including the identification of bacterial infections. This paper considers fast Raman spectroscopy data and demonstrates that a logistic regression model with carefully selected features achieves accuracy comparable to that of neural networks, while being much simpler and more transparent. Our analysis leverages wavelet features with intuitive chemical interpretations, and performs controlled variable selection with knockoffs to ensure the predictors are relevant and non-redundant. Although we focus on a particular data set, the proposed approach is broadly applicable to other types of signal data for which interpretability may be important.

Authors

Charmaine Chia
Matteo Sesia
Chi-Sing Ho

Dept. of Applied Physics, Stanford University, Stanford, CA, USA. csho@alumni.stanford.edu.
Stefanie S Jeffrey

3Department of Surgery, Stanford University School of Medicine, MSLS Bldg, 1201 Welch Road, Stanford, CA 94305 USA.
Jennifer Dionne

Pumpkinseed Technologies, Palo Alto, CA, United States.
Emmanuel J Candes
Roger T Howe

Keywords

Humans Logistic Models Machine Learning Neural Networks, Computer

External Resources

View on PubMed Access via DOI PubMed (34232897)

Interpretable Classification of Bacterial Raman Spectra With Knockoff Wavelets.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals