ADT Network: A Novel Nonlinear Method for Decoding Speech Envelopes From EEG Signals.

Journal: Trends in hearing

PMID: 39397786

Abstract

Decoding speech envelopes from electroencephalogram (EEG) signals holds potential as a research tool for objectively assessing auditory processing, which could contribute to future developments in hearing loss diagnosis. However, current methods struggle to meet both high accuracy and interpretability. We propose a deep learning model called the auditory decoding transformer (ADT) network for speech envelope reconstruction from EEG signals to address these issues. The ADT network uses spatio-temporal convolution for feature extraction, followed by a transformer decoder to decode the speech envelopes. Through anticausal masking, the ADT considers only the current and future EEG features to match the natural relationship of speech and EEG. Performance evaluation shows that the ADT network achieves average reconstruction scores of 0.168 and 0.167 on the SparrKULee and DTU datasets, respectively, rivaling those of other nonlinear models. Furthermore, by visualizing the weights of the spatio-temporal convolution layer as time-domain filters and brain topographies, combined with an ablation study of the temporal convolution kernels, we analyze the behavioral patterns of the ADT network in decoding speech envelopes. The results indicate that low- (0.5-8 Hz) and high-frequency (14-32 Hz) EEG signals are more critical for envelope reconstruction and that the active brain regions are primarily distributed bilaterally in the auditory cortex, consistent with previous research. Visualization of attention scores further validated previous research. In summary, the ADT network balances high performance and interpretability, making it a promising tool for studying neural speech envelope tracking.

Authors

Ruixiang Liu

School of Intelligent Medicine, China Medical University, Shenyang, China.
Chang Liu

Key Lab of Cell Differentiation and Apoptosis of Ministry of Education, Shanghai Jiao Tong University School of Medicine, Shanghai, China.
Dan Cui

Shengjing Hospital of China Medical University, Shenyang, China.
Huan Zhang

Department of Plant Protection, Zhejiang University, 866 Yuhangtang Road, 5 Hangzhou 310058, China.
Xinmeng Xu

School of Intelligent Medicine, China Medical University, Shenyang, China.
Yuxin Duan

School of Intelligent Medicine, China Medical University, Shenyang, China.
Yihu Chao

School of Intelligent Medicine, China Medical University, Shenyang, China.
Xianzheng Sha

Department of Biomedical Engineering, School of Fundamental Sciences, China Medical University, Shenyang 110122, Liaoning Province, China.
Limin Sun

Department of Bridge Engineering, College of Civil Engineering, Tongji University, Shanghai 200092, China.
Xiulan Ma

Department of Otolaryngology-Head and Neck Surgery, Shengjing Hospital of China Medical University, Shenyang, China.
Shuo Li

Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, China.
Shijie Chang

Department of Biomedical Engineering, China Medical University, Shenyang, 110122.

Keywords

Acoustic Stimulation Auditory Cortex Deep Learning Electroencephalography Humans Neural Networks, Computer Nonlinear Dynamics Signal Processing, Computer-Assisted Speech Acoustics Speech Perception

External Resources

View on PubMed Access via DOI PubMed (39397786)

ADT Network: A Novel Nonlinear Method for Decoding Speech Envelopes From EEG Signals.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals