Screening for depressive symptoms in primary and secondary school students based on speech features: A one-year longitudinal study from Jiangsu, China.

Journal: Journal of affective disorders
Published Date:

Abstract

OBJECTIVE: This study aims to develop and validate an interpretable prediction model for recognizing depression risk in primary and secondary school students based on acoustic features of speech using multiple machine learning methods. METHODS: This study focused on the Jiangsu Province School-based Evaluation Advancing Response for Child Health (SEARCH) program. The baseline survey was conducted on 10,926 individuals in December 2022. Longitudinal follow-up was conducted with 4321 students who had no depressive symptoms at baseline and were available for the second and third follow-up surveys conducted in June 2023 and December 2023, respectively. Four machine learning methods were used, and the Shapley additive explanations (SHAP) method was used to rank and interpret the significance of acoustic speech features. Separate prediction models were constructed to assess the risk of depression in primary and secondary school students based on the number and types of acoustic speech features. The final prediction model was selected based on the optimal number of acoustic speech features. Additionally, participants were analyzed through subgroup stratification by academic level and gender, with the predictive effectiveness of the final model assessed across different groups. RESULTS: Of the 4321 students without depressive symptoms at baseline, 452 (10.5 %) had developed symptoms by the second follow-up, and 512 (11.8 %) by the third. Of the four machine learning models tested, the Light Gradient Boosting Machine (LightGBM) achieved the best validation performance, with an AUC of 0.718 (95 % CI: 0.695-0.741), a PPV of 0.940, and an NPV of 0.196. Feature selection yielded an interpretable LightGBM model based on ten acoustic predictors. SHAP analysis indicated that the most influential were spectral features (mel spectrograms, spectral contrast, and MFCCs), with one additional prosodic feature. Validation of the final model indicated stable performance (AUC 0.699, 95 % CI: 0.675-0.722). Subgroup analysis further revealed higher predictive efficacy in primary school students than in middle and high school students. CONCLUSIONS: Acoustic speech features can effectively identify depression risk in primary and secondary school students and may provide a novel approach for individualized assessment of depressive symptom risk in student populations.

Authors

  • Jiawen Tang
    Department of Pathology, The Affiliated People's Hospital of Ningbo University, Ningbo, Zhejiang, China.
  • Xin Wang
    Key Laboratory of Bio-based Material Science & Technology (Northeast Forestry University), Ministry of Education, Harbin 150040, China.
  • Wendi Zhu
    School of Public Health, Nanjing Medical University, Nanjing, China; Department of Child and Adolescent Health Promotion, Jiangsu, Provincial Center for Disease Control and Prevention, Nanjing, China.
  • Wenyi Yang
    School of Software, Central South University, Changsha, 410075, China.
  • Yao Xiang
    Department of Child and Adolescent Health Promotion, Jiangsu, Provincial Center for Disease Control and Prevention, Nanjing, China.
  • Yan Wang
    College of Animal Science and Technology, Beijing University of Agriculture, Beijing, China.
  • Xiyan Zhang
    Department of Child and Adolescent Health Promotion, Jiangsu Provincial Center for Disease Control and Prevention, Nanjing City, Jiangsu Province, China.
  • Fei Wang
    Department of Population Health Sciences, Weill Cornell Medicine, Cornell University, New York, NY, United States.
  • Jie Yang
    Key Laboratory of Development and Maternal and Child Diseases of Sichuan Province, Department of Pediatrics, Sichuan University, Chengdu, China.

Keywords

No keywords available for this article.