Explainable artificial intelligence in forensic DNA analysis: Alleles identification in challenging electropherograms using supervised machine learning methods.

Journal: Forensic science international. Genetics
Published Date:

Abstract

Challenging samples in capillary electrophoresis (CE)-based short tandem repeat (STR) analysis often produce artefactual signals that cannot be completely filtered out by expert electropherogram (EPG) reading systems, complicating allele interpretation. Previous studies have demonstrated the potential of artificial intelligence (AI) to address this issue by accurately distinguishing allele signals from artefacts in EPGs. Traditional machine learning models offer significant advantages in enhancing the interpretability and transparency of AI models used in DNA analysis, particularly in criminal investigations and legal contexts. In this study, five traditional machine learning algorithms were employed to train and construct models using EPG signal datasets from single-source low-template EPGs, mixture EPGs, and combined datasets. Performance evaluation and validation with additional datasets demonstrated the feasibility of these models in improving the reportability of potential information in EPGs. However, further optimization is needed for mixture EPGs to enhance classification accuracy. Implementing Receiver Operating Characteristic (ROC) curve analysis and prediction probability thresholds effectively reduced false positive classifications. Additionally, a user-friendly platform was developed for EPG signal classification based on machine learning and ensemble learning, allowing for the classification of any signal datasets using traditional machine learning models and combining the prediction results of multiple models. This platform will provide analysts with more optimal and robust results. This study shows that machine-learning-based EPG signal classification models can significantly enhance the efficiency of sample analysis and interpretation, providing a solid foundation for future research.

Authors

  • Mengyu Tan
    Department of Forensic Genetics, West China School of Basic Medical Sciences and Forensic Medicine, Sichuan University, Chengdu, China.
  • Yuxuan Tan
    Department of Global Health School of Public Health Wuhan University, Wuhan, China; Global Health Institute School of Public Health Wuhan University, Wuhan, China.
  • Haoyan Jiang
    Department of Forensic Genetics, West China School of Basic Medical Sciences and Forensic Medicine, Sichuan University, Chengdu, China.
  • Jiaming Xue
    Department of Forensic Genetics, West China School of Basic Medical Sciences and Forensic Medicine, Sichuan University, Chengdu, China.
  • Qiushuo Wu
    Department of Forensic Genetics, West China School of Basic Medical Sciences and Forensic Medicine, Sichuan University, Chengdu, China.
  • Yazi Zheng
    Department of Forensic Genetics, West China School of Basic Medical Sciences and Forensic Medicine, Sichuan University, Chengdu, China.
  • Guihong Liu
    Department of Forensic Genetics, West China School of Basic Medical Sciences and Forensic Medicine, Sichuan University, Chengdu, China.
  • Yuanyuan Xiao
    Division of Epidemiology and Health Statistics, School of Public Health, Kunming Medical University, Kunming, China. 33225647@qq.com.
  • Meili Lv
    Department of Immunology, West China School of Basic Medical Sciences and Forensic Medicine, Sichuan University, Chengdu, Sichuan 610041, China.
  • Miao Liao
    Department of Biomedical and Information Engineering, Central South University, Changsha 410083, China.
  • Lin Zhang
    Laboratory of Molecular Translational Medicine, Centre for Translational Medicine, Key Laboratory of Birth Defects and Related Diseases of Women and Children, Ministry of Education, Clinical Research Center for Birth Defects of Sichuan Province, West China Second Hospital, Sichuan University, Chengdu, Sichuan, 610041, China. Electronic address: zhanglin@scu.edu.cn.
  • Shengqiu Qu
    Department of Forensic Genetics, West China School of Basic Medical Sciences and Forensic Medicine, Sichuan University, Chengdu, China. Electronic address: qushengqiu@scu.edu.cn.
  • Weibo Liang
    Department of Forensic Genetics, West China School of Preclinical and Forensic Medicine, Sichuan University, Chengdu, 610041, Sichuan, People's Republic of China.