Development of Venous Thromboembolism Risk Prediction Models Based on Whole Blood Gene Expression Profiling Using 20 Machine Learning Algorithms: Comprehensive Analysis Study.

Journal: JMIR medical informatics
Published Date:

Abstract

BACKGROUND: There is a lack of venous thromboembolism (VTE) risk prediction models based on gene expression information. OBJECTIVE: This study aimed to construct a VTE prediction model based on whole blood gene expression profiling, by performing a comprehensive analysis of 20 machine learning (ML) algorithms. METHODS: Two transcriptome datasets containing patients with VTE and healthy controls were obtained by searching the Gene Expression Omnibus database and used as the training and validation sets, respectively. Feature selection for model construction was performed on the training set using the least absolute shrinkage and selection operator and random forest, followed by the selection of the intersection of the chosen features. Subsequently, recursive feature elimination was applied to further refine the selected features. The selected features underwent model construction using 20 ML algorithms. The performance of the models was evaluated using various methods such as receiver operating characteristic and confusion matrix. The validation set was used for external model validation. RESULTS: The final results demonstrated that all algorithm models, except for k-nearest neighbor, exhibited good performance in VTE prediction. External validation data indicated that 9 algorithm models had an area under the curve greater than 0.75. The confusion matrix analysis revealed that the algorithm models maintained high specificity in the external validation cohort. CONCLUSIONS: This study used 20 ML algorithms to construct VTE prediction models based on whole blood gene expression information, with 9 of these models demonstrating good diagnostic performance in external validation cohorts. The above models, when used in conjunction with D-dimer, may provide more valuable references for VTE diagnosis.

Authors

  • Yedong Huang
    Department of Radiation Oncology, Clinical Oncology School of Fujian Medical University, Fujian Cancer Hospital, Fuzhou, Fujian, China.
  • Xiaoyun Chen
    State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Guangdong Provincial Clinical Research Center for Ocular Diseases, Guangzhou, China.
  • Guannan Bai
    Children's Hospital, Zhejiang University School of Medicine, Hangzhou, China.
  • Yajun Zhao
    Department of Health Management Centre, Zhongshan Hospital, Fudan University, Shanghai, China.
  • Dapeng Kuang
    Department of Emergency and Critical Care, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China.
  • Lin Zhang
    Laboratory of Molecular Translational Medicine, Centre for Translational Medicine, Key Laboratory of Birth Defects and Related Diseases of Women and Children, Ministry of Education, Clinical Research Center for Birth Defects of Sichuan Province, West China Second Hospital, Sichuan University, Chengdu, Sichuan, 610041, China. Electronic address: [email protected].
  • Wei Lu
    Department of Pharmacy, Taihe Hospital, Hubei University of Medicine, Shiyan, China.