Prostate Cancer Risk Prediction and Online Calculation Based on Machine Learning Algorithm.

Journal: Chinese medical sciences journal = Chung-kuo i hsueh k'o hsueh tsa chih
Published Date:

Abstract

Objective To build a prostate cancer (PCa) risk prediction model based on common clinical indicators to provide a theoretical basis for the diagnosis and treatment of PCa and to evaluate the value of artificial intelligence (AI) technology under healthcare data platforms. Methods After preprocessing of the data from Population Health Data Archive, smuothly clipped absolute deviation (SCAD) was used to select features. Random forest (RF), support vector machine (SVM), back propagation neural network (BP), and convolutional neural network (CNN) were used to predict the risk of PCa, among which BP and CNN were used on the enhanced data by SMOTE. The performances of models were compared using area under the curve (AUC) of the receiving operating characteristic curve. After the optimal model was selected, we used the Shiny to develop an online calculator for PCa risk prediction based on predictive indicators. Results Inorganic phosphorus, triglycerides, and calcium were closely related to PCa in addition to the volume of fragmented tissue and free prostate-specific antigen (PSA). Among the four models, RF had the best performance in predicting PCa (accuracy: 96.80%; AUC: 0.975, 95% : 0.964-0.986). Followed by BP (accuracy: 85.36%; AUC: 0.892, 95% : 0.849-0.934) and SVM (accuracy: 82.67%; AUC: 0.824, 95% : 0.805-0.844). CNN performed worse (accuracy: 72.37%; AUC: 0.724, 95% : 0.670-0.779). An online platform for PCa risk prediction was developed based on the RF model and the predictive indicators. Conclusions This study revealed the application value of traditional machine learning and deep learning models in disease risk prediction under healthcare data platform, proposed new ideas for PCa risk prediction in patients suspected for PCa and had undergone core needle biopsy. Besides, the online calculation may enhance the practicability of AI prediction technology and facilitate medical diagnosis.

Authors

  • Chun Wang
    Department of Obstetrics and Gynecology, Peking University Shenzhen Hospital, Shenzhen, China.
  • Qin-Xue Chang
    Department of Health Statistics, School of Public Health, Tianjin Medical University, Tianjin 300070, China.
  • Xiao-Meng Wang
    Department of Health Statistics, School of Public Health, Tianjin Medical University, Tianjin 300070, China.
  • Ke-Yun Wang
    Department of Health Statistics, School of Public Health, Tianjin Medical University, Tianjin 300070, China.
  • He Wang
    Department of Neurosurgery, Xuanwu Hospital, Capital Medical University, China International Neuroscience Institute, Beijing, China.
  • Zhuang Cui
    Department of Health Statistics, College of Public Health, Tianjin Medical University, Heping District, Tianjin, P.R. China.
  • Chang-Ping Li
    Department of Health Statistics, School of Public Health, Tianjin Medical University, Tianjin 300070, China.