Constructing and implementing a performance evaluation indicator set for artificial intelligence decision support systems in pediatric outpatient clinics: an observational study.

Journal: Scientific reports
Published Date:

Abstract

Artificial intelligence (AI) decision support systems in pediatric healthcare have a complex application background. As an AI decision support system (AI-DSS) can be costly, once applied, it is crucial to focus on its performance, interpret its success, and then monitor and update it to ensure ongoing success consistently. Therefore, a set of evaluation indicators was explicitly developed for AI-DSS in pediatric healthcare, enabling continuous and systematic performance monitoring. The study unfolded in two stages. The first stage encompassed establishing the evaluation indicator set through a literature review, a focus group interview, and expert consultation using the Delphi method. In the second stage, weight analysis was conducted. Subjective weights were calculated based on expert opinions through analytic hierarchy process, while objective weights were determined using the entropy weight method. Subsequently, subject and object weights were synthesized to form the combined weight. In the two rounds of expert consultation, the authority coefficients were 0.834 and 0.846, Kendall's coordination coefficient was 0.135 in Round 1 and 0.312 in Round 2. The final evaluation indicator set has three first-class indicators, fifteen second-class indicators, and forty-seven third-class indicators. Indicator I-1(Organizational performance) carries the highest weight, followed by Indicator I-2(Societal performance) and Indicator I-3(User experience performance) in the objective and combined weights. Conversely, 'Societal performance' holds the most weight among the subjective weights, followed by 'Organizational performance' and 'User experience performance'. In this study, a comprehensive and specialized set of evaluation indicators for the AI-DSS in the pediatric outpatient clinic was established, and then implemented. Continuous evaluation still requires long-term data collection to optimize the weight proportions of the established indicators.

Authors

  • Yingwen Wang
    Nursing Department, Children's Hospital of Fudan University, Shanghai, China.
  • Weijia Fu
    Medical Information Center, Children's Hospital of Fudan University, Shanghai, China.
  • Yuejie Zhang
    School of Computer Science, Fudan University, Shanghai, China.
  • Daoyang Wang
    School of Computer Science, Fudan University, Shanghai, China.
  • Ying Gu
    Department of Radiation Oncology, Jinling Hospital, Nanjing, Jiangsu, 210002, China.
  • Weibing Wang
    Key Laboratory of Public Health Safety of the Ministry of Education and National Health Commission, Key Laboratory of Health Technology Assessment, School of Public Health, Fudan University, Shanghai 200438, China.
  • Hong Xu
    Department of Neurosurgery, Changshu Hospital Affiliated to Soochow University, Changshu, China.
  • Xiaoling Ge
    Statistical and Data Management Center, Children's Hospital of Fudan University, Shanghai, China.
  • Chengjie Ye
    Medical Information Center, Children's Hospital of Fudan University, Shanghai, China.
  • Jinwu Fang
    School of Public Health, Fudan University, Shanghai, China.
  • Ling Su
    Department of Infusion Room of Emergency, Children's Hospital of Nanjing Medical University, Nanjing, 210000 Jiangsu Province, China.
  • Jiayu Wang
    Department of Cardiology, the Second Hospital of Shandong University, 250033 Jinan, Shandong, China.
  • Wen He
    Department of Ultrasound, Beijing Tian Tan Hospital, Capital Medical University, Beijing 100070, China.
  • Xiaobo Zhang
    School of Chemistry and Chemical Engineering, Shandong University of Technology, Zibo 255049, P. R. China. liyueyun@sdut.edu.cn.
  • Rui Feng
    Department of Pharmacy, The Fourth Hospital of Hebei Medical University Shijiazhuang 050000, Hebei, China.