Can ChatGPT give holistic and accurate patient-centred information to oncology patients? A mixed-methods evaluation with stakeholders

Journal: medRxiv
Published Date:

Abstract

Abstract Objective More people than ever before are living with cancer. Patient education is a core component of cancer care, and patients are increasingly using large language models (LLMs), such as ChatGPT, for advice. The objectives of this study were to evaluate the ability of ChatGPT to explain specialist cancer care records (multidisciplinary team (MDT) meeting reports) to patients and to understand key stakeholders' views and opinions about the technology. Methods Six simulated MDT meeting reports were created by cancer clinicians. MDT reports and 184 realistic patient-centred queries were input into ChatGPT4.0 web version. Three experiments were conducted to analyse ChatGPT's responses: (1) clinician sense-checking (2) clinician and lay annotation (3) focus groups (including cancer patients, caregivers, computer scientists, and clinicians). Results ChatGPT was able to summarise complex oncology information into simpler language, to provide definitions of complex terms and to answer questions about clinical care. However, clinician sense-checking identified problems with accuracy, language and content. In clinician annotation, 92.6% of ChatGPT's responses were judged problematic. Across all evaluation methods, six recurring themes were identified: accuracy, language, trust, content, personalisation and integration challenges. Patients and clinicians found the summaries and definitions useful; however, the responses were not tailored to the individual patient or to what the report might mean for them. Conclusion This study highlights current challenges in using LLMs to explain complex cancer diagnoses and treatment records, including inaccurate information, inappropriate language, limited personalisation, AI distrust and challenges in integrating LLMs into clinical workflow. Understanding of the limitations is crucial for clinicians, patients, computer scientists and policy makers. The issues should be addressed before deploying LLMs in clinical settings. Keywords: ChatGPT, MDT report, Cancer Care, LLMs evaluation, Focus groups, stakeholders

Authors

  • Sun
  • M.; Reiter
  • E.; Murchie
  • P.; Kiltie
  • A. E.; Ramsay
  • G.; Duncan
  • L.; Adam
  • R.