Prescreening in oncology trials using medical records. Natural language processing applied on lung cancer multidisciplinary team meeting reports.

Journal: Health informatics journal
Published Date:

Abstract

Defining profiles of patients that could benefit from relevant anti-cancer treatments is essential. An increasing number of specific criteria are necessary to be eligible to specific anti-cancer therapies. This study aimed to develop an automated algorithm able to detect patient and tumor characteristics to reduce the time-consuming prescreening for trial inclusions without delay. Hence, 640 anonymized multidisciplinary team meetings (MTM) reports concerning lung cancers from one French teaching hospital data warehouse between 2018 and 2020 were annotated. To automate the extraction of eight major eligibility criteria, corresponding to 52 classes, regular expressions were implemented. The RegEx's evaluation gave a F1-score of 93% in average, a positive predictive value (precision) of 98% and sensitivity (recall) of 92%. However, in MTM, fill rates variabilities among patient and tumor information remained important (from 31% to 100%). Genetic mutations and rearrangement test results were the least reported characteristics and also the hardest to automatically extract. To ease prescreening in clinical trials, the PreScIOUs study demonstrated the additional value of rule based and machine learning based methods applied on lung cancer MTM reports.

Authors

  • Marie Ansoborlo
    CHRU Bretonneau, University Hospital, 2 Boulevard Tonnelé, 37044 Tours, France.
  • Christophe Gaborit
    CHRU Bretonneau, University Hospital, 2 Boulevard Tonnelé, 37044 Tours, France.
  • Leslie Grammatico-Guillon
    Department of medical information, University Hospital of Tours, Tours, France.
  • Marc Cuggia
    Univ Rennes, CHU Rennes, Inserm, LTSI - UMR 1099, F-35000 Rennes, France.
  • Guillaume Bouzille
    Univ Rennes, CHU Rennes, Inserm, LTSI - UMR 1099, F-35000 Rennes, France.