Uncertainty-aware automatic TNM staging classification for [F] Fluorodeoxyglucose PET-CT reports for lung cancer utilising transformer-based language models and multi-task learning.

Journal: BMC medical informatics and decision making
PMID:

Abstract

BACKGROUND: [F] Fluorodeoxyglucose (FDG) PET-CT is a clinical imaging modality widely used in diagnosing and staging lung cancer. The clinical findings of PET-CT studies are contained within free text reports, which can currently only be categorised by experts manually reading them. Pre-trained transformer-based language models (PLMs) have shown success in extracting complex linguistic features from text. Accordingly, we developed a multi-task 'TNMu' classifier to classify the presence/absence of tumour, node, metastasis ('TNM') findings (as defined by The Eight Edition of TNM Staging for Lung Cancer). This is combined with an uncertainty classification task ('u') to account for studies with ambiguous TNM status.

Authors

  • Stephen H Barlow
    School of Biomedical Engineering and Imaging Sciences, King's College London, London, UK. stephen.barlow@kcl.ac.uk.
  • Sugama Chicklore
    School of Biomedical Engineering and Imaging Sciences, King's College London, London, UK.
  • Yulan He
  • Sébastien Ourselin
    Wellcome / EPSRC Centre for Interventional and Surgical Sciences (WEISS), University College London, UK.
  • Thomas Wagner
    Department of Nuclear Medicine, Royal Free Hospital, London, UK.
  • Anna Barnes
    King's Technology Evaluation Centre (KiTEC), School of Biomedical Engineering & Imaging Science, King's College London, London, United Kingdom.
  • Gary J R Cook
    Cancer Imaging Department, School of Biomedical Engineering and Imaging Sciences, King's College London, London, SE1 7EH, UK. gary.cook@kcl.ac.uk.