Automatic Processing of Anatomic Pathology Reports in the Italian Language to Enhance the Reuse of Clinical Data.

Journal: Studies in health technology and informatics
Published Date:

Abstract

Medical reports often contain a lot of relevant information in the form of free text. To reuse these unstructured texts for biomedical research, it is important to extract structured data from them. In this work, we adapted a previously developed information extraction system to the oncology domain, to process a set of anatomic pathology reports in the Italian language. The information extraction system relies on a domain ontology, which was adapted and refined in an iterative way. The final output was evaluated by a domain expert, with promising results.

Authors

  • Natalia Viani
    Department of Electrical, Computer and Biomedical Engineering, University of Pavia, Via Ferrata 5, 27100, Pavia, PV, Italy. Electronic address: natalia.viani01@universitadipavia.it.
  • Lorenzo Chiudinelli
    Department of Electrical, Computer and Biomedical Engineering, University of Pavia, Pavia, Italy.
  • Cristina Tasca
    ASST Papa Giovanni XXIII Hospital, Bergamo, Italy.
  • Alberto Zambelli
    ASST Papa Giovanni XXIII Hospital, Bergamo, Italy.
  • Mauro Bucalo
    BIOMERIS, Pavia, Italy.
  • Arianna Ghirardi
    ASST Papa Giovanni XXIII Hospital, Bergamo, Italy.
  • Nicola Barbarini
    BIOMERIS, Pavia, Italy.
  • Eleonora Sfreddo
    ASST Papa Giovanni XXIII Hospital, Bergamo, Italy.
  • Lucia Sacchi
    1 Department of Electrical, Computer and Biomedical Engineering, University of Pavia, Pavia, Italy.
  • Carlo Tondini
    ASST Papa Giovanni XXIII Hospital, Bergamo, Italy.
  • Riccardo Bellazzi
    Department of Electrical, Computer and Biomedical Engineering, University of Pavia, Pavia, Italy.