Extracting lung cancer staging descriptors from pathology reports: A generative language model approach.

Journal: Journal of biomedical informatics
Published Date:

Abstract

BACKGROUND: In oncology, electronic health records contain textual key information for the diagnosis, staging, and treatment planning of patients with cancer. However, text data processing requires a lot of time and effort, which limits the utilization of these data. Recent advances in natural language processing (NLP) technology, including large language models, can be applied to cancer research. Particularly, extracting the information required for the pathological stage from surgical pathology reports can be utilized to update cancer staging according to the latest cancer staging guidelines.

Authors

  • Hyeongmin Cho
    ezCaretech Research & Development Center, Jung-gu, Seoul, Republic of Korea.
  • Sooyoung Yoo
    Office of eHealth Research and Business, Seoul National University Bundang Hospital, Seongnam, Republic of Korea.
  • Borham Kim
    Office of eHealth Research and Business, Seoul National University Bundang Hospital, Seongnam, Republic of Korea.
  • Sowon Jang
    From the Department of Radiology, Seoul National University Bundang Hospital, 300 Gumi-dong, Bundang-gu, Seongnam-si, Gyeonggi-do 13620, Korea (S.J., H.S., Junghoon Kim, Jihang Kim, K.W.L., S.S.L., K.H.L.); Department of Radiology, Konkuk University Medical Center, Seoul, Korea (Y.J.S.); Seoul National University College of Medicine, Institute of Radiation Medicine, Seoul National University Medical Research Center, Seoul, Korea (K.W.L.); Department of Public Health Science, Graduate School of Public Health, Seoul National University, Seoul, Korea (W.L.); and Program in Biomedical Radiation Sciences, Department of Transdisciplinary Studies, Graduate School of Convergence Science and Technology, Seoul National University, Seoul, Korea (S.L.).
  • Leonard Sunwoo
    Department of Radiology, Seoul National University Bundang Hospital, 82, Gumi-ro 173 Beon-gil, Bundang-gu, Seongnam-si, Gyeonggi-do 13620, Republic of Korea.
  • Sanghwan Kim
    ezCaretech Research & Development Center, Jung-gu, Seoul, Republic of Korea.
  • Donghyoung Lee
    ezCaretech Research & Development Center, Jung-gu, Seoul, Republic of Korea.
  • Seok Kim
    Office of eHealth Research and Business, Seoul National University Bundang Hospital, Seongnam, Republic of Korea.
  • SeJin Nam
    National Center of Excellence in Software, Chungnam National University, 99 Daehak-ro, Yuseong-gu, Daejeon, 34134, Republic of Korea.
  • Jin-Haeng Chung
    Department of Pathology, Seoul National University College of Medicine, Seoul National University Bundang Hospital, Seongnam, Korea (the Republic of).