Evaluating the ChatGPT family of models for biomedical reasoning and classification.

Journal: Journal of the American Medical Informatics Association : JAMIA
Published Date:

Abstract

OBJECTIVE: Large language models (LLMs) have shown impressive ability in biomedical question-answering but have not been adequately investigated for more specific biomedical applications. This study investigates the ChatGPT family of models (GPT-3.5, GPT-4) in biomedical tasks beyond question-answering.

Authors

  • Shan Chen
    National Academy of Economic Security, Beijing Jiaotong University, Beijing 100044, China.
  • Yingya Li
    Computational Health Informatics Program, Boston Children's Hospital, and Harvard Medical School, Boston, MA 02115, United States.
  • Sheng Lu
    Department of General Surgery, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, PR China. Electronic address: lusheng@vip.126.com.
  • Hoang Van
    Computational Health Informatics Program, Boston Children's Hospital, and Harvard Medical School, Boston, MA 02115, United States.
  • Hugo J W L Aerts
    Department of Radiation Oncology, Brigham and Women's Hospital, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, United States.
  • Guergana K Savova
    Department of Pediatrics, Children's Hospital of Boston, Boston.
  • Danielle S Bitterman
    Department of Radiation Oncology, Brigham and Women's Hospital, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, United States.