A generalist vision-language foundation model for diverse biomedical tasks.

Journal: Nature medicine
Published Date:

Abstract

Traditional biomedical artificial intelligence (AI) models, designed for specific tasks or modalities, often exhibit limited flexibility in real-world deployment and struggle to utilize holistic information. Generalist AI holds the potential to address these limitations due to its versatility in interpreting different data types and generating tailored outputs for diverse needs. However, existing biomedical generalist AI solutions are typically heavyweight and closed source to researchers, practitioners and patients. Here, we describe BiomedGPT, the first open-source and lightweight vision-language foundation model, designed as a generalist capable of performing various biomedical tasks. BiomedGPT achieved state-of-the-art results in 16 out of 25 experiments while maintaining a computing-friendly model scale. We also conducted human evaluations to assess the capabilities of BiomedGPT in radiology visual question answering, report generation and summarization. BiomedGPT exhibits robust prediction ability with a low error rate of 3.8% in question answering, satisfactory performance with an error rate of 8.3% in writing complex radiology reports, and competitive summarization ability with a nearly equivalent preference score to human experts. Our method demonstrates that effective training with diverse data can lead to more practical biomedical AI for improving diagnosis and workflow efficiency.

Authors

  • Kai Zhang
    Anhui Province Key Laboratory of Respiratory Tumor and Infectious Disease, First Affiliated Hospital of Bengbu Medical University, Bengbu, China.
  • Rong Zhou
  • Eashan Adhikarla
    Department of Computer Science and Engineering, Lehigh University, Bethlehem, PA, USA.
  • Zhiling Yan
    Department of Computer Science and Engineering, Lehigh University, Bethlehem, PA, USA.
  • Yixin Liu
    Beijing Key Laboratory of Agricultural Genetic Resources and Biotechnology, Beijing Functional Flower Engineering Technology Research Center, Beijing Agro-Biotechnology Research Center, Beijing Academy of Agriculture and Forestry Sciences, Beijing, China.
  • Jun Yu
  • Zhengliang Liu
    School of Computing, University of Georgia, Athens, GA, United States.
  • Xun Chen
    Department of Electronic Science and Technology, University of Science and Technology of China, Hefei 230027, China. xunchen@ece.ubc.ca.
  • Brian D Davison
    Department of Computer Science and Engineering, Lehigh University, Bethlehem, PA, USA.
  • Hui Ren
  • Jing Huang
    Department of Nephrology, The Second Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi, China.
  • Chen Chen
    The George Institute for Global Health, Faculty of Medicine, University of New South Wales, Sydney, NSW, Australia.
  • Yuyin Zhou
    Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA.
  • Sunyang Fu
    Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, USA.
  • Wei Liu
    Department of Radiation Oncology, Mayo Clinic, Scottsdale, AZ, United States.
  • Tianming Liu
    School of Computing, University of Georgia, Athens, GA, United States.
  • Xiang Li
    Department of Radiology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, United States.
  • Yong Chen
    Department of Urology, Chongqing University Fuling Hospital, Chongqing, China.
  • Lifang He
    Department of Healthcare Policy and Research, Weill Cornell Medical College, Cornell University, NY.
  • James Zou
    Department of Biomedical Data Science, Stanford University, Stanford, California.
  • Quanzheng Li
    Department of Radiology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, United States.
  • Hongfang Liu
    Department of Artificial Intelligence & Informatics, Mayo Clinic, Rochester, MN, United States.
  • Lichao Sun
    School of Education, Communication & Society, King's College London, London SE5 9RJ, UK.