BioInstruct: instruction tuning of large language models for biomedical natural language processing.

Journal: Journal of the American Medical Informatics Association : JAMIA
Published Date:

Abstract

OBJECTIVES: To enhance the performance of large language models (LLMs) in biomedical natural language processing (BioNLP) by introducing a domain-specific instruction dataset and examining its impact when combined with multi-task learning principles.

Authors

  • Hieu Tran
    Manning College of Information and Computer Sciences, University of Massachusetts Amherst, Amherst, MA 01003, United States.
  • Zhichao Yang
    Guangdong Provincial Key Laboratory of Advanced Biomaterials, Department of Biomedical Engineering, Southern University of Science and Technology, Shenzhen, China.
  • Zonghai Yao
    Manning College of Information and Computer Sciences, University of Massachusetts Amherst, Amherst, MA 01003, United States.
  • Hong Yu
    University of Massachusetts Medical School, Worcester, MA.