Impact of high-quality, mixed-domain data on the performance of medical language models.

Journal: Journal of the American Medical Informatics Association: JAMIA

Abstract

OBJECTIVE: To optimize the training strategy of large language models for medical applications, focusing on creating clinically relevant systems that efficiently integrate into healthcare settings, while ensuring high standards of accuracy and reliability.

Authors

  • Maxime Griot
    Institute of NeuroScience, Université catholique de Louvain, Brussels, 1200, Belgium.
  • Coralie Hemptinne
    Ophthalmology, Cliniques Universitaires Saint-Luc, Brussels, 1200, Belgium.
  • Jean Vanderdonckt
    Louvain Research Institute in Management and Organizations, Université catholique de Louvain, Louvain-la-Neuve, 1348, Belgium.
  • Demet Yuksel
    Institute of NeuroScience, Université catholique de Louvain, Brussels, 1200, Belgium.