MediAlbertina: An European Portuguese medical language model.
Journal:
Computers in biology and medicine
PMID:
39362002
Abstract
BACKGROUND: Patient medical information often exists in unstructured text containing abbreviations and acronyms deemed essential to conserve time and space but posing challenges for automated interpretation. Leveraging the efficacy of Transformers in natural language processing, our objective was to use the knowledge acquired by a language model and continue its pre-training to develop an European Portuguese (PT-PT) healthcare-domain language model.