A clinical trials corpus annotated with UMLS entities to enhance the access to evidence-based medicine.

Journal: BMC medical informatics and decision making
Published Date:

Abstract

BACKGROUND: The large volume of medical literature makes it difficult for healthcare professionals to keep abreast of the latest studies that support Evidence-Based Medicine. Natural language processing enhances the access to relevant information, and gold standard corpora are required to improve systems. To contribute with a new dataset for this domain, we collected the Clinical Trials for Evidence-Based Medicine in Spanish (CT-EBM-SP) corpus.

Authors

  • Leonardo Campillos-Llanos
    Computational Linguistics Laboratory, Universidad Autónoma de Madrid, C/Francisco Tomás y Valiente 1. Cantoblanco Campus, 28049, Madrid, Spain. leonardo.campillos@uam.es.
  • Ana Valverde-Mateos
    Medical Terminology Unit, Spanish Royal Academy of Medicine., C/Arrieta 12, 28013, Madrid, Spain.
  • Adrián Capllonch-Carrión
    Complejo Asistencial Hospital Benito Menni., C/Jardines 1, 28350, Ciempozuelos, Madrid, Spain.
  • Antonio Moreno-Sandoval
    Computational Linguistics Laboratory, Universidad Autónoma de Madrid, C/Francisco Tomás y Valiente 1. Cantoblanco Campus, 28049, Madrid, Spain.