Acquiring Plausible Predications from MEDLINE by Clustering MeSH Annotations.

Journal: Studies in health technology and informatics
Published Date:

Abstract

The massive accumulation of biomedical knowledge is reflected by the growth of the literature database MEDLINE with over 23 million bibliographic records. All records are manually indexed by MeSH descriptors, many of them refined by MeSH subheadings. We use subheading information to cluster types of MeSH descriptor co-occurrences in MEDLINE by processing co-occurrence information provided by the UMLS. The goal is to infer plausible predicates to each resulting cluster. In an initial experiment this was done by grouping disease-pharmacologic substance co-occurrences into six clusters. Then, a domain expert manually performed the assignment of meaningful predicates to the clusters. The mean accuracy of the best ten generated biomedical facts of each cluster was 85%. This result supports the evidence of the potential of MeSH subheadings for extracting plausible medical predications from MEDLINE.

Authors

  • Jose Antonio Miñarro-Giménez
    Institute of Medical Informatics, Statistics, and Documentation, Medical University of Graz, Austria.
  • Markus Kreuzthaler
    Institute of Medical Informatics, Statistics, and Documentation, Medical University of Graz, Austria.
  • Johannes Bernhardt-Melischnig
    Institute of Medical Informatics, Statistics, and Documentation, Medical University of Graz, Austria.
  • Catalina Martínez-Costa
    Institute for Medical Informatics, Statistics and Documentation, Medical University of Graz, Austria.
  • Stefan Schulz
    Institute for Medical Informatics, Statistics and Documentation, Medical University of Graz, Austria.