Indexing Publicly Available Health Data with Medical Subject Headings (MeSH): An Evaluation of Term Coverage.
Journal:
Studies in health technology and informatics
Published Date:
Jan 1, 2015
Abstract
As part of the Open Government Initiative, the United States federal government published datasets to increase collaboration, transparency, consumer participation, and research, and are available online at HealthData.gov. Currently, HealthData.gov does not adequately support the accessibility goal of the Open Government Initiative due to issues of retrieving relevant data because of inadequately cataloguing and lack of indexing with a standardized terminology. Given the commonalities between the HealthData.gov and MEDLINE metadata, Medical Subject Headings (MeSH) may offer an indexing solution, but there needs to be a formal evaluation of the efficacy of MeSH for covering the dataset concepts. The purpose of this study was to determine if MeSH adequately covers the HealthData.gov concepts. The noun and noun phrases from the HealthData.gov metadata were extracted and mapped to MeSH using MetaMap. The frequency of no exact, partical and no matches with MeSH terms were determined. The results of this study revealed that about 70% of the HealthData.gov concepts partially or exactly matched MeSH terms. Therefore, MeSH may be a favorable terminology for indexing the HealthData.gov datasets.