Assessing citation integrity in biomedical publications: corpus annotation and NLP models.

Journal: Bioinformatics (Oxford, England)
PMID:

Abstract

MOTIVATION: Citations play a fundamental role in scholarly communication and assessment. Citation accuracy and transparency are crucial for the integrity of scientific evidence. In this work, we focus on quotation errors: errors in citation content that can distort the scientific evidence and that are hard for humans to detect. We construct a corpus and propose natural language processing (NLP) methods to identify such errors in biomedical publications.

Authors

  • Maria Janina Sarol
    Informatics Programs, University of Illinois Urbana-Champaign, Champaign, IL 61820, United States.
  • Shufan Ming
    School of Information Sciences, University of Illinois Urbana-Champaign, 501 E Daniel St., Champaign, 61820, IL, USA.
  • Shruthan Radhakrishna
    Department of Computer Science, University of Illinois Urbana-Champaign, Champaign, IL 61801, United States.
  • Jodi Schneider
    School of Information Sciences, University of Illinois at Urbana-Champaign, Champaign, IL, USA.
  • Halil Kilicoglu
    School of Information Sciences, University of Illinois Urbana-Champaign, Champaign, IL 61820, United States.