Assessing citation integrity in biomedical publications: corpus annotation and NLP models.
Journal:
Bioinformatics (Oxford, England)
PMID:
38924508
Abstract
MOTIVATION: Citations have a fundamental role in scholarly communication and assessment. Citation accuracy and transparency is crucial for the integrity of scientific evidence. In this work, we focus on quotation errors, errors in citation content that can distort the scientific evidence and that are hard to detect for humans. We construct a corpus and propose natural language processing (NLP) methods to identify such errors in biomedical publications.