The impact of transitive annotation on the training of taxonomic classifiers.

Journal: Frontiers in microbiology

Published Date: Jan 3, 2024

Abstract

INTRODUCTION: A common task in the analysis of microbial communities involves assigning taxonomic labels to the sequences derived from organisms found in the communities. Frequently, such labels are assigned using machine learning algorithms that are trained to recognize individual taxonomic groups based on training data sets that comprise sequences with known taxonomic labels. Ideally, the training data should rely on labels that are experimentally verified-formal taxonomic labels require knowledge of physical and biochemical properties of organisms that cannot be directly inferred from sequence alone. However, the labels associated with sequences in biological databases are most commonly computational predictions which themselves may rely on computationally-generated data-a process commonly referred to as "transitive annotation."

Authors

Harihara Subrahmaniam Muralidharan

Department of Computer Science, University of Maryland, College Park, MD, United States.
Noam Y Fox

Department of Computer Science, University of Maryland, College Park, MD, United States.
Mihai Pop

Department of Computer Science, University of Maryland, College Park, MD, United States.

Keywords

No keywords available for this article.

External Resources

View on PubMed Access via DOI PubMed (38235435)

The impact of transitive annotation on the training of taxonomic classifiers.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals