Fangorn Forest (F2): a machine learning approach to classify genes and genera in the family Geminiviridae.

Journal: BMC bioinformatics
Published Date:

Abstract

BACKGROUND: Geminiviruses infect a broad range of cultivated and non-cultivated plants, causing significant economic losses worldwide. The studies of the diversity of species, taxonomy, mechanisms of evolution, geographic distribution, and mechanisms of interaction of these pathogens with the host have greatly increased in recent years. Furthermore, the use of rolling circle amplification (RCA) and advanced metagenomics approaches have enabled the elucidation of viromes and the identification of many viral agents in a large number of plant species. As a result, determining the nomenclature and taxonomically classifying geminiviruses turned into complex tasks. In addition, the gene responsible for viral replication (particularly, the viruses belonging to the genus Mastrevirus) may be spliced due to the use of the transcriptional/splicing machinery in the host cells. However, the current tools have limitations concerning the identification of introns.

Authors

  • Jose Cleydson F Silva
    Departamento de Informática, Universidade Federal de Viçosa, Viçosa, Brazil.
  • Thales F M Carvalho
    Departamento de Informática, Universidade Federal de Viçosa, Viçosa, Brazil.
  • Elizabeth P B Fontes
    National Institute of Science and Technology in Plant-Pest Interactions/BIOAGRO, Universidade Federal de Viçosa, Viçosa, Brazil. bbfontes@ufv.br.
  • Fabio R Cerqueira
    Departamento de Informática, Universidade Federal de Viçosa, Viçosa, Brazil.