Multi-class boosting for the analysis of multiple incomplete views on microbiome data.

Journal: BMC bioinformatics
Published Date:

Abstract

BACKGROUND: Microbiome dysbiosis has recently been associated with different diseases and disorders. In this context, machine learning (ML) approaches can be useful either to identify new patterns or learn predictive models. However, data to be fed to ML methods can be subject to different sampling, sequencing and preprocessing techniques. Each different choice in the pipeline can lead to a different view (i.e., feature set) of the same individuals, that classical (single-view) ML approaches may fail to simultaneously consider. Moreover, some views may be incomplete, i.e., some individuals may be missing in some views, possibly due to the absence of some measurements or to the fact that some features are not available/applicable for all the individuals. Multi-view learning methods can represent a possible solution to consider multiple feature sets for the same individuals, but most existing multi-view learning methods are limited to binary classification tasks or cannot work with incomplete views.

Authors

  • Andrea Simeon
    BioSense Institute, University of Novi Sad, dr Zorana Djindjića 1, Novi Sad, 21000, Serbia. andrea.simeon@biosense.rs.
  • Miloš Radovanović
    Faculty of Sciences, University of Novi Sad, Trg Dositeja Obradovića 3, Novi Sad, 21000, Serbia.
  • Tatjana Lončar-Turukalo
    Faculty of Technical Sciences, University of Novi Sad, Trg Dositeja Obradovića 6, Novi Sad, 21000, Serbia.
  • Michelangelo Ceci
    Dept. of Computer Science, University of Bari Aldo Moro, Via Orabona 4, 70125 Bari, Italy.
  • Sanja Brdar
    BioSensе Institute - Research Institute for Information Technologies in Biosystems, University of Novi Sad, Dr Zorana Djindjica 1, 21000 Novi Sad, Serbia.
  • Gianvito Pio
    Dept. of Computer Science, University of Bari Aldo Moro, Via Orabona 4, 70125 Bari, Italy.