Robust identification of molecular phenotypes using semi-supervised learning.

Journal: BMC bioinformatics
Published Date:

Abstract

BACKGROUND: Modern molecular profiling techniques are yielding vast amounts of data from patient samples that could be utilized with machine learning methods to provide important biological insights and improvements in patient outcomes. Unsupervised methods have been successfully used to identify molecularly-defined disease subtypes. However, these approaches do not take advantage of potential additional clinical outcome information. Supervised methods can be implemented when training classes are apparent (e.g., responders or non-responders to treatment). However, training classes can be difficult to define when assessing relative benefit of one therapy over another using gold standard clinical endpoints, since it is often not clear how much benefit each individual patient receives.

Authors

  • Heinrich Roder
    Biodesix Inc, 2970 Wilderness Pl, Ste100, Boulder, CO, 80301, USA.
  • Carlos Oliveira
    Biodesix Inc, 2970 Wilderness Pl, Ste100, Boulder, CO, 80301, USA.
  • Lelia Net
    Biodesix Inc, 2970 Wilderness Pl, Ste100, Boulder, CO, 80301, USA.
  • Benjamin Linstid
    Biodesix Inc, 2970 Wilderness Pl, Ste100, Boulder, CO, 80301, USA.
  • Maxim Tsypin
    Biodesix Inc, 2970 Wilderness Pl, Ste100, Boulder, CO, 80301, USA.
  • Joanna Roder
    Biodesix Inc, 2970 Wilderness Pl, Ste100, Boulder, CO, 80301, USA. joanna.roder@biodesix.com.