SIMLR: A Tool for Large-Scale Genomic Analyses by Multi-Kernel Learning.

Journal: Proteomics
Published Date:

Abstract

SIMLR (Single-cell Interpretation via Multi-kernel LeaRning), an open-source tool that implements a novel framework to learn a sample-to-sample similarity measure from expression data observed for heterogenous samples, is presented here. SIMLR can be effectively used to perform tasks such as dimension reduction, clustering, and visualization of heterogeneous populations of samples. SIMLR was benchmarked against state-of-the-art methods for these three tasks on several public datasets, showing it to be scalable and capable of greatly improving clustering performance, as well as providing valuable insights by making the data more interpretable via better a visualization. SIMLR is available on https://github.com/BatzoglouLabSU/SIMLRGitHub in both R and MATLAB implementations. Furthermore, it is also available as an R package on http://bioconductor.org.

Authors

  • Bo Wang
    Department of Clinical Laboratory Medicine Center, Inner Mongolia Autonomous Region People's Hospital, Hohhot, Inner Mongolia, China.
  • Daniele Ramazzotti
    Department of Computer Science, Stanford University, Stanford, CA, USA.
  • Luca De Sano
    Department of Informatics, University of Milano-Bicocca, Milan, Italy.
  • Junjie Zhu
    Hunan University; zhujunjie@hnu.edu.cn.
  • Emma Pierson
    Department of Computer Science, Stanford University, Stanford, CA, USA.
  • Serafim Batzoglou
    Department of Computer Science, Stanford University, Stanford, CA, USA.