LedPred: an R/bioconductor package to predict regulatory sequences using support vector machines.

Journal: Bioinformatics (Oxford, England)
PMID:

Abstract

UNLABELLED: Supervised classification based on support vector machines (SVMs) has successfully been used for the prediction of cis-regulatory modules (CRMs). However, no integrated tool using such heterogeneous data as position-specific scoring matrices, ChIP-seq data or conservation scores is currently available. Here, we present LedPred, a flexible SVM workflow that predicts new regulatory sequences based on the annotation of known CRMs, which are associated to a large variety of feature types. LedPred is provided as an R/Bioconductor package connected to an online server to avoid installation of non-R software. Due to the heterogeneous CRM feature integration, LedPred excels at the prediction of regulatory sequences in Drosophila and mouse datasets compared with similar SVM-based software.

Authors

  • Denis Seyres
    INSERM, UMR1090 TAGC, Marseille, F-13288 France, Aix-Marseille Université, UMR1090 TAGC, Marseille, F-13288 France.
  • Elodie Darbo
    Cancer Research UK, London Research Institute, London WC2A 3LY, UK.
  • Laurent Perrin
    INSERM, UMR1090 TAGC, Marseille, F-13288 France, Aix-Marseille Université, UMR1090 TAGC, Marseille, F-13288 France, CNRS, Marseille, France and.
  • Carl Herrmann
    IPMB, Universität Heidelberg and Department of Theoretical Bioinformatics, DKFZ, Heidelberg 69120, Germany.
  • Aitor González
    INSERM, UMR1090 TAGC, Marseille, F-13288 France, Aix-Marseille Université, UMR1090 TAGC, Marseille, F-13288 France.