Biosignature Discovery for Substance Use Disorders Using Statistical Learning.

Journal: Trends in molecular medicine
Published Date:

Abstract

There are limited biomarkers for substance use disorders (SUDs). Traditional statistical approaches are identifying simple biomarkers in large samples, but clinical use cases are still being established. High-throughput clinical, imaging, and 'omic' technologies are generating data from SUD studies and may lead to more sophisticated and clinically useful models. However, analytic strategies suited for high-dimensional data are not regularly used. We review strategies for identifying biomarkers and biosignatures from high-dimensional data types. Focusing on penalized regression and Bayesian approaches, we address how to leverage evidence from existing studies and knowledge bases, using nicotine metabolism as an example. We posit that big data and machine learning approaches will considerably advance SUD biomarker discovery. However, translation to clinical practice, will require integrated scientific efforts.

Authors

  • James W Baurley
    BioRealm LLC, Walnut, California, USA.
  • Christopher S McMahan
    School of Mathematical and Statistical Sciences, Clemson University, Clemson, South Carolina, USA.
  • Carolyn M Ervin
    BioRealm, Culver City, CA, USA.
  • Bens Pardamean
    BioRealm, Culver City, CA, USA; Bina Nusantara University, Jakarta, Indonesia.
  • Andrew W Bergen
    BioRealm, Culver City, CA, USA; Oregon Research Institute, Eugene, OR, USA.