Boost-RS: boosted embeddings for recommender systems and its application to enzyme-substrate interaction prediction.

Journal: Bioinformatics (Oxford, England)
Published Date:

Abstract

MOTIVATION: Despite experimental and curation efforts, the extent of enzyme promiscuity on substrates continues to be largely unexplored and under documented. Providing computational tools for the exploration of the enzyme-substrate interaction space can expedite experimentation and benefit applications such as constructing synthesis pathways for novel biomolecules, identifying products of metabolism on ingested compounds, and elucidating xenobiotic metabolism. Recommender systems (RS), which are currently unexplored for the enzyme-substrate interaction prediction problem, can be utilized to provide enzyme recommendations for substrates, and vice versa. The performance of Collaborative-Filtering (CF) RSs; however, hinges on the quality of embedding vectors of users and items (enzymes and substrates in our case). Importantly, enhancing CF embeddings with heterogeneous auxiliary data, specially relational data (e.g. hierarchical, pairwise or groupings), remains a challenge.

Authors

  • Xinmeng Li
    Department of Computer Science, Tufts University, Massachusetts, United States of America.
  • Li-Ping Liu
    College of Veterinary Medicine, Gansu Agricultural University, Lanzhou, China.
  • Soha Hassoun
    Department of Computer Science, Tufts University, Massachusetts, United States of America.