Visualization and Interpretation of Support Vector Machine Activity Predictions.

Journal: Journal of chemical information and modeling

PMID: 25988274

Abstract

Support vector machines (SVMs) are among the preferred machine learning algorithms for virtual compound screening and activity prediction because of their frequently observed high performance levels. However, a well-known conundrum of SVMs (and other supervised learning methods) is the black box character of their predictions, which makes it difficult to understand why models succeed or fail. Herein we introduce an approach to rationalize the performance of SVM models based upon the Tanimoto kernel compared with the linear kernel. Model comparison and interpretation are facilitated by a visualization technique, making it possible to identify descriptor features that determine compound activity predictions. An implementation of the methodology has been made freely available.

Authors

Jenny Balfer

Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Dahlmannstr. 2, D-53113, Bonn, Germany.
Jürgen Bajorath

Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Dahlmannstr. 2, D-53113 Bonn, Germany.

Keywords

Algorithms Drug Evaluation, Preclinical Humans Ligands Linear Models Mitogen-Activated Protein Kinase 14 Protein Kinase Inhibitors Receptor, Cannabinoid, CB2 Receptors, Dopamine D2 Software Support Vector Machine

External Resources

View on PubMed Access via DOI PubMed (25988274)

Visualization and Interpretation of Support Vector Machine Activity Predictions.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals