Protocol to calculate and compare exact Shapley values for different kernels in support vector machine models using binary features.

Journal: STAR protocols
PMID:

Abstract

The Shapley value formalism from cooperative game theory was adapted to explain predictions of machine learning models. Here, we present a protocol to calculate and compare exact Shapley values for support vector machine models with commonly used kernels and binary input features. We describe steps for installing software, preparing data, and calculating Shapley values with customizable Python scripts. We then detail procedures for analyzing results via correlation analysis and feature mapping. For complete details on the use and execution of this protocol, please refer to Roth and Bajorath..

Authors

  • Jannik P Roth
    Department of Life Science Informatics and Data Science, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Friedrich-Hirzebruch-Allee 5/6, 53115 Bonn, Germany; Lamarr Institute for Machine Learning and Artificial Intelligence, Friedrich-Hirzebruch-Allee 5/6, 53115 Bonn, Germany. Electronic address: jproth@bit.uni-bonn.de.
  • Jürgen Bajorath
    Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Dahlmannstr. 2, D-53113 Bonn, Germany.