Distributed One-Class Support Vector Machine.

Journal: International Journal of Neural Systems
Published Date:

Abstract

This paper presents a novel distributed one-class classification approach based on an extension of the ν-SVM method, which permits its application to Big Data sets. The method considers several one-class classifiers, each trained on a local data partition held by one processor, and the goal is to obtain a global model. Its cornerstone is a novel mathematical formulation that makes the optimization problem separable while excluding some data points, regarded as outliers, from the final solution. This is important because the decision region generated by the method is then unaffected by the position of the outliers and fits the shape of the data more precisely. Another interesting property is that, although built in parallel, the classifiers exchange data during learning in order to improve their individual specialization. Experimental results on different datasets show that the decision regions of the proposed method achieve good accuracy in comparison with other well-known classifiers, while its distributed nature reduces training time.
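
For context, the standard single-machine one-class ν-SVM that the abstract takes as its starting point can be written as below. This is the well-known formulation of Schölkopf et al., not the separable distributed formulation proposed in the paper, which the abstract does not state. The symbols n (number of training points), Φ (feature map), w, ρ and the slack variables ξ_i are notation assumed here for illustration.

```latex
\begin{aligned}
\min_{w,\;\xi,\;\rho}\quad & \frac{1}{2}\lVert w\rVert^{2}
      + \frac{1}{\nu n}\sum_{i=1}^{n}\xi_{i} - \rho \\
\text{subject to}\quad & \langle w, \Phi(x_{i})\rangle \ge \rho - \xi_{i},
      \qquad \xi_{i}\ge 0,\qquad i=1,\dots,n, \\[4pt]
f(x) \;=\;& \operatorname{sgn}\bigl(\langle w, \Phi(x)\rangle - \rho\bigr).
\end{aligned}
```

In this formulation the parameter ν ∈ (0, 1] is an upper bound on the fraction of training points treated as outliers (those with ξ_i > 0) and a lower bound on the fraction of support vectors, which is why the treatment of outliers is central to any extension of the method.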

Authors

  • Enrique Castillo
    Department of Applied Mathematics and Computer Science, University of Cantabria, Av. Los Castros, Santander, 39005, Spain.
  • Diego Peteiro-Barral
    Department of Computer Science, University of A Coruña, Campus de Elviña s/n, A Coruña, 15071, Spain.
  • Bertha Guijarro Berdiñas
    Department of Computer Science, University of A Coruña, Campus de Elviña s/n, A Coruña, 15071, Spain.
  • Oscar Fontenla-Romero
    Department of Computer Science, University of A Coruña, Campus de Elviña s/n, A Coruña, 15071, Spain.