Learning Performance of Weighted Distributed Learning With Support Vector Machines.

Journal: IEEE transactions on cybernetics
PMID:

Abstract

The divide-and-conquer strategy is a very effective method of dealing with big data. Noisy samples in big data usually have a great impact on algorithmic performance. In this article, we introduce Markov sampling and different weights for distributed learning with the classical support vector machine (cSVM). We first estimate the generalization error of weighted distributed cSVM algorithm with uniformly ergodic Markov chain (u.e.M.c.) samples and obtain its optimal convergence rate. As applications, we obtain the generalization bounds of weighted distributed cSVM with strong mixing observations and independent and identically distributed (i.i.d.) samples, respectively. We also propose a novel weighted distributed cSVM based on Markov sampling (DM-cSVM). The numerical studies of benchmark datasets show that the DM-cSVM algorithm not only has better performance but also has less total time of sampling and training compared to other distributed algorithms.

Authors

  • Bin Zou
    Department of Applied Linguistics, Xi'an Jiaotong-Liverpool University, Suzhou, China.
  • Hongwei Jiang
    Faculty of Mathematics and Statistics, Hubei Key Laboratory of Applied Mathematics, Hubei University, Wuhan 430062, China. Electronic address: jhw940466281@163.com.
  • Chen Xu
    Department of Mathematics and Statistics, University of Ottawa, Ottawa, ON K1N 6N5, Canada.
  • Jie Xu
    Second Affiliated Hospital Zhejiang University School of Medicine, Hangzhou, Zhejiang Province 310000, China.
  • Xinge You
    School of Electronic Information and Communications, Huazhong University of Science and Technology, Wuhan, China; Shenzhen Huazhong University of Science and Technology Research Institute, China.
  • Yuan Yan Tang
    Faculty of Science and Technology, University of Macau, China. Electronic address: yytang@umac.mo.