A new Growing Neural Gas for clustering data streams.

Journal: Neural networks : the official journal of the International Neural Network Society
Published Date:

Abstract

Clustering data streams is becoming the most efficient way to cluster a massive dataset. This task requires a process capable of partitioning observations continuously with restrictions of memory and time. In this paper we present a new algorithm, called G-Stream, for clustering data streams by making one pass over the data. G-Stream is based on growing neural gas, that allows us to discover clusters of arbitrary shapes without any assumptions on the number of clusters. By using a reservoir, and applying a fading function, the quality of clustering is improved. The performance of the proposed algorithm is evaluated on public datasets.

Authors

  • Mohammed Ghesmoune
    University of Paris 13, Sorbonne Paris City LIPN-UMR 7030 - CNRS, 99, av. J-B Clément-F-93430 Villetaneuse, France. Electronic address: Mohammed.Ghesmoune@lipn.univ-paris13.fr.
  • Mustapha Lebbah
    University of Paris 13, Sorbonne Paris City LIPN-UMR 7030 - CNRS, 99, av. J-B Clément-F-93430 Villetaneuse, France. Electronic address: Mustapha.Lebbah@lipn.univ-paris13.fr.
  • Hanene Azzag
    University of Paris 13, Sorbonne Paris City LIPN-UMR 7030 - CNRS, 99, av. J-B Clément-F-93430 Villetaneuse, France. Electronic address: Hanene.Azzag@lipn.univ-paris13.fr.