Machine-learned cluster identification in high-dimensional data.

Journal: Journal of biomedical informatics
Published Date:

Abstract

BACKGROUND: High-dimensional biomedical data are frequently clustered to identify subgroup structures pointing at distinct disease subtypes. It is crucial that the used cluster algorithm works correctly. However, by imposing a predefined shape on the clusters, classical algorithms occasionally suggest a cluster structure in homogenously distributed data or assign data points to incorrect clusters. We analyzed whether this can be avoided by using emergent self-organizing feature maps (ESOM).

Authors

  • Alfred Ultsch
    DataBionics Research Group, University of Marburg, Hans - Meerwein - Straße, 35032 Marburg, Germany.
  • Jörn Lötsch
    Institute of Clinical Pharmacology, Goethe - University, Theodor Stern Kai 7, 60590 Frankfurt am Main, Germany.