Low-distortion and GPU-compatible Tree Embeddings in Hyperbolic Space
Journal:
arXiv
Published Date:
Feb 24, 2025
Abstract
Embedding tree-like data, from hierarchies to ontologies and taxonomies,
forms a well-studied problem for representing knowledge across many domains.
Hyperbolic geometry provides a natural solution for embedding trees, with
vastly superior performance over Euclidean embeddings. Recent literature has
shown that hyperbolic tree embeddings can even be placed on top of neural
networks for hierarchical knowledge integration in deep learning settings. For
all applications, a faithful embedding of trees is needed, with combinatorial
constructions emerging as the most effective direction. This paper identifies
and solves two key limitations of existing works. First, the combinatorial
construction hinges on finding highly separated points on a hypersphere, a
notoriously difficult problem. Current approaches achieve poor separation,
degrading the quality of the corresponding hyperbolic embedding. We propose
highly separated Delaunay tree embeddings (HS-DTE), which integrates angular
separation in a generalized formulation of Delaunay embeddings, leading to
lower embedding distortion. Second, low-distortion requires additional
precision. The current approach for increasing precision is to use multiple
precision arithmetic, which renders the embeddings useless on GPUs in deep
learning settings. We reformulate the combinatorial construction using floating
point expansion arithmetic, leading to superior embedding quality while
retaining utility on accelerated hardware.