Grasp the Graph (GtG) 2.0: Ensemble of GNNs for High-Precision Grasp Pose Detection in Clutter
Journal:
arXiv
Published Date:
May 5, 2025
Abstract
Grasp pose detection in cluttered, real-world environments remains a
significant challenge due to noisy and incomplete sensory data combined with
complex object geometries. This paper introduces Grasp the Graph 2.0 (GtG 2.0)
method, a lightweight yet highly effective hypothesis-and-test robotics
grasping framework which leverages an ensemble of Graph Neural Networks for
efficient geometric reasoning from point cloud data. Building on the success of
GtG 1.0, which demonstrated the potential of Graph Neural Networks for grasp
detection but was limited by assumptions of complete, noise-free point clouds
and 4-Dof grasping, GtG 2.0 employs a conventional Grasp Pose Generator to
efficiently produce 7-Dof grasp candidates. Candidates are assessed with an
ensemble Graph Neural Network model which includes points within the gripper
jaws (inside points) and surrounding contextual points (outside points). This
improved representation boosts grasp detection performance over previous
methods using the same generator. GtG 2.0 shows up to a 35% improvement in
Average Precision on the GraspNet-1Billion benchmark compared to
hypothesis-and-test and Graph Neural Network-based methods, ranking it among
the top three frameworks. Experiments with a 3-Dof Delta Parallel robot and
Kinect-v1 camera show a success rate of 91% and a clutter completion rate of
100%, demonstrating its flexibility and reliability.