Detection, segmentation, and 3D pose estimation of surgical tools using convolutional neural networks and algebraic geometry.

Journal: Medical image analysis
Published Date:

Abstract

BACKGROUND AND OBJECTIVE: Surgical tool detection, segmentation, and 3D pose estimation are crucial components in Computer-Assisted Laparoscopy (CAL). The existing frameworks have two main limitations. First, they do not integrate all three components. Integration is critical; for instance, one should not attempt computing pose if detection is negative. Second, they have highly specific requirements, such as the availability of a CAD model. We propose an integrated and generic framework whose sole requirement for the 3D pose is that the tool shaft is cylindrical. Our framework makes the most of deep learning and geometric 3D vision by combining a proposed Convolutional Neural Network (CNN) with algebraic geometry. We show two applications of our framework in CAL: tool-aware rendering in Augmented Reality (AR) and tool-based 3D measurement.

Authors

  • Md Kamrul Hasan
    Marquette University, Milwaukee, WI, USA.
  • Lilian Calvet
    EnCoV, Institut Pascal, CNRS, Université Clermont Auvergne, Clermont-Ferrand, France.
  • Navid Rabbani
    EnCoV, Institut Pascal, UMR 6602 CNRS/Université Clermont-Auvergne, Clermont-Ferrand, France.
  • Adrien Bartoli
    EnCoV, Institut Pascal, CNRS, Université Clermont Auvergne, Clermont-Ferrand, France.