Dissecting self-supervised learning methods for surgical computer vision.

Journal: Medical image analysis

Published Date: May 24, 2023

Abstract

The field of surgical computer vision has undergone considerable breakthroughs in recent years with the rising popularity of deep neural network-based methods. However, standard fully-supervised approaches for training such models require vast amounts of annotated data, imposing a prohibitively high cost; especially in the clinical domain. Self-Supervised Learning (SSL) methods, which have begun to gain traction in the general computer vision community, represent a potential solution to these annotation costs, allowing to learn useful representations from only unlabeled data. Still, the effectiveness of SSL methods in more complex and impactful domains, such as medicine and surgery, remains limited and unexplored. In this work, we address this critical need by investigating four state-of-the-art SSL methods (MoCo v2, SimCLR, DINO, SwAV) in the context of surgical computer vision. We present an extensive analysis of the performance of these methods on the Cholec80 dataset for two fundamental and popular tasks in surgical context understanding, phase recognition and tool presence detection. We examine their parameterization, then their behavior with respect to training data quantities in semi-supervised settings. Correct transfer of these methods to surgery, as described and conducted in this work, leads to substantial performance gains over generic uses of SSL - up to 7.4% on phase recognition and 20% on tool presence detection - as well as state-of-the-art semi-supervised phase recognition approaches by up to 14%. Further results obtained on a highly diverse selection of surgical datasets exhibit strong generalization properties. The code is available at https://github.com/CAMMA-public/SelfSupSurg.

Authors

Sanat Ramesh

Altair Robotics Lab, Department of Computer Science, University of Verona, Verona, Italy. sanat.ramesh@univr.it.
Vinkle Srivastav

ICube, University of Strasbourg, CNRS, France. Electronic address: srivastav@unistra.fr.
Deepak Alapatt

ICube, University of Strasbourg, CNRS, IHU Strasbourg, France.
Tong Yu
Aditya Murali

University of Strasbourg, UMR 7357 CNRS, ICube, Strasbourg, France.
Luca Sestini

ICube, University of Strasbourg, CNRS, IHU Strasbourg, France; Department of Electronics, Information and Bioengineering, Politecnico di Milano, Milano, Italy. Electronic address: sestini@unistra.fr.
Chinedu Innocent Nwoye

ICube, University of Strasbourg, CNRS, IHU, Strasbourg, France. nwoye.chinedu@gmail.com.
Idris Hamoud

CNRS, ICube, University of Strasbourg, Strasbourg, France. ihamoud@unistra.fr.
Saurav Sharma

ICube, University of Strasbourg, CNRS, Strasbourg 67000, France.
Antoine Fleurentin

IHU Strasbourg, Strasbourg 67000, France.
Georgios Exarchakis

ICube, University of Strasbourg, CNRS, Strasbourg 67000, France; IHU Strasbourg, Strasbourg 67000, France.
Alexandros Karargyris

IHU Strasbourg, Strasbourg, France.
Nicolas Padoy

IHU Strasbourg, Strasbourg, France.

Keywords

Computers Humans Neural Networks, Computer Supervised Machine Learning

External Resources

View on PubMed Access via DOI PubMed (37270898)

Dissecting self-supervised learning methods for surgical computer vision.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals