Assessing generalisability of deep learning-based polyp detection and segmentation methods through a computer vision challenge.

Journal: Scientific reports
PMID:

Abstract

Polyps are well-known cancer precursors identified by colonoscopy. However, variability in their size, appearance, and location makes the detection of polyps challenging. Moreover, colonoscopy surveillance and removal of polyps are highly operator-dependent procedures and occur in a highly complex organ topology. There exists a high missed detection rate and incomplete removal of colonic polyps. To assist in clinical procedures and reduce missed rates, automated methods for detecting and segmenting polyps using machine learning have been achieved in past years. However, the major drawback in most of these methods is their ability to generalise to out-of-sample unseen datasets from different centres, populations, modalities, and acquisition systems. To test this hypothesis rigorously, we, together with expert gastroenterologists, curated a multi-centre and multi-population dataset acquired from six different colonoscopy systems and challenged the computational expert teams to develop robust automated detection and segmentation methods in a crowd-sourcing Endoscopic computer vision challenge. This work put forward rigorous generalisability tests and assesses the usability of devised deep learning methods in dynamic and actual clinical colonoscopy procedures. We analyse the results of four top performing teams for the detection task and five top performing teams for the segmentation task. Our analyses demonstrate that the top-ranking teams concentrated mainly on accuracy over the real-time performance required for clinical applicability. We further dissect the devised methods and provide an experiment-based hypothesis that reveals the need for improved generalisability to tackle diversity present in multi-centre datasets and routine clinical procedures.

Authors

  • Sharib Ali
    Institute of Biomedical Engineering, Big Data Institute, Department of Engineering Science, University of Oxford, Oxford, UK. sharib.ali@eng.ox.ac.uk.
  • Noha Ghatwary
    University of Lincoln, Lincoln, UK. nghatwary@lincoln.ac.uk.
  • Debesh Jha
    Department of Information and Communication Engineering, Chosun University, 309 Pilmun-Daero, Dong-Gu, Gwangju 61452, Republic of Korea.
  • Ece Isik-Polat
    Graduate School of Informatics, Middle East Technical University, 06800, Ankara, Turkey.
  • Gorkem Polat
    Graduate School of Informatics, Middle East Technical University, Ankara, Turkey.
  • Chen Yang
    Department of Diagnostic Ultrasound Imaging & Interventional Therapy, Zhejiang Cancer Hospital, Hangzhou Institute of Medicine (HIM), Chinese Academy of Sciences, Hangzhou, China.
  • Wuyang Li
    City University of Hong Kong, Kowloon, Hong Kong.
  • Adrian Galdran
  • Miguel-Ángel González Ballester
    BCN MedTech, Department of Information and Communication Technologies, Universitat Pompeu Fabra, 08018, Barcelona, Spain.
  • Vajira Thambawita
    SimulaMet, Oslo, Norway.
  • Steven Hicks
    SimulaMet, Oslo, Norway.
  • Sahadev Poudel
    Department of IT Convergence Engineering, Gachon University, Seongnam, 13120, South Korea.
  • Sang-Woong Lee
    National Research Center for Dementia, Gwangju, Republic of Korea.
  • Ziyi Jin
    Biosensor National Special Laboratory, Key Laboratory of Biomedical Engineering of Ministry of Education, College of Biomedical Engineering and Instrument Science, Zhejiang University, Hangzhou, 310027, People's Republic of China.
  • Tianyuan Gan
    Biosensor National Special Laboratory, Key Laboratory of Biomedical Engineering of Ministry of Education, College of Biomedical Engineering and Instrument Science, Zhejiang University, Hangzhou, 310027, People's Republic of China.
  • ChengHui Yu
    Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen, 518055, China.
  • Jiangpeng Yan
    Shenzhen International Graduate School, Tsinghua University, Shenzhen, Guangdong, China.
  • Doyeob Yeo
    Smart Sensing and Diagnosis Research Division, Korea Atomic Energy Research Institute, Taejon, 34057, Republic of Korea.
  • Hyunseok Lee
    Department of Radiation Oncology, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, Republic of Korea.
  • Nikhil Kumar Tomar
  • Mahmood Haithami
    Computer Science Department, University of Nottingham, Malaysia Campus, 43500, Semenyih, Malaysia.
  • Amr Ahmed
    Faculty of Science and Engineering, School of Computer Science, University of Nottingham, Jalan Broga, 43500, Semenyih Selangor Darul Ehsan, Malaysia.
  • Michael A Riegler
    SimulaMet, Oslo, Norway.
  • Christian Daul
    CRAN UMR 7039, University of Lorraine, CNRS, Nancy, France.
  • Pål Halvorsen
    Center for Digital Engineering Simula Metropolitan, Fornebu 1364, Norway.
  • Jens Rittscher
    Department of Engineering Science, University of Oxford, Oxford, United Kingdom.
  • Osama E Salem
    Faculty of Medicine, University of Alexandria, Alexandria, 21131, Egypt.
  • Dominique Lamarque
    Université de Versailles St-Quentin en Yvelines, Hôpital Ambroise Paré, France.
  • Renato Cannizzaro
    CRO Centro Riferimento Oncologico IRCCS, Aviano, Italy.
  • Stefano Realdon
    Instituto Onclologico Veneto, IOV-IRCCS, Padova, Italy.
  • Thomas de Lange
    Department of Transplantation, Oslo University Hospital, Oslo 0424, Norway.
  • James E East
    Translational Gastroenterology Unit, Nuffield Department of Medicine, Experimental Medicine Div., John Radcliffe Hospital, University of Oxford, Oxford, UK.