NuInsSeg: A fully annotated dataset for nuclei instance segmentation in H&E-stained histological images.

Journal: Scientific data
PMID:

Abstract

In computational pathology, automatic nuclei instance segmentation plays an essential role in whole slide image analysis. While many computerized approaches have been proposed for this task, supervised deep learning (DL) methods have shown superior segmentation performances compared to classical machine learning and image processing techniques. However, these models need fully annotated datasets for training which is challenging to acquire, especially in the medical domain. In this work, we release one of the biggest fully manually annotated datasets of nuclei in Hematoxylin and Eosin (H&E)-stained histological images, called NuInsSeg. This dataset contains 665 image patches with more than 30,000 manually segmented nuclei from 31 human and mouse organs. Moreover, for the first time, we provide additional ambiguous area masks for the entire dataset. These vague areas represent the parts of the images where precise and deterministic manual annotations are impossible, even for human experts. The dataset and detailed step-by-step instructions to generate related segmentation masks are publicly available on the respective repositories.

Authors

  • Amirreza Mahbod
  • Christine Polak
    Institute for Pathophysiology and Allergy Research, Medical University of Vienna, Vienna, 1090, Austria.
  • Katharina Feldmann
    Institute for Pathophysiology and Allergy Research, Medical University of Vienna, Vienna, 1090, Austria.
  • Rumsha Khan
    Institute for Pathophysiology and Allergy Research, Medical University of Vienna, Vienna, 1090, Austria.
  • Katharina Gelles
    Institute for Pathophysiology and Allergy Research, Medical University of Vienna, Vienna, 1090, Austria.
  • Georg Dorffner
    Center for Medical Statistics, Informatics and Intelligent Systems, Medical University of Vienna, Spitalgasse 23, 1090, Vienna, Austria.
  • Ramona Woitek
    Department of Radiology, University of Cambridge, Cambridge CB2 0QQ, UK; Cancer Research UK Cambridge Centre, University of Cambridge, Cambridge CB2 0RE, UK; Department of Biomedical Imaging and Image-guided Therapy, Medical University Vienna, Vienna 1090, Austria. Electronic address: rw585@cam.ac.uk.
  • Sepideh Hatamikia
    Austrian Center for Medical Innovation and Technology, Vienna, Austria.
  • Isabella Ellinger