High-resolution AI image dataset for diagnosing oral submucous fibrosis and squamous cell carcinoma.

Journal: Scientific data
PMID:

Abstract

Oral cancer is a global health challenge with a difficult histopathological diagnosis. The accurate histopathological interpretation of oral cancer tissue samples remains difficult. However, early diagnosis is very challenging due to a lack of experienced pathologists and inter- observer variability in diagnosis. The application of artificial intelligence (deep learning algorithms) for oral cancer histology images is very promising for rapid diagnosis. However, it requires a quality annotated dataset to build AI models. We present ORCHID (ORal Cancer Histology Image Database), a specialized database generated to advance research in AI-based histology image analytics of oral cancer and precancer. The ORCHID database is an extensive multicenter collection of high-resolution images captured at 1000X effective magnification (100X objective lens), encapsulating various oral cancer and precancer categories, such as oral submucous fibrosis (OSMF) and oral squamous cell carcinoma (OSCC). Additionally, it also contains grade-level sub-classifications for OSCC, such as well- differentiated (WD), moderately-differentiated (MD), and poorly-differentiated (PD). The database seeks to aid in developing innovative artificial intelligence-based rapid diagnostics for OSMF and OSCC, along with subtypes.

Authors

  • Nisha Chaudhary
    Multidisciplinary Centre for Advanced Research and Studies, Jamia Millia Islamia, New Delhi, India.
  • Arpita Rai
    Rajendra Institute of Medical Sciences, Ranchi, Jharkhand, India.
  • Aakash Madhav Rao
    Department of Computer Science, Ashoka University, Sonipat, Haryana, India.
  • Md Imam Faizan
    Multidisciplinary Centre for Advanced Research and Studies, Jamia Millia Islamia, New Delhi, India.
  • Jeyaseelan Augustine
    Maulana Azad Institute of Dental Sciences, New Delhi, India.
  • Akhilanand Chaurasia
    Department of Oral Medicine and Radiology, King George's Medical University, Lucknow, India.
  • Deepika Mishra
    All India Institute of Medical Sciences, New Delhi, India.
  • Akhilesh Chandra
    Banaras Hindu University, Banaras, Uttar Pradesh, India.
  • Varnit Chauhan
    Multidisciplinary Centre for Advanced Research and Studies, Jamia Millia Islamia, New Delhi, India.
  • Tanveer Ahmad
    Innovation Education and Research Center for On-Device AI Software (Bk21), Department of Computer Science and Engineering, Chungnam National University, Daejeon 34134, Republic of Korea.