A deep learning model to predict RNA-Seq expression of tumours from whole slide images.

Journal: Nature communications
Published Date:

Abstract

Deep learning methods for digital pathology analysis are an effective way to address multiple clinical questions, from diagnosis to prediction of treatment outcomes. These methods have also been used to predict gene mutations from pathology images, but no comprehensive evaluation of their potential for extracting molecular features from histology slides has yet been performed. We show that HE2RNA, a model based on the integration of multiple data modes, can be trained to systematically predict RNA-Seq profiles from whole-slide images alone, without expert annotation. Through its interpretable design, HE2RNA provides virtual spatialization of gene expression, as validated by CD3- and CD20-staining on an independent dataset. The transcriptomic representation learned by HE2RNA can also be transferred on other datasets, even of small size, to increase prediction performance for specific molecular phenotypes. We illustrate the use of this approach in clinical diagnosis purposes such as the identification of tumors with microsatellite instability.

Authors

  • Benoit Schmauch
    Owkin Lab, Owkin, Inc., New York, NY, USA.
  • Alberto Romagnoni
    Centre de recherche sur l'inflammation UMR 1149, Inserm - Université Paris Diderot, 75018, Paris, France.
  • Elodie Pronier
    Owkin Lab, Owkin, Inc., New York, NY, USA.
  • Charlie Saillard
    Owkin Lab, Owkin, Inc., New York, NY, USA.
  • Pascale Maillé
    INSERM U955, Team "Pathophysiology and Therapy of Chronic Viral Hepatitis and Related Cancers", Créteil, France.
  • Julien Calderaro
    Department of Pathology, Henri Mondor University Hospital, Créteil, France.
  • Aurelie Kamoun
    Programme cartes d'identite des tumeurs, Ligue nationale contre le cancer, Paris, France.
  • Meriem Sefta
    Owkin Lab, Owkin, Inc., New York, NY, USA.
  • Sylvain Toldo
    Owkin Lab, Owkin, Inc., New York, NY, USA.
  • Mikhail Zaslavskiy
    Owkin Lab, Owkin, Inc., New York, NY, USA.
  • Thomas Clozel
    OWKIN Paris, France.
  • Matahi Moarii
    OWKIN Paris, France.
  • Pierre Courtiol
    OWKIN Paris, France.
  • Gilles Wainrib
    Ecole Normale Supérieure, Département d'Informatique, équipe DATA, Paris, France.