Machine learning demonstrates that somatic mutations imprint invariant morphologic features in myelodysplastic syndromes.

Journal: Blood
Published Date:

Abstract

Morphologic interpretation is the standard in diagnosing myelodysplastic syndrome (MDS), but it has limitations, such as varying reliability in pathologic evaluation and lack of integration with genetic data. Somatic events shape morphologic features, but the complexity of morphologic and genetic changes makes clear associations challenging. This article interrogates novel clinical subtypes of MDS using a machine-learning technique devised to identify patterns of cooccurrence among morphologic features and genomic events. We sequenced 1079 MDS patients and analyzed bone marrow morphologic alterations and other clinical features. A total of 1929 somatic mutations were identified. Five distinct morphologic profiles with unique clinical characteristics were defined. Seventy-seven percent of higher-risk patients clustered in profile 1. All lower-risk (LR) patients clustered into the remaining 4 profiles: profile 2 was characterized by pancytopenia, profile 3 by monocytosis, profile 4 by elevated megakaryocytes, and profile 5 by erythroid dysplasia. These profiles could also separate patients with different prognoses. LR MDS patients were classified into 8 genetic signatures (eg, signature A had TET2 mutations, signature B had both TET2 and SRSF2 mutations, and signature G had SF3B1 mutations), demonstrating association with specific morphologic profiles. Six morphologic profiles/genetic signature associations were confirmed in a separate analysis of an independent cohort. Our study demonstrates that nonrandom or even pathognomonic relationships between morphology and genotype to define clinical features can be identified. This is the first comprehensive implementation of machine-learning algorithms to elucidate potential intrinsic interdependencies among genetic lesions, morphologies, and clinical prognostic in attributes of MDS.

Authors

  • Yasunobu Nagata
    Department of Hematology and Medical Oncology, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH.
  • Ran Zhao
    College of Information Engineering, Shanghai Maritime University, Shanghai 201306, China.
  • Hassan Awada
    Department of Hematology and Medical Oncology, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH.
  • Cassandra M Kerr
    Department of Hematology and Medical Oncology, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH.
  • Inom Mirzaev
    Department of Hematology and Medical Oncology, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH.
  • Sunisa Kongkiatkamon
    Department of Hematology and Medical Oncology, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH.
  • Aziz Nazha
    Cleveland Clinic Lerner College of Medicine of Case Western Reserve University, United States; Department of Hematology and Medical Oncology, Cleveland Clinic, United States; Center for Clinical Artificial Intelligence, Cleveland Clinic, United States. Electronic address: nazhaa@ccf.org.
  • Hideki Makishima
    Department of Pathology and Tumor Biology, Graduate School of Medicine, Kyoto University, Kyoto, Japan.
  • Tomas Radivoyevitch
    Department of Quantitative Health Sciences and.
  • Jacob G Scott
    Department of Hematology and Medical Oncology, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH.
  • Mikkael A Sekeres
    Leukemia Program, Department of Hematology and Medical Oncology, Cleveland Clinic, Cleveland, OH; and.
  • Brian P Hobbs
    Department of Quantitative Health Sciences and.
  • Jaroslaw P Maciejewski
    Department of Hematology and Medical Oncology, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH.