ICSDA: a multi-modal deep learning model to predict breast cancer recurrence and metastasis risk by integrating pathological, clinical and gene expression data.

Journal: Briefings in bioinformatics
Published Date:

Abstract

Breast cancer patients often have recurrence and metastasis after surgery. Predicting the risk of recurrence and metastasis for a breast cancer patient is essential for the development of precision treatment. In this study, we proposed a novel multi-modal deep learning prediction model by integrating hematoxylin & eosin (H&E)-stained histopathological images, clinical information and gene expression data. Specifically, we segmented tumor regions in H&E into image blocks (256 × 256 pixels) and encoded each image block into a 1D feature vector using a deep neural network. Then, the attention module scored each area of the H&E-stained images and combined image features with clinical and gene expression data to predict the risk of recurrence and metastasis for each patient. To test the model, we downloaded all 196 breast cancer samples from the Cancer Genome Atlas with clinical, gene expression and H&E information simultaneously available. The samples were then divided into the training and testing sets with a ratio of 7: 3, in which the distributions of the samples were kept between the two datasets by hierarchical sampling. The multi-modal model achieved an area-under-the-curve value of 0.75 on the testing set better than those based solely on H&E image, sequencing data and clinical data, respectively. This study might have clinical significance in identifying high-risk breast cancer patients, who may benefit from postoperative adjuvant treatment.

Authors

  • Yuhua Yao
    College of Life Sciences, Zhejiang Sci-Tech University, Hangzhou 310018, China; School of Mathematics and Statistics, Hainan Normal University, Haikou 571158, China. Electronic address: yaoyuhua2288@163.com.
  • Yaping Lv
    School of Mathematics and Statistics, Hainan Normal University, Haikou 570100, China.
  • Ling Tong
  • Yuebin Liang
    Genies Beijing Co., Ltd., Beijing 100102, China.
  • Shuxue Xi
    Genies Beijing Co., Ltd., Beijing 100102, China.
  • Binbin Ji
    Genies Beijing Co., Ltd., Beijing 100102, China.
  • Guanglu Zhang
    School of Mathematics and Statistics, Hainan Normal University, Haikou 570100, China.
  • Ling Li
    College of Communication Engineering, Jilin University, Changchun, Jilin China.
  • Geng Tian
    Department of Sciences, Genesis (Beijing) Co. Ltd., Beijing, China.
  • Min Tang
    Department of Nephrology, Blood Purification Research Center, the First Affiliated Hospital, Fujian Medical University, Fuzhou, People's Republic of China.
  • Xiyue Hu
    Dept. of Colorectal Surgery, National Cancer Center/ Cancer Hospital, Chinese Academy of Medical Science, 17 Panjiayuan Nanli, Chaoyang District, Beijing, China, 100021.
  • Shijun Li
    Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction, Ministry of Education, Huazhong Agricultural University, Wuhan, Hubei Province, 430070 China. Electronic address: lishijun@mail.hzau.edu.cn.
  • Jialiang Yang
    Department of Sciences, Genesis (Beijing) Co. Ltd., Beijing, China.