Enhancing deep convolutional neural network scheme for breast cancer diagnosis with unlabeled data.

Journal: Computerized medical imaging and graphics : the official journal of the Computerized Medical Imaging Society
Published Date:

Abstract

In this study we developed a graph based semi-supervised learning (SSL) scheme using deep convolutional neural network (CNN) for breast cancer diagnosis. CNN usually needs a large amount of labeled data for training and fine tuning the parameters, and our proposed scheme only requires a small portion of labeled data in training set. Four modules were included in the diagnosis system: data weighing, feature selection, dividing co-training data labeling, and CNN. 3158 region of interests (ROIs) with each containing a mass extracted from 1874 pairs of mammogram images were used for this study. Among them 100 ROIs were treated as labeled data while the rest were treated as unlabeled. The area under the curve (AUC) observed in our study was 0.8818, and the accuracy of CNN is 0.8243 using the mixed labeled and unlabeled data.

Authors

  • Wenqing Sun
    Department of Electrical and Computer Engineering, University of Texas at El Paso, 500 West University Avenue, El Paso, TX 79968, USA.
  • Tzu-Liang Bill Tseng
    Department of Industrial, Manufacturing and Systems Engineering, University of Texas at El Paso, 500 West University Avenue, El Paso, TX 79968, USA.
  • Jianying Zhang
    Food Inspection & Quarantine Center, Shenzhen Customs, Shenzhen 518045, China.
  • Wei Qian
    Department of Electrical and Computer Engineering, University of Texas at El Paso, 500 West University Avenue, El Paso, TX 79968, USA; Sino-Dutch Biomedical and Information Engineering School, Northeastern University, No.11, Lane 3, Wenhua Road, Heping District, Shenyang, Liaoning 110819, China. Electronic address: wqian@utep.edu.