The study on ultrasound image classification using a dual-branch model based on Resnet50 guided by U-net segmentation results.

Journal: BMC medical imaging
Published Date:

Abstract

In recent years, the incidence of nodular thyroid diseases has been increasing annually. Ultrasonography has become a routine diagnostic tool for thyroid nodules due to its high real-time capabilities and low invasiveness. However, thyroid images obtained from current ultrasound tests often have low resolution and are plagued by significant noise interference. Regional differences in medical conditions and varying levels of physician experience can impact the accuracy and efficiency of diagnostic results. With the advancement of deep learning technology, deep learning models are used to identify whether a nodule in a thyroid ultrasound image is benign or malignant. This helps to close the gap between doctors' experience and equipment differences, improving the accuracy of the initial diagnosis of thyroid nodules. To cope with the problem that thyroid ultrasound images contain complex background and noise as well as poorly defined local features. in this paper, we first construct an improved ResNet50 classification model that uses a two-branch input and incorporates a global attention lightening module. This model is used to improve the accuracy of benign and malignant nodule classification in thyroid ultrasound images and to reduce the computational effort due to the two-branch structure.We constructed a U-net segmentation model incorporating our proposed ACR module, which uses hollow convolution with different dilation rates to capture multi-scale contextual information for feature extraction of nodules in thyroid ultrasound images and uses the results of the segmentation task as an auxiliary branch for the classification task to guide the classification model to focus on the lesion region more efficiently in the case of weak local features. The classification model is guided to focus on the lesion region more efficiently, and the classification and segmentation sub-networks are respectively improved specifically for this study, which is used to improve the accuracy of classifying the benign and malignant nature of the nodules in thyroid ultrasound images. The experimental results show that the four evaluation metrics of accuracy, precision, recall, and f1 of the improved model are 96.01%, 93.3%, 98.8%, and 96.0%, respectively. The improvements were 5.7%, 1.6%, 13.1%, and 7.4%, respectively, compared with the baseline classification model.

Authors

  • Xu Yang
    Department of Food Science and Technology, The Ohio State University, Columbus, OH, United States.
  • Shuo'ou Qu
    School of Electronic Information Engineering, Changchun University of Science and Technology, Changchun, 130022, China.
  • Zhilin Wang
    School of Electronic Information Engineering, Changchun University of Science and Technology, Changchun, 130022, China.
  • Lingxiao Li
    Department of Computer Science, Boston University, Boston, USA.
  • Xiaofeng An
    Education Quality Monitoring Center, Jilin Engineering Normal University, Changchun, 130052, China. an.xf@jlenu.edu.cn.
  • Zhibin Cong
    Department of Electrodiagnosis, The Affiliated Hospital to Changchun University of Traditional Chinese Medicine, Changchun, 130021, China. yangcongman2003@126.com.