YOLO-TARC: YOLOv10 with Token Attention and Residual Convolution for Small Void Detection in Root Canal X-Ray Images.

Journal: Sensors (Basel, Switzerland)
Published Date:

Abstract

The detection of small voids or defects in X-ray images of tooth root canals still faces challenges. To address the issue, this paper proposes an improved YOLOv10 that combines Token Attention with Residual Convolution (ResConv), termed YOLO-TARC. To overcome the limitations of existing deep learning models in effectively retaining key features of small objects and their insufficient focusing capabilities, we introduce three improvements. First, ResConv is designed to ensure the transmission of discriminative features of small objects during feature propagation, leveraging the ability of residual connections to transmit information from one layer to the next. Second, to tackle the issue of weak focusing capabilities on small targets, a Token Attention module is introduced before the third small object detection head. By tokenizing feature maps and enhancing local focusing, it enables the model to pay closer attention to small targets. Additionally, to optimize the training process, a bounding box loss function is adopted to achieve faster and more accurate bounding box predictions. YOLO-TARC simultaneously enhances the ability to retain detailed information of small targets and improves their focusing capabilities, thereby increasing detection accuracy. Experimental results on a private root canal X-ray image dataset demonstrate that YOLO-TARC outperforms other state-of-the-art object detection models, achieving a 7.5% improvement to 80.8% in mAP50 and a 6.2% increase to 80.0% in Recall. YOLO-TARC can contribute to more accurate and efficient objective postoperative evaluation of root canal treatments.

Authors

  • Yin Pan
    Department of Ultrasound, The Second Affiliated Hospital of Wenzhou Medical University, Wenzhou, Zhejiang, China; Wenzhou Key Laboratory of Structural & Functional Imaging, Wenzhou, Zhejiang, China.
  • Zhenpeng Zhang
    College of Mechatronics and Control Engineering, Shenzhen University, Shenzhen 518060, China.
  • Xueyang Zhang
    Department of Stomatology, Shunde Hospital, Southern Medical University (The First People's Hospital of Shunde, Foshan), Foshan, 528308, Guangdong, China. zxy123@smu.edu.cn.
  • Zhi Zeng
    Department of Pathology, Renmin Hospital of Wuhan University, Wuhan, China.
  • Yibin Tian
    College of Mechatronics and Control Engineering & State Key Laboratory of Radio Frequency Heterogenous Integration, Shenzhen University, Shenzhen, 518060, China.