CephTransX: An attention enhanced feature fusion network leveraging neighborhood rough set approach for cephalometric landmark prediction.

Journal: Computers in biology and medicine
PMID:

Abstract

The convergence of medical imaging, computer vision, and orthodontics has made automatic cephalometric landmark detection a pivotal area of research. Accurate cephalometric analysis is crucial in orthodontics, orthognathic and maxillofacial surgery for diagnosis, treatment planning, and monitoring craniofacial growth. In this research study, a multi-branch fused feature extraction network titled CephTransX is proposed to automatically predict landmark coordinates from cephalometric radiographs. The initial sequential branch enhances discriminative local feature learning and feature extraction through parallel feature fusion by integrating Convolved Pooled Normalized (CPN) and Gradient Optimized Multi-Path Bottleneck (GMB) blocks with Channel and Spatial Attention (CSAT) module. The Swin Transformer (ST) branch efficiently handles long-range dependencies and extracts global features in cephalometric radiographs. The multi-branch fused features along with features from skip connections of CPN and GMB blocks are concatenated using a Coordinate Attention module (CoAT) to captures the positional relationships between various landmark features. A Landmark Discriminative Deviation Factor (LDDF) is determined by applying the Neighborhood Rough Set (NRS) approach to analyse the surrounding features of each landmark by considering spatial relationships or similarity measures between the landmarks and neighboring regions. The Spatial Pyramid Pooling (SPP) layer incorporated in the final phase of CephTransX model extracts multi-scale features by pooling over sub-regions of varying sizes, enabling the network to capture both local and global context for precise cephalometric landmark identification. The CephTransX framework achieved an average Successful Detection Rates (SDR) of 88.71 % and 79.05 % in 2 mm using the 2015 International Symposium on Biomedical Imaging (ISBI) grand challenge dental X-ray analysis dataset. The effectiveness of the CephTransX model is evaluated using a private clinical dataset obtained from Solanki Dental Care Clinic in Sharjah, UAE, and attained an average SDR of 74.38 % in 2 mm precision range.

Authors

  • R Neeraja
    School of Computer Science and Engineering, Vellore Institute of Technology, Chennai, India. Electronic address: neeraja.r2020@vitstudent.ac.in.
  • L Jani Anbarasi
    School of Computer Science and Engineering, Vellore Institute of Technology, Chennai, India.