Comparative analysis of convolutional neural network models for the histopathological differentiation of acinic cell carcinoma and secretory carcinoma.

Journal: Oral surgery, oral medicine, oral pathology and oral radiology
Published Date:

Abstract

OBJECTIVE: Although artificial intelligence tools show promise for enhancing the diagnosis of head and neck lesions, few studies have tested these resources for the microscopic diagnosis of salivary gland tumors. Specifically, the microscopic differentiation between acinic cell carcinoma and secretory carcinoma has never been addressed in this context. Therefore, this exploratory study aimed to comparatively evaluate the feasibility of applying convolutional neural networks for the microscopic differentiation between acinic cell carcinomas and secretory carcinomas. METHODS: A cross-sectional study using whole-slide images from 46 patients with acinic cell carcinoma (n = 26) or secretory carcinoma (n = 20) was conducted. Eight CNNs (ResNet-50, InceptionV3, VGG16, Xception, MobileNet, DenseNet121, EfficientNetB0, and EfficientNetV2B0) were trained and evaluated for accuracy, sensitivity, specificity, F1-score, and AUC. Performance was measured in training, validation, and test subsets. Accuracy and loss curves were also presented. RESULTS: InceptionV3 demonstrated the best overall performance, with the lowest loss (1.39), highest accuracy (0.81), sensitivity (0.90), and F1-score (0.81). VGG16 achieved the highest AUC (0.86) and precision (0.77). DenseNet121 showed the lowest performance in terms of accuracy (0.65) and F1-Score (0.52), but the highest specificity (0.85). CONCLUSION: This proof-of-concept study suggests that convolutional neural networks may be feasible tools to support the microscopic differentiation between acinic cell carcinoma and secretory carcinoma. The performance of these models critically depends on the size of the dataset and the quality of annotations. The findings should be interpreted cautiously given the limited dataset and potential sources of bias. Further validation with larger, multicenter datasets is needed before any clinical application can be considered.

Authors

Keywords

No keywords available for this article.