Swin Transformer–Driven Cervical Cell Classification with Explainable AI and Web-Based Screening
DOI:
https://doi.org/10.32996/jmhs.2026.7.5.5Keywords:
Cervical cancer screening, Pap smear images, vision transformers, deep learning, explainable AI, Grad-CAM, medical image analysisAbstract
Accurate interpretation of cervical cytology images is essential for effective cervical cancer screening, yet manual assessment is time-consuming and subject to observer variability. This paper presents a transformer-based deep learning framework for automated cervical cell classification using Pap smear images. We conduct a systematic evaluation of modern attention-driven architectures, including MaxViT, Swin Transformer, EfficientFormer, and HorNet, under a unified preprocessing and training pipeline designed to handle staining variability and class imbalance. To enhance model transparency and clinical trust, explainable AI is integrated via Grad-CAM, enabling visual localization of cytomorphological regions that drive model decisions. Experiments on the Herlev and SIPaKMeD datasets demonstrate that the proposed Swin Transformer achieves superior and consistent performance, reaching 99.27% accuracy on Herlev and 98.82% accuracy on SIPaKMeD, with high MCC and PR-AUC values. In addition, a lightweight web-based application is developed to support dataset selection, real-time inference, confidence reporting, and visual explanation. The results confirm that hierarchical transformer architectures can deliver accurate, interpretable, and deployable solutions for computer-aided cervical cancer screening.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 https://creativecommons.org/licenses/by/4.0/

This work is licensed under a Creative Commons Attribution 4.0 International License.

Aims & scope
Call for Papers
Article Processing Charges
Publications Ethics
Google Scholar Citations
Recruitment