Research Article Open Access

Transforming Retinal Diagnostics: Advanced Detection of Diabetic Retinopathy Using Vision Transformers and Capsule Networks

Vishal Sharma1, Rishu 1, Vinay Kukreja1, Ayush Dogra1 and Bhawna Goyal2,3
  • 1 Centre for Research Impact and Outcome, Chitkara University Institute of Engineering and Technology, Chitkara University, Punjab, India
  • 2 Marwadi University Research Centre Derailment of Engineering, Rajkot, Gujarat, India
  • 3 Faculty of Engineering, Sohar University, Sohar, Oman

Abstract

Diabetic Retinopathy (DR), nowadays is one of the leading causes of blindness worldwide, it is a severe complication of diabetes mellitus that affects the retina blood vessels. Accurate diagnosis depends on early detection of DR. The study aims to develop a hybrid model that is the combination of a Vision Transformer and Capsule Network (ViT-CapsNet) to classify the DR at early stages. The ViT-CapsNet model is proposed to detect the DR from the retinal images at the early stage. The eyepieces public dataset is used. The data preprocessing takes place in which the resizing and data augmentation are used to improve the quality and increase the diversity of the data. Then, the Vision transformer extracts the global features from the retinal fundus image while the capsule network preserves the spatial relationships and hierarchies within the data, also classified into different classes that are No DR, Mild DR, Moderate DR, Severe DR and Proliferative DR. The ViT-CapsNet model has a precision, recall and F1-Score with values of 0.92, 0.91 and 0.91 respectively. The ViT-CapsNet model shows an accuracy of 94% compared to the other traditional methods such as CNN (88%), ResNet (90%), and EfficientNet (92%). The AUC-ROC scores for classes No DR, Mild DR, Moderate DR, Severe DR, and Proliferative DR are 0.56, 0.48, 0.44, 0.45, and 0.51 respectively.

Journal of Computer Science
Volume 21 No. 2, 2025, 304-321

DOI: https://doi.org/10.3844/jcssp.2025.304.321

Submitted On: 25 September 2024 Published On: 14 January 2025

How to Cite: Sharma, V., R., Kukreja, V., Dogra, A. & Goyal, B. (2025). Transforming Retinal Diagnostics: Advanced Detection of Diabetic Retinopathy Using Vision Transformers and Capsule Networks. Journal of Computer Science, 21(2), 304-321. https://doi.org/10.3844/jcssp.2025.304.321

  • 146 Views
  • 48 Downloads
  • 0 Citations

Download

Keywords

  • Vision Transformers
  • Capsule Networks
  • Diabetic Retinopathy
  • Retinal Images
  • Deep Learning