Precision Diagnosis of Glaucoma with VLLM Ensemble Deep Learning

Soohyun Wang, Byoungkug Kim, Jiheon Kang, Doo Seop Eom

Research output: Contribution to journalArticlepeer-review

Abstract

This paper focuses on improving automated approaches to glaucoma diagnosis, a severe disease that leads to gradually narrowing vision and potentially blindness due to optic nerve damage occurring without the patient’s awareness. Early diagnosis is crucial. By utilizing advanced deep learning technologies and robust image processing capabilities, this study employed four types of input data (retina fundus image, region of interest (ROI), vascular region of interest (VROI), and color palette images) to reflect structural issues. We addressed the issue of data imbalance with a modified loss function and proposed an ensemble model based on the vision large language model (VLLM), which improved the accuracy of glaucoma classification. The results showed that the models developed for each dataset achieved 1% to 10% higher accuracy and 8% to 29% improved sensitivity compared to conventional single-image analysis. On the REFUGE dataset, we achieved a high accuracy of 0.9875 and a sensitivity of 0.9. Particularly in the ORIGA dataset, which is challenging in terms of achieving high accuracy, we confirmed a significant increase, with an 11% improvement in accuracy and a 29% increase in sensitivity. This research can significantly contribute to the early detection and management of glaucoma, indicating potential clinical applications. These advancements will not only further the development of glaucoma diagnostic technologies but also play a vital role in improving patients’ quality of life.

Original languageEnglish
Article number4588
JournalApplied Sciences (Switzerland)
Volume14
Issue number11
DOIs
Publication statusPublished - 2024 Jun

Bibliographical note

Publisher Copyright:
© 2024 by the authors.

Keywords

  • color palette
  • computer aided diagnosis
  • data imbalance
  • deep learning
  • glaucoma classification
  • medical image analysis
  • ROI
  • vision large language model
  • VROI

ASJC Scopus subject areas

  • General Materials Science
  • Instrumentation
  • General Engineering
  • Process Chemistry and Technology
  • Computer Science Applications
  • Fluid Flow and Transfer Processes

Fingerprint

Dive into the research topics of 'Precision Diagnosis of Glaucoma with VLLM Ensemble Deep Learning'. Together they form a unique fingerprint.

Cite this