Diffusion-Based Data Augmentation Methodology for Improved Performance in Ocular Disease Diagnosis Using Retinography Images

dc.contributor.author Aktas, Burak
dc.contributor.author Ates, Doga Deniz
dc.contributor.author Duzyel, Okan
dc.contributor.author Gumus, Abdurrahman
dc.date.accessioned 2024-12-25T20:49:30Z
dc.date.available 2024-12-25T20:49:30Z
dc.date.issued 2024
dc.description.abstract Deep learning models, integral components of contemporary technological landscapes, exhibit enhanced learning capabilities with larger datasets. Traditional data augmentation techniques, while effective in generating new data, have limitations, especially in fields like ocular disease diagnosis. In response, alternative augmentation approaches, including the utilization of generative AI, have emerged. In our study, we employed a diffusion-based model (Stable Diffusion) to synthesize data by faithfully recreating crucial vascular structures in the retina, vital for detecting eye diseases by using the Ocular Disease Intelligent Recognition dataset. Our goal was to augment retinography images for ocular disease diagnosis using diffusion-based models, optimizing the outputs of the fine-tuned Stable Diffusion model, and ensuring the generated data closely resembles real-world scenarios. This strategic approach resulted in improved performance in classification models and augmentation outperformed traditional methods, exhibiting high precision rates ranging from 85% to 76.2% and recall values of 86%, and 75% for 5 classes. Beyond performance enhancement, we demonstrated that the inclusion of synthetic data, coupled with data reduction using the t-SNE method, effectively addressed dataset imbalance. As a result of synthetic data addition, notable increases of 3.4% in the precision metric and 12.8% in the recall metric were observed in the 7-class case. Strategically synthesizing data addressed underrepresented classes, creating a balanced dataset for comprehensive model learning. Surpassing performance improvements, this approach underscores synthetic data's ability to overcome the limitations of traditional methods, particularly in sensitive medical domains like ocular disease diagnosis, ensuring accurate classification. The codes of the study will be shared on GitHub in a way that benefits everyone interested: https://github.com/miralab-ai/generative-data-augmentation. en_US
dc.identifier.doi 10.1007/s13042-024-02485-w
dc.identifier.issn 1868-8071
dc.identifier.issn 1868-808X
dc.identifier.scopus 2-s2.0-85212076581
dc.identifier.uri https://doi.org/10.1007/s13042-024-02485-w
dc.identifier.uri https://hdl.handle.net/11147/15200
dc.language.iso en en_US
dc.publisher Springer Heidelberg en_US
dc.relation.ispartof International Journal of Machine Learning and Cybernetics
dc.rights info:eu-repo/semantics/closedAccess en_US
dc.subject Image classification en_US
dc.subject Data augmentation en_US
dc.subject Diffusion-based models en_US
dc.subject t-SNE en_US
dc.subject Medical image synthesis en_US
dc.subject Dataset imbalance en_US
dc.title Diffusion-Based Data Augmentation Methodology for Improved Performance in Ocular Disease Diagnosis Using Retinography Images en_US
dc.type Article en_US
dspace.entity.type Publication
gdc.author.scopusid 59472597500
gdc.author.scopusid 59473279900
gdc.author.scopusid 58135677500
gdc.author.scopusid 35315599800
gdc.bip.impulseclass C5
gdc.bip.influenceclass C5
gdc.bip.popularityclass C5
gdc.coar.access metadata only access
gdc.coar.type text::journal::journal article
gdc.collaboration.industrial false
gdc.description.department Izmir Institute of Technology en_US
gdc.description.departmenttemp [Aktas, Burak; Ates, Doga Deniz; Duzyel, Okan; Gumus, Abdurrahman] Izmir Inst Technol, Dept Elect & Elect Engn, Izmir, Turkiye en_US
gdc.description.endpage 3864
gdc.description.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality Q2
gdc.description.startpage 3843
gdc.description.volume 16
gdc.description.woscitationindex Science Citation Index Expanded
gdc.description.wosquality Q3
gdc.identifier.openalex W4405272991
gdc.identifier.wos WOS:001375441800001
gdc.index.type WoS
gdc.index.type Scopus
gdc.oaire.diamondjournal false
gdc.oaire.impulse 2.0
gdc.oaire.influence 2.762401E-9
gdc.oaire.isgreen false
gdc.oaire.popularity 3.0050402E-9
gdc.oaire.publicfunded false
gdc.openalex.collaboration National
gdc.openalex.fwci 4.91444291
gdc.openalex.normalizedpercentile 0.91
gdc.openalex.toppercent TOP 10%
gdc.opencitations.count 0
gdc.plumx.mendeley 11
gdc.plumx.scopuscites 5
gdc.scopus.citedcount 5
gdc.wos.citedcount 4
relation.isAuthorOfPublication.latestForDiscovery f91aafc1-9437-405b-bbf7-51344a6961f7
relation.isOrgUnitOfPublication.latestForDiscovery 9af2b05f-28ac-4018-8abe-a4dfe192da5e

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Name:
s13042-024-02485-w.pdf
Size:
7.02 MB
Format:
Adobe Portable Document Format
Description:
article