Semantic Guided Autoregressive Diffusion Based Data Augmentation Using Visual Instructions

dc.contributor.author Yavuzcan, Ege
dc.contributor.author Kus, Omer
dc.contributor.author Gumus, Abdurrahman
dc.date.accessioned 2025-09-25T18:56:06Z
dc.date.available 2025-09-25T18:56:06Z
dc.date.issued 2025
dc.description.abstract Recent breakthroughs in generative image models, especially those based on diffusion techniques, have radically transformed the landscape of text-guided image synthesis by delivering exceptional fidelity and detailed semantic control. In this study, we present an iterative editing framework that harnesses the inherent strengths of these generative models to progressively refine images with precision. Our approach begins by generating diverse textual descriptions from an initial image, from which the most effective prompt is selected to drive further refinement through a fine-tuned Stable Diffusion process. This pipeline, as detailed in our flow diagram, orchestrates a series of controlled image modifications that preserve the original context while accommodating deliberate stylistic and semantic adjustments. By cycling the augmented output back into the system, our method achieves a harmonious balance between innovation and consistency, paving the way for highquality, context-aware visual transformations. This dynamic, auto-regressive strategy underscores the transformative potential of modern image generation models for applications that require detailed, controlled creative expression. The code is available on Github. © 2025 Elsevier B.V., All rights reserved. en_US
dc.identifier.doi 10.1109/ISAS66241.2025.11101945
dc.identifier.isbn 9798331514822
dc.identifier.scopus 2-s2.0-105014936486
dc.identifier.uri https://doi.org/10.1109/ISAS66241.2025.11101945
dc.identifier.uri https://hdl.handle.net/11147/18453
dc.language.iso en en_US
dc.publisher Institute of Electrical and Electronics Engineers Inc. en_US
dc.relation.ispartof -- 9th International Symposium on Innovative Approaches in Smart Technologies, ISAS 2025 -- Gaziantep -- 211342 en_US
dc.rights info:eu-repo/semantics/closedAccess en_US
dc.subject Auto-Regressive Refinement en_US
dc.subject Diffusion-Based Data Augmentation en_US
dc.subject Generative Image Models en_US
dc.subject Iterative Image Editing en_US
dc.subject Text-Guided Image Synthesis en_US
dc.subject Iterative Methods en_US
dc.subject Semantics en_US
dc.subject Visualization en_US
dc.subject Auto-Regressive en_US
dc.subject Auto-Regressive Refinement en_US
dc.subject Data Augmentation en_US
dc.subject Diffusion-Based Data Augmentation en_US
dc.subject Generative Image Model en_US
dc.subject Guided Images en_US
dc.subject Image Editing en_US
dc.subject Image Modeling en_US
dc.subject Images Synthesis en_US
dc.subject Iterative Image Editing en_US
dc.subject Text-Guided Image Synthesis en_US
dc.subject Diffusion en_US
dc.title Semantic Guided Autoregressive Diffusion Based Data Augmentation Using Visual Instructions
dc.type Conference Object en_US
dspace.entity.type Publication
gdc.author.scopusid 60083885800
gdc.author.scopusid 60082919500
gdc.author.scopusid 35315599800
gdc.coar.type text::conference output
gdc.collaboration.industrial false
gdc.description.department İzmir Institute of Technology en_US
gdc.description.departmenttemp [Yavuzcan] Ege, Department of Electronics and Communication Engineering, Izmir Yüksek Teknoloji Enstitüsü, Izmir, Turkey; [Kus] Omer, Department of Electronics and Communication Engineering, Izmir Yüksek Teknoloji Enstitüsü, Izmir, Turkey; [Gumus] Abdurrahman, Department of Electronics and Communication Engineering, Izmir Yüksek Teknoloji Enstitüsü, Izmir, Turkey en_US
gdc.description.publicationcategory Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality N/A
gdc.description.wosquality N/A
gdc.identifier.openalex W4413178875
gdc.index.type Scopus
gdc.openalex.collaboration National
gdc.openalex.fwci 0.0
gdc.openalex.normalizedpercentile 0.29
gdc.opencitations.count 0
gdc.plumx.scopuscites 0
gdc.scopus.citedcount 0
relation.isAuthorOfPublication.latestForDiscovery ce5ce1e2-17ef-4da2-946d-b7a26e44e461
relation.isOrgUnitOfPublication.latestForDiscovery 9af2b05f-28ac-4003-8abe-a4dfe192da5e

Files