Iterative Semantic Refinement: A Vision Language Model-Driven Approach to Auto-Regressive Image Editing

dc.contributor.author Yavuzcan, Ege
dc.contributor.author Kus, Omer
dc.contributor.author Gumus, Abdurrahman
dc.date.accessioned 2025-09-25T18:56:01Z
dc.date.available 2025-09-25T18:56:01Z
dc.date.issued 2025
dc.description.abstract Recent advancements in Visual Language Models (VLMs) have significantly improved text-to-image generation by enabling more nuanced and semantically rich textual prompts, highlighting the transformative impact of these models on image synthesis. In this work, we leverage these robust capabilities to develop an auto-regressive editing framework that systematically refines images through careful, step-by-step modifications. Our method concisely balances subtle adjustments with meaningful semantic shifts, ensuring that each editing stage preserves the core context while introducing precise variations. By integrating improvements from controllable image editing models, we enhance the precision and stability of our edits and demonstrate the effectiveness of our approach in maintaining visual coherence. This integration results in a powerful strategy for producing diverse, high-quality outputs that align with finely tuned semantic goals. Centered on the strength of VLMs, this framework opens up a new paradigm for image synthesis, offering a blend of creative flexibility and consistent contextual fidelity that holds promise for a variety of applications requiring intricate and controlled visual transformations. © 2025 Elsevier B.V., All rights reserved. en_US
dc.identifier.doi 10.1109/ISAS66241.2025.11101889
dc.identifier.isbn 9798331514822
dc.identifier.scopus 2-s2.0-105014940484
dc.identifier.uri https://doi.org/10.1109/ISAS66241.2025.11101889
dc.identifier.uri https://hdl.handle.net/11147/18442
dc.language.iso en en_US
dc.publisher Institute of Electrical and Electronics Engineers Inc. en_US
dc.relation.ispartof -- 9th International Symposium on Innovative Approaches in Smart Technologies, ISAS 2025 -- Gaziantep -- 211342 en_US
dc.rights info:eu-repo/semantics/closedAccess en_US
dc.subject Auto-Regressive Editing en_US
dc.subject Controllable Image Editing en_US
dc.subject Image Synthesis en_US
dc.subject Semantic Image Editing en_US
dc.subject Visual Language Models en_US
dc.subject Blending en_US
dc.subject Computational Linguistics en_US
dc.subject Computer Vision en_US
dc.subject Iterative Methods en_US
dc.subject Semantics en_US
dc.subject Visual Languages en_US
dc.subject Auto-Regressive en_US
dc.subject Auto-Regressive Editing en_US
dc.subject Controllable Image Editing en_US
dc.subject Image Editing en_US
dc.subject Images Synthesis en_US
dc.subject Semantic Image Editing en_US
dc.subject Semantic Images en_US
dc.subject Semantic Refinement en_US
dc.subject Visual Language Model en_US
dc.subject Image Enhancement en_US
dc.title Iterative Semantic Refinement: A Vision Language Model-Driven Approach to Auto-Regressive Image Editing
dc.type Conference Object en_US
dspace.entity.type Publication
gdc.author.scopusid 60083885800
gdc.author.scopusid 60082919500
gdc.author.scopusid 35315599800
gdc.coar.type text::conference output
gdc.collaboration.industrial false
gdc.description.department İzmir Institute of Technology en_US
gdc.description.departmenttemp [Yavuzcan] Ege, Department of Electronics and Communication Engineering, Izmir Yüksek Teknoloji Enstitüsü, Izmir, Turkey; [Kus] Omer, Department of Electronics and Communication Engineering, Izmir Yüksek Teknoloji Enstitüsü, Izmir, Turkey; [Gumus] Abdurrahman, Department of Electronics and Communication Engineering, Izmir Yüksek Teknoloji Enstitüsü, Izmir, Turkey en_US
gdc.description.publicationcategory Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality N/A
gdc.description.wosquality N/A
gdc.identifier.openalex W4413179159
gdc.index.type Scopus
gdc.openalex.collaboration National
gdc.openalex.fwci 0.0
gdc.openalex.normalizedpercentile 0.29
gdc.opencitations.count 0
gdc.plumx.scopuscites 0
gdc.scopus.citedcount 0
relation.isAuthorOfPublication.latestForDiscovery ce5ce1e2-17ef-4da2-946d-b7a26e44e461
relation.isOrgUnitOfPublication.latestForDiscovery 9af2b05f-28ac-4003-8abe-a4dfe192da5e

Files