Iterative Semantic Refinement: A Vision Language Model-Driven Approach to Auto-Regressive Image Editing

Date

2025

Publisher

Institute of Electrical and Electronics Engineers Inc.

Abstract

Recent advancements in Visual Language Models (VLMs) have significantly improved text-to-image generation by enabling more nuanced and semantically rich textual prompts, highlighting the transformative impact of these models on image synthesis. In this work, we leverage these capabilities to develop an auto-regressive editing framework that systematically refines images through careful, step-by-step modifications. Our method balances subtle adjustments with meaningful semantic shifts, ensuring that each editing stage preserves the core context while introducing precise variations. By integrating improvements from controllable image editing models, we enhance the precision and stability of our edits and demonstrate the effectiveness of our approach in maintaining visual coherence. This integration results in a strategy for producing diverse, high-quality outputs that align with finely tuned semantic goals. Centered on the strength of VLMs, this framework opens up a new paradigm for image synthesis, offering a blend of creative flexibility and consistent contextual fidelity that holds promise for a variety of applications requiring intricate and controlled visual transformations. © 2025 Elsevier B.V., All rights reserved.

Keywords

Auto-Regressive Editing, Controllable Image Editing, Image Synthesis, Semantic Image Editing, Visual Language Models, Blending, Computational Linguistics, Computer Vision, Iterative Methods, Semantics, Visual Languages, Auto-Regressive, Image Editing, Semantic Images, Semantic Refinement, Image Enhancement

Source

-- 9th International Symposium on Innovative Approaches in Smart Technologies, ISAS 2025 -- Gaziantep -- 211342
