NOISE MAP GUIDANCE: INVERSION WITH SPATIAL CONTEXT FOR REAL IMAGE EDITING

Hansam Cho, Jonghyun Lee, Seoung Bum Kim, Tae Hyun Oh, Yonghyun Jeong

    Research output: Contribution to conferencePaperpeer-review

    Abstract

    Text-guided diffusion models have become a popular tool in image synthesis, known for producing high-quality and diverse images. However, their application to editing real images often encounters hurdles primarily due to the text condition deteriorating the reconstruction quality and subsequently affecting editing fidelity. Null-text Inversion (NTI) has made strides in this area, but it fails to capture spatial context and requires computationally intensive per-timestep optimization. Addressing these challenges, we present NOISE MAP GUIDANCE (NMG), an inversion method rich in a spatial context, tailored for real-image editing. Significantly, NMG achieves this without necessitating optimization, yet preserves the editing quality. Our empirical investigations highlight NMG's adaptability across various editing techniques and its robustness to variants of DDIM inversions.

    Original languageEnglish
    Publication statusPublished - 2024
    Event12th International Conference on Learning Representations, ICLR 2024 - Hybrid, Vienna, Austria
    Duration: 2024 May 72024 May 11

    Conference

    Conference12th International Conference on Learning Representations, ICLR 2024
    Country/TerritoryAustria
    CityHybrid, Vienna
    Period24/5/724/5/11

    Bibliographical note

    Publisher Copyright:
    © 2024 12th International Conference on Learning Representations, ICLR 2024. All rights reserved.

    ASJC Scopus subject areas

    • Language and Linguistics
    • Computer Science Applications
    • Education
    • Linguistics and Language

    Fingerprint

    Dive into the research topics of 'NOISE MAP GUIDANCE: INVERSION WITH SPATIAL CONTEXT FOR REAL IMAGE EDITING'. Together they form a unique fingerprint.

    Cite this