Skip to main navigation Skip to search Skip to main content

Conditional Balance: Improving Multi-Conditioning Trade-Offs in Image Generation

  • Nadav Z. Cohen
  • , Oron Nir
  • , Ariel Shamir

Research output: Contribution to journalConference articlepeer-review

1 Scopus citations

Abstract

Balancing content fidelity and artistic style is a pivotal challenge in image generation. While traditional style transfer methods and modern Denoising Diffusion Probabilistic Models (DDPMs) strive to achieve this balance, they often struggle to do so without sacrificing either style, content, or sometimes both. This work addresses this challenge by analyzing the ability of DDPMs to maintain content and style equilibrium. We introduce a novel method to identify sensitivities within the DDPM attention layers, identifying specific layers that correspond to different stylistic aspects. By directing conditional inputs only to these sensitive layers, our approach enables fine-grained control over style and content, significantly reducing issues arising from over-constrained inputs. Our findings demonstrate that this method enhances recent stylization techniques by better aligning style and content, ultimately improving the quality of generated visual content.

Original languageEnglish
Pages (from-to)2641-2650
Number of pages10
JournalProceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
DOIs
StatePublished - 1 Jan 2025
Externally publishedYes
Event2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2025 - Nashville, United States
Duration: 11 Jun 202515 Jun 2025

Keywords

  • art
  • conditioning
  • diffusion
  • generative models
  • style

ASJC Scopus subject areas

  • Software
  • Computer Vision and Pattern Recognition

Fingerprint

Dive into the research topics of 'Conditional Balance: Improving Multi-Conditioning Trade-Offs in Image Generation'. Together they form a unique fingerprint.

Cite this