Adaptive Non-uniform Timestep Sampling for Accelerating Diffusion Model Training

  • Myunsoo Kim*
  • , Donghyeon Ki
  • , Seong Woong Shim
  • , Byung Jun Lee
  • *Corresponding author for this work

Research output: Contribution to journalConference articlepeer-review

Abstract

As a highly expressive generative model, diffusion models have demonstrated exceptional success across various domains, including image generation, natural language processing, and combinatorial optimization. However, as data distributions grow more complex, training these models to convergence becomes increasingly computationally intensive. While diffusion models are typically trained using uniform timestep sampling, our research shows that the variance in stochastic gradients varies significantly across timesteps, with high-variance timesteps becoming bottlenecks that hinder faster convergence. To address this issue, we introduce a non-uniform timestep sampling method that prioritizes these more critical timesteps. Our method tracks the impact of gradient updates on the objective for each timestep, adaptively selecting those most likely to minimize the objective effectively. Experimental results demonstrate that this approach not only accelerates the training process, but also leads to improved performance at convergence. Furthermore, our method shows robust performance across various datasets, scheduling strategies, and diffusion architectures, outperforming previously proposed timestep sampling and weighting heuristics that lack this degree of robustness.

Original languageEnglish
Pages (from-to)2513-2522
Number of pages10
JournalProceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
DOIs
Publication statusPublished - 2025
Event2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2025 - Nashville, United States
Duration: 2025 Jun 112025 Jun 15

Bibliographical note

Publisher Copyright:
© 2025 IEEE.

Keywords

  • diffusion models
  • image generation

ASJC Scopus subject areas

  • Software
  • Computer Vision and Pattern Recognition

Fingerprint

Dive into the research topics of 'Adaptive Non-uniform Timestep Sampling for Accelerating Diffusion Model Training'. Together they form a unique fingerprint.

Cite this