来自杂志 Machine Intelligence Research 的文献。
当前共找到 1 篇文献分享。
1.
林海onrush
(2025-11-30 19:22):
#paper, DOI: 10.1007/s11633-025-1562-4, DPM-Solver++: Fast Solver for Guided Sampling of Diffusion Probabilistic Models。 这篇论文针对扩散模型在使用大尺度 guidance(如 classifier/classifier-free guidance)时,高阶 ODE 求解器不稳定、质量反而不如一阶 DDIM 的问题,提出了基于“数据预测”参数化的高阶采样方法 DPM-Solver++。作者先把原本基于噪声预测的扩散 ODE 重新写成基于数据预测 (x_\theta) 的形式,推导出更稳定的二阶单步和多步求解器(2S/2M),并在像素空间模型中结合 dynamic thresholding 来缓解大 guidance 造成的像素爆掉和训练-测试分布不匹配;同时还给出了对应的 SDE 版本 SDE-DPM-Solver++,把 DDIM 的部分变体统一到一个框架下。实验表明,在 ImageNet 有指导采样和 Stable Diffusion 文本生成图像等任务上,DPM-Solver++ 在仅用 10–20 次网络前向传播的情况下就能达到或接近以往数百步采样的质量,相比现有训练-free 快速采样方法更加稳定、高效。
Machine Intelligence Research,
2025-8.
DOI: 10.1007/s11633-025-1562-4
Abstract:
Abstract Diffusion probabilistic models (DPMs) have achieved impressive success in high-resolution image synthesis, especially in recent large-scale text-to-image generation applications. An essential technique for improving the sample quality of DPMs …
>>>
Abstract Diffusion probabilistic models (DPMs) have achieved impressive success in high-resolution image synthesis, especially in recent large-scale text-to-image generation applications. An essential technique for improving the sample quality of DPMs is guided sampling, which usually needs a large guidance scale to obtain the best sample quality. The commonly-used fast sampler for guided sampling is denoising diffusion implicit models (DDIM), a first-order diffusion ordinary differential equation (ODE) solver that generally needs 100 to 250 steps for high-quality samples. Although recent works propose dedicated high-order solvers and achieve a further speedup for sampling without guidance, their effectiveness for guided sampling has not been well-tested before. In this work, we demonstrate that previous high-order fast samplers suffer from instability issues, and they even become slower than DDIM when the guidance scale grows larger. To further speed up guided sampling, we propose DPM-Solver++, a high-order solver for the guided sampling of DPMs. DPM-Solver++ solves the diffusion ODE with the data prediction model and adopts thresholding methods to keep the solution matches training data distribution. We further propose a multistep variant of DPM-Solver++ to address the instability issue by reducing the effective step size. Experiments show that DPM-Solver++ can generate high-quality samples within only 15 to 20 steps for guided sampling by pixel-space and latent-space DPMs.
<<<
翻译