文献收藏与分享平台

尹志 (2025-03-31 15:06):

#paper：doi：doi.org/10.48550/arXiv.2502.11974, Image Inversion: A Survey from GANs to Diffusion and Beyond(2025). 综述了image inversion常见的算法模型，很新，主要介绍了GAN和diffusion模型，也提了DiT和Rectified Flow框架。image inversion的核心问题涉及latent space, 对其它生成式AI的问题都非常重要。

arXiv, 2025-02-17T16:20:48Z. DOI: 10.48550/arXiv.2502.11974

Image Inversion: A Survey from GANs to Diffusion and Beyond

翻译

Yinan Chen, Jiangning Zhang, Yali Bi, Xiaobin Hu, Teng Hu, Zhucun Xue, Ran Yi, Yong Liu, Ying Tai

Abstract:

Image inversion is a fundamental task in generative models, aiming to mapimages back to their latent representations to enable downstream applicationssuch as editing, restoration, and style transfer. This paper provides acomprehensive review of the latest advancements in image inversion techniques,focusing on two main paradigms: Generative Adversarial Network (GAN) inversionand diffusion model inversion. We categorize these techniques based on theiroptimization methods. For GAN inversion, we systematically classify existingmethods into encoder-based approaches, latent optimization approaches, andhybrid approaches, analyzing their theoretical foundations, technicalinnovations, and practical trade-offs. For diffusion model inversion, weexplore training-free strategies, fine-tuning methods, and the design ofadditional trainable modules, highlighting their unique advantages andlimitations. Additionally, we discuss several popular downstream applicationsand emerging applications beyond image tasks, identifying current challengesand future research directions. By synthesizing the latest developments, thispaper aims to provide researchers and practitioners with a valuable referenceresource, promoting further advancements in the field of image inversion. Wekeep track of the latest works at https://github.com/RyanChenYN/ImageInversion

翻译

Related Links:

https://doi.org/10.48550/arXiv.2502.11974