尹志 (2025-03-31 15:06):
#paper:doi:doi.org/10.48550/arXiv.2502.11974, Image Inversion: A Survey from GANs to Diffusion and Beyond(2025). 综述了image inversion常见的算法模型,很新,主要介绍了GAN和diffusion模型,也提了DiT和Rectified Flow框架。image inversion的核心问题涉及latent space, 对其它生成式AI的问题都非常重要。
arXiv, 2025-02-17T16:20:48Z. DOI: 10.48550/arXiv.2502.11974
Image Inversion: A Survey from GANs to Diffusion and Beyond
Yinan Chen, Jiangning Zhang, Yali Bi, Xiaobin Hu, Teng Hu, Zhucun Xue, Ran Yi, Yong Liu, Ying Tai
Abstract:
Image inversion is a fundamental task in generative models, aiming to map<br>images back to their latent representations to enable downstream applications<br>such as editing, restoration, and style transfer. This paper provides a<br>comprehensive review of the latest advancements in image inversion techniques,<br>focusing on two main paradigms: Generative Adversarial Network (GAN) inversion<br>and diffusion model inversion. We categorize these techniques based on their<br>optimization methods. For GAN inversion, we systematically classify existing<br>methods into encoder-based approaches, latent optimization approaches, and<br>hybrid approaches, analyzing their theoretical foundations, technical<br>innovations, and practical trade-offs. For diffusion model inversion, we<br>explore training-free strategies, fine-tuning methods, and the design of<br>additional trainable modules, highlighting their unique advantages and<br>limitations. Additionally, we discuss several popular downstream applications<br>and emerging applications beyond image tasks, identifying current challenges<br>and future research directions. By synthesizing the latest developments, this<br>paper aims to provide researchers and practitioners with a valuable reference<br>resource, promoting further advancements in the field of image inversion. We<br>keep track of the latest works at https://github.com/RyanChenYN/ImageInversion
回到顶部