title: Isometric Representation Learning for Disentangled Latent Space of Diffusion Models

authors: Jaehoon Hahm, Junho Lee, Sunghyun Kim, Joonseok Lee

year: 2023

doi: None

zotero: zotero://select/items/@hahmIsometricRepresentationLearning2023


Isometric Representation Learning for Disentangled Latent Space of Diffusion Models

Diffusion models have made remarkable progress in capturing and reproducing real-world data. Despite their success and further potential, however, their latent space, the core of diffusion models still remains underexplored. In fact, the latent spaces of existing diffusion models still do not perfectly align with the human perception, entangling multiple concepts in a distorted space. In this paper, we present \textit{Isometric Diffusion}, equipping a diffusion model with isometric representation learning to better reflect human intuition and understanding of visual data. Specifically, we propose a novel loss to promote isometry between the latent space and the data manifold, enabling a semantically clear and geometrically sound latent space. This approach allows smoother interpolation and more precise control over attributes directly in the latent space. Our extensive experiments demonstrate the effectiveness of Isometric Diffusion, marking a significant advance in aligning latent spaces with perceptual semantics. This work paves the way for fine-grained data generation and manipulation.