DreamCatalyst: Fast and High-Quality 3D Editing via Controlling Editability and Identity Preservation

Graduate School of Artificial Intelligence, KAIST

* Indicates equal contribution

ICLR 2025

Abstract

Score distillation sampling (SDS) has emerged as an effective framework in text-driven 3D editing tasks, leveraging diffusion models for 3D consistent editing. However, existing SDS-based 3D editing methods suffer from long training times and produce low-quality results. We identify that the root cause of this performance degradation is their conflict with the sampling dynamics of diffusion models. Addressing this conflict allows us to treat SDS as a diffusion reverse process for 3D editing via sampling from data space. In contrast, existing methods naively distill the score function using diffusion models. From these insights, we propose DreamCatalyst, a novel framework that considers these sampling dynamics in the SDS framework. Specifically, we devise the optimization process of our DreamCatalyst to approximate the diffusion reverse process in editing tasks, thereby aligning with diffusion sampling dynamics. As a result, DreamCatalyst successfully reduces training time and improves editing quality. Our method offers two modes: (1) a fast mode that edits Neural Radiance Fields (NeRF) scenes approximately 23 times faster than current state-of-the-art NeRF editing methods, and (2) a high-quality mode that produces superior results about 8 times faster than these methods. Notably, our high-quality mode outperforms current state-of-the-art NeRF editing methods in terms of both speed and quality. DreamCatalyst also surpasses the state-of-the-art 3D Gaussian Splatting (3DGS) editing methods, establishing itself as an effective and model-agnostic 3D editing solution.
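For readers unfamiliar with SDS, the following is a minimal sketch of the generic SDS update that the abstract describes as "naively distilling the score function". It is not DreamCatalyst's objective and not the authors' code. Everything in it is an illustrative assumption: the "diffusion model" is an exact noise predictor for a toy Gaussian data distribution, the optimized parameters are the sample itself rather than a NeRF or 3DGS scene, the weighting w(t) is set to 1, and the names `eps_pred`, `sds_gradient`, and `run_sds` are hypothetical.

```python
import numpy as np

def eps_pred(x_t, t, mu, data_std):
    """Exact noise prediction when the data is N(mu, data_std^2 I).

    Toy stand-in for a pretrained diffusion model (assumption for illustration).
    Noise schedule: sigma_t = t, alpha_t = sqrt(1 - t^2).
    """
    alpha, sigma = np.sqrt(1.0 - t**2), t
    var_t = (alpha * data_std) ** 2 + sigma**2  # variance of the noised marginal
    return sigma * (x_t - alpha * mu) / var_t

def sds_gradient(x, mu, data_std, t, eps):
    """Generic SDS gradient with w(t) = 1: predicted noise minus injected noise."""
    alpha, sigma = np.sqrt(1.0 - t**2), t
    x_t = alpha * x + sigma * eps  # forward-diffuse the current sample
    return eps_pred(x_t, t, mu, data_std) - eps

def run_sds(x0, mu, data_std=0.1, steps=2000, lr=0.05, seed=0):
    """Plain gradient descent on x using SDS gradients at random timesteps."""
    rng = np.random.default_rng(seed)
    x = np.array(x0, dtype=float)
    for _ in range(steps):
        t = rng.uniform(0.02, 0.98)          # random noise level per step
        eps = rng.standard_normal(x.shape)   # fresh Gaussian noise per step
        x -= lr * sds_gradient(x, mu, data_std, t, eps)
    return x

if __name__ == "__main__":
    target = np.array([1.0, 0.5])
    print(run_sds(np.array([3.0, -2.0]), target))  # ends near the target mean
```

In this toy setting the timestep is drawn at random at every step, so the optimization does not follow the ordered noise levels of a diffusion reverse process. That mismatch is the kind of conflict with sampling dynamics the abstract refers to.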

BibTeX

@misc{kim2024dreamcatalystfasthighquality3d,
  title={DreamCatalyst: Fast and High-Quality 3D Editing via Controlling Editability and Identity Preservation},
  author={Jiwook Kim and Seonho Lee and Jaeyo Shin and Jiho Choi and Hyunjung Shim},
  year={2024},
  eprint={2407.11394},
  archivePrefix={arXiv},
  primaryClass={cs.CV},
  url={https://arxiv.org/abs/2407.11394},
}

Architecture of DreamCatalyst

Source

"Turn him into Batman"

"Turn him into Joker"

"Turn him into Storm Trooper"

"Turn him into a bald"

"Turn him into Hulk"

"Turn him into a clown"

"Turn him into Darth Vader"

Source

"Turn his face into Einstein"

"Turn his face into a skull"

"Make him a mustache"

"Turn him into Leonardo Dicaprio"

Source

"Turn the bear statue into a polar bear"

"Turn the bear statue into a grizzly bear"

"Turn the bear statue into an asiatic black bear"

"Turn the bear statue into a panda"

"Turn the bear statue into a skull bear"

Source

"Turn only the plant above the flowerpot into a tulip and keep soil"

"Turn only the plant above the flowerpot into a rose and keep soil"

"Turn only the plant above the flowerpot into a sunflower and keep soil"

Source

"Make it autumn"

Source

"Make it look like it just snowed"

"Make it sunset"
