We propose a training-free approach to 3D editing that enables the editing of a single shape within a few minutes. The edited 3D mesh aligns well with the prompts, and remains identical for regions that are not intended to be altered. To this end, we first project the 3D object onto 4-view images and perform synchronized multi-view image editing along with user-guided text prompts and user-provided rough masks. However, the targeted regions to be edited are ambiguous due to projection from 3D to 2D. To ensure precise editing only in intended regions, we develop a 3D segmentation pipeline that detects edited areas in 3D space, followed by a merging algorithm to seamlessly integrate edited 3D regions with the original input. Extensive experiments demonstrate the superiority of our method over previous approaches, enabling fast, high-quality editing while preserving unintended regions.
Prompt & Mask | Input Shape | Edited Shape | ||||
---|---|---|---|---|---|---|
truck carrying |
||||||
|
||||||
viking with a red pepper mustache wearing a helmet | ||||||
|
||||||
robotic arm holding a |
||||||
|
||||||
hogwarts castle with |
||||||
|
||||||
|
||||||
|
happy ginger cat ... | sad ginger cat ... | ginger cat with sunglasses ... |
---|---|---|
Input Shape | w/o Merging | w/o Average Merging | Ours |
---|---|---|---|
@misc{erkoc2024preditor3d,
title={PrEditor3D: Fast and Precise 3D Shape Editing},
author={Ziya Erkoç and Can Gümeli and Chaoyang Wang and Matthias Nießner and Angela Dai and Peter Wonka and Hsin-Ying Lee and Peiye Zhuang},
year={2024},
eprint={2412.06592},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2412.06592},
}