PrEditor3D: Fast and Precise 3D Shape Editing

Ziya Erkoç¹ Can Gümeli¹ Chaoyang Wang² Matthias Niessner¹ Angela Dai¹ Peter Wonka^2,3 Hsin-Ying Lee² Peiye Zhuang²

¹TUM ²Snap Inc ³KAUST

Work partially done while Ziya and Can were interning at Snap

arXiv Video GitHub

CVPR 2025

PrEditor3D quickly edits a 3D shape with a given text prompt and a mask.

angel holding a ~~stone~~ fruit basket

~~chicken~~ sad ginger cat in a racing car

viking with a red pepper mustache wearing a helmet

dog with a blonde straight hair

Abstract

We propose a training-free approach to 3D editing that enables the editing of a single shape within a few minutes. The edited 3D mesh aligns well with the prompts, and remains identical for regions that are not intended to be altered. To this end, we first project the 3D object onto 4-view images and perform synchronized multi-view image editing along with user-guided text prompts and user-provided rough masks. However, the targeted regions to be edited are ambiguous due to projection from 3D to 2D. To ensure precise editing only in intended regions, we develop a 3D segmentation pipeline that detects edited areas in 3D space, followed by a merging algorithm to seamlessly integrate edited 3D regions with the original input. Extensive experiments demonstrate the superiority of our method over previous approaches, enabling fast, high-quality editing while preserving unintended regions.

Video

Precise Editing Results

Prompt & Mask

Input Shape

Edited Shape

truck carrying ~~clay~~ pizza

viking with a red pepper mustache wearing a helmet

robotic arm holding a ~~Oreo cookie~~ pizza over a coffee mug

hogwarts castle with ~~main~~ clock tower in the middle

~~chicken~~ cat with a tail in a racing car

Diverse Editing Results

With the same prompt and different seeds, we can generate diverse results.

By adjusting the prompt, we can control different aspects of editing such as appearance and mood.

happy ginger cat ...	sad ginger cat ...	ginger cat with sunglasses ...

Ablation Results

Our merging algorithm allows us to keep original parts of the shape fixed. In the example below, naively doing multi-view editing without any merging changes important properties of the car as well. With our mergin operation, we can recover the car back. However, that can lead to disjoint parts which we resolve by doing additional averaging around the new part.

Input Shape	w/o Merging	w/o Average Merging	Ours

BibTeX

@InProceedings{Erkoc_2025_CVPR,
    author    = {Erko\c{c}, Ziya and G\"umeli, Can and Wang, Chaoyang and Nie{\ss}ner, Matthias and Dai, Angela and Wonka, Peter and Lee, Hsin-Ying and Zhuang, Peiye},
    title     = {PrEditor3D: Fast and Precise 3D Shape Editing},
    booktitle = {Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR)},
    month     = {June},
    year      = {2025},
    pages     = {640-649}
}