The advent of 3D Gaussian Splatting (3DGS) has revolutionized 3D editing, offering efficient, high-fidelity rendering and enabling precise local manipulations. Current methods use diffusion-based 2D editing models to modify multi-view rendered images, which then guide the editing of the 3DGS model. However, this approach suffers from multi-view inconsistency: the guidance images differ significantly across views, which leads to mode collapse and visual artifacts in the edited 3DGS. To address this, we introduce View-consistent Editing (VcEdit), a novel framework that incorporates 3DGS into the image editing process, ensuring multi-view consistency in the edited guidance images and effectively mitigating mode collapse. VcEdit employs two consistency modules, the Cross-attention Consistency Module and the Editing Consistency Module, both designed to reduce inconsistencies in the edited images. By applying these modules in an iterative pattern, VcEdit resolves multi-view inconsistency and enables high-quality 3DGS editing across a diverse range of scenes.