Abstract
We tackle the task of geometric image editing, where an object within an image is repositioned, reoriented, or reshaped while preserving overall scene coherence. Previous diffusion-based editing methods often attempt to handle all relevant subtasks in a single step, which proves difficult when transformations become large or structurally complex. We address this by proposing a decoupled pipeline that separates object transformation, source region inpainting, and target region refinement. Both inpainting and refinement are implemented using a training-free diffusion approach, FreeFine. In experiments on our new GeoBench benchmark, which contains both 2D and 3D editing scenarios, FreeFine outperforms state-of-the-art alternatives in image fidelity and edit precision, especially under demanding transformations. Code and benchmark are available at: https://github.com/CIawevy/FreeFine