Abstract
Vector graphics are widely used in digital art and valued by designers for their scalability and layer-wise topological properties. However, the creation and editing of vector graphics necessitate creativity and design expertise, leading to a time-consuming process. In this paper, we propose a novel pipeline that generates high-quality customized vector graphics based on textual prompts while preserving the properties and layer-wise information of a given exemplar SVG. Our method harnesses the capabilities of large pre-trained text-to-image models. By fine-tuning the cross-attention layers of the model, we generate customized raster images guided by textual prompts. To initialize the SVG, we introduce a semantic-based path alignment method that preserves and transforms crucial paths from the exemplar SVG. Additionally, we optimize path parameters using both image-level and vector-level losses, ensuring smooth shape deformation while aligning with the customized raster image. We extensively evaluate our method using multiple metrics from vector-level, image-level, and text-level perspectives. The evaluation results demonstrate the effectiveness of our pipeline in generating diverse customizations of vector graphics with exceptional quality. The project page is https://intchous.github.io/SVGCustomization.