Adversarial Attacks on Image Generation With Made-Up Words

Abstract

Text-guided image generation models can be prompted to generate images usingnonce words adversarially designed to robustly evoke specific visual concepts.Two approaches for such generation are introduced: macaronic prompting, whichinvolves designing cryptic hybrid words by concatenating subword units fromdifferent languages; and evocative prompting, which involves designing noncewords whose broad morphological features are similar enough to that of existingwords to trigger robust visual associations. The two methods can also becombined to generate images associated with more specific visual concepts. Theimplications of these techniques for the circumvention of existing approachesto content moderation, and particularly the generation of offensive or harmfulimages, are discussed.

Quick Read (beta)

loading the full paper ...