Abstract
Image watermarking methods are not tailored to handle small watermarkedareas. This restricts applications in real-world scenarios where parts of theimage may come from different sources or have been edited. We introduce adeep-learning model for localized image watermarking, dubbed the WatermarkAnything Model (WAM). The WAM embedder imperceptibly modifies the input image,while the extractor segments the received image into watermarked andnon-watermarked areas and recovers one or several hidden messages from theareas found to be watermarked. The models are jointly trained at low resolutionand without perceptual constraints, then post-trained for imperceptibility andmultiple watermarks. Experiments show that WAM is competitive with state-of-theart methods in terms of imperceptibility and robustness, especially againstinpainting and splicing, even on high-resolution images. Moreover, it offersnew capabilities: WAM can locate watermarked areas in spliced images andextract distinct 32-bit messages with less than 1 bit error from multiple smallregions - no larger than 10% of the image surface - even for small $256\times256$ images.