Abstract
We construct targeted audio adversarial examples on automatic speechrecognition. Given any audio waveform, we can produce another that is over99.9% similar, but transcribes as any phrase we choose (at a rate of up to 50characters per second). We apply our iterative optimization-based attack toMozilla's implementation DeepSpeech end-to-end, and show it has a 100% successrate. The feasibility of this attack introduce a new domain to studyadversarial examples.
Quick Read (beta)
loading the full paper ...