Abstract
We introduce a new paradigm for active sound modification: Active SpeechEnhancement (ASE). While Active Noise Cancellation (ANC) algorithms focus onsuppressing external interference, ASE goes further by actively shaping thespeech signal -- both attenuating unwanted noise components and amplifyingspeech-relevant frequencies -- to improve intelligibility and perceptualquality. To enable this, we propose a novel Transformer-Mamba-basedarchitecture, along with a task-specific loss function designed to jointlyoptimize interference suppression and signal enrichment. Our method outperformsexisting baselines across multiple speech processing tasks -- includingdenoising, dereverberation, and declipping -- demonstrating the effectivenessof active, targeted modulation in challenging acoustic environments.