BiECVC: Gated Diversification of Bidirectional Contexts for Learned Video Compression

  • 2025-07-24 16:57:30
  • Wei Jiang, Junru Li, Kai Zhang, Li Zhang

Abstract

Recent forward prediction-based learned video compression (LVC) methods have achieved impressive results, even surpassing VVC reference software VTM under the Low Delay B (LDB) configuration. In contrast, learned bidirectional video compression (BVC) remains underexplored and still lags behind its forward-only counterparts. This performance gap is mainly due to the limited ability to extract diverse and accurate contexts: most existing BVCs primarily exploit temporal motion while neglecting non-local correlations across frames. Moreover, they lack the adaptability to dynamically suppress harmful contexts arising from fast motion or occlusion. To tackle these challenges, we propose BiECVC, a BVC framework that incorporates diversified local and non-local context modeling along with adaptive context gating. For local context enhancement, BiECVC reuses high-quality features from lower layers and aligns them using decoded motion vectors without introducing extra motion overhead. To model non-local dependencies efficiently, we adopt a linear attention mechanism that balances performance and complexity. To further mitigate the impact of inaccurate context prediction, we introduce Bidirectional Context Gating, inspired by data-dependent decay in recent autoregressive language models, to dynamically filter contextual information based on conditional coding results. Extensive experiments demonstrate that BiECVC achieves state-of-the-art performance, reducing the bit-rate by 13.4% and 15.7% compared to VTM 13.2 under the Random Access (RA) configuration with intra periods of 32 and 64, respectively. To our knowledge, BiECVC is the first learned video codec to surpass VTM 13.2 RA across all standard test datasets.
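
The abstract does not specify which linear attention variant BiECVC uses. As a rough illustration of why linear attention is cheaper than quadratic attention for non-local modeling, the sketch below implements a generic kernelized form, assuming an `elu(x) + 1` feature map. That choice, along with the function names `feature_map`, `linear_attention`, and `quadratic_attention`, is an assumption made for illustration and is not taken from the paper. The point is that the key-value summary is accumulated once and reused for every query, so the cost grows linearly with the number of tokens.

```python
import math


def feature_map(x):
    # Assumed kernel feature map: elu(x) + 1, which is strictly positive.
    return x + 1.0 if x > 0 else math.exp(x)


def linear_attention(Q, K, V):
    """Kernelized attention. Q: n x d, K: m x d, V: m x e (lists of lists)."""
    d, e = len(K[0]), len(V[0])
    # One pass over keys/values: S = sum_j phi(k_j) v_j^T, z = sum_j phi(k_j).
    S = [[0.0] * e for _ in range(d)]
    z = [0.0] * d
    for k, v in zip(K, V):
        pk = [feature_map(x) for x in k]
        for a in range(d):
            z[a] += pk[a]
            for b in range(e):
                S[a][b] += pk[a] * v[b]
    # Each query reuses S and z, so no n x m weight matrix is ever formed.
    out = []
    for q in Q:
        pq = [feature_map(x) for x in q]
        denom = sum(pq[a] * z[a] for a in range(d))
        out.append([sum(pq[a] * S[a][b] for a in range(d)) / denom
                    for b in range(e)])
    return out


def quadratic_attention(Q, K, V):
    """Reference: same kernel, but with explicit per-pair weights (n x m)."""
    e = len(V[0])
    phi_K = [[feature_map(x) for x in k] for k in K]
    out = []
    for q in Q:
        pq = [feature_map(x) for x in q]
        w = [sum(a * b for a, b in zip(pq, pk)) for pk in phi_K]
        total = sum(w)
        out.append([sum(wj * v[b] for wj, v in zip(w, V)) / total
                    for b in range(e)])
    return out
```

Both functions compute the same output; the linear form only reorders the products so that the sum over keys is shared across queries. In a codec the tokens would be spatial positions of feature maps from reference frames, which is where avoiding the pairwise weight matrix matters.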

 
