Flash STU: Fast Spectral Transform Units

  • 2024-09-17 13:01:14
  • Y. Isabel Liu, Windsor Nguyen, Yagiz Devre, Evan Dogariu, Anirudha Majumdar, Elad Hazan
  • 0

Abstract

This paper describes an efficient, open source PyTorch implementation of theSpectral Transform Unit. We investigate sequence prediction tasks over severalmodalities including language, robotics, and simulated dynamical systems. Wefind that for the same parameter count, the STU and its variants outperform theTransformer as well as other leading state space models across variousmodalities.

 

Quick Read (beta)

loading the full paper ...