Trajectory-aligned Space-time Tokens for Few-shot Action Recognition

  • 2024-07-25 18:59:31
  • Pulkit Kumar, Namitha Padmanabhan, Luke Luo, Sai Saketh Rambhatla, Abhinav Shrivastava


We propose a simple yet effective approach for few-shot action recognition, emphasizing the disentanglement of motion and appearance representations. By harnessing recent progress in tracking, specifically point trajectories and self-supervised representation learning, we build trajectory-aligned tokens (TATs) that capture motion and appearance information. This approach significantly reduces the data requirements while retaining essential information. To process these representations, we use a Masked Space-time Transformer that effectively learns to aggregate information to facilitate few-shot action recognition. We demonstrate state-of-the-art results on few-shot action recognition across multiple datasets. Our project page is available at

