Abstract
Test-time training adapts to a new test distribution on the fly by optimizinga model for each test input using self-supervision. In this paper, we usemasked autoencoders for this one-sample learning problem. Empirically, oursimple method improves generalization on many visual benchmarks fordistribution shifts. Theoretically, we characterize this improvement in termsof the bias-variance trade-off.
Quick Read (beta)
loading the full paper ...