Abstract
We propose a novel framework for exploring generalization errors of transfer learning through the lens of differential calculus on the space of probability measures. In particular, we consider two main transfer learning scenarios, $\alpha$-ERM and fine-tuning with KL-regularized empirical risk minimization, and establish generic conditions under which the convergence rates of the generalization error and the population risk can be studied for these scenarios. Based on our theoretical results, we show the benefits of transfer learning with a one-hidden-layer neural network in the mean-field regime under suitable integrability and regularity assumptions on the loss and activation functions.