Spectral Estimators for Multi-Index Models: Precise Asymptotics and Optimal Weak Recovery

  • 2025-06-10 18:54:02
  • Filip Kovačević, Yihan Zhang, Marco Mondelli
  • 0

Abstract

Multi-index models provide a popular framework to investigate thelearnability of functions with low-dimensional structure and, also due to theirconnections with neural networks, they have been object of recent intensivestudy. In this paper, we focus on recovering the subspace spanned by thesignals via spectral estimators -- a family of methods routinely used inpractice, often as a warm-start for iterative algorithms. Our main technicalcontribution is a precise asymptotic characterization of the performance ofspectral methods, when sample size and input dimension grow proportionally andthe dimension $p$ of the space to recover is fixed. Specifically, we locate thetop-$p$ eigenvalues of the spectral matrix and establish the overlaps betweenthe corresponding eigenvectors (which give the spectral estimators) and a basisof the signal subspace. Our analysis unveils a phase transition phenomenon inwhich, as the sample complexity grows, eigenvalues escape from the bulk of thespectrum and, when that happens, eigenvectors recover directions of the desiredsubspace. The precise characterization we put forward enables the optimizationof the data preprocessing, thus allowing to identify the spectral estimatorthat requires the minimal sample size for weak recovery.

 

Quick Read (beta)

loading the full paper ...