Abstract
We present `GL-LowPopArt`, a novel Catoni-style estimator for generalizedlow-rank trace regression. Building on `LowPopArt` (Jang et al., 2024), itemploys a two-stage approach: nuclear norm regularization followed by matrixCatoni estimation. We establish state-of-the-art estimation error bounds,surpassing existing guarantees (Fan et al., 2019; Kang et al., 2022), andreveal a novel experimental design objective, $\mathrm{GL}(\pi)$. The keytechnical challenge is controlling bias from the nonlinear inverse linkfunction, which we address by our two-stage approach. We prove a *local*minimax lower bound, showing that our `GL-LowPopArt` enjoys instance-wiseoptimality up to the condition number of the ground-truth Hessian. Applicationsinclude generalized linear matrix completion, where `GL-LowPopArt` achieves astate-of-the-art Frobenius error guarantee, and **bilinear dueling bandits**, anovel setting inspired by general preference learning (Zhang et al., 2024). Ouranalysis of a `GL-LowPopArt`-based explore-then-commit algorithm reveals a new,potentially interesting problem-dependent quantity, along with improved Bordaregret bound than vectorization (Wu et al., 2024).