Abstract
Interpretable machine learning is rapidly becoming a crucial tool for scientific discovery. Among existing approaches, variational autoencoders (VAEs) have shown promise in extracting the hidden physical features of input data, without supervision or prior knowledge of the system under study. Yet the ability of VAEs to create meaningful, interpretable representations relies on their accurate approximation of the underlying probability distribution of their input. When dealing with quantum data, VAEs must therefore account for its intrinsic randomness and complex correlations. While VAEs have previously been applied to quantum data, they have often neglected its probabilistic nature, hindering the extraction of meaningful physical descriptors. Here, we demonstrate that two key modifications enable VAEs to learn physically meaningful latent representations: a decoder capable of faithfully reproducing quantum states and a probabilistic loss tailored to this task. Using benchmark quantum spin models, we identify regimes where standard methods fail while the representations learned by our approach remain meaningful and interpretable. Applied to experimental data from Rydberg atom arrays, the model autonomously uncovers the phase structure without access to prior labels, Hamiltonian details, or knowledge of relevant order parameters, highlighting its potential as an unsupervised and interpretable tool for the study of quantum systems.