Abstract
We propose a method to explore the flavor structure of quarks and leptonswith reinforcement learning. As a concrete model, we utilize a basicvalue-based algorithm for models with $U(1)$ flavor symmetry. By trainingneural networks on the $U(1)$ charges of quarks and leptons, the agent finds 21models to be consistent with experimentally measured masses and mixing anglesof quarks and leptons. In particular, an intrinsic value of normal orderingtends to be larger than that of inverted ordering, and the normal ordering iswell fitted with the current experimental data in contrast to the invertedordering. A specific value of effective mass for the neutrinoless double betadecay and a sizable leptonic CP violation induced by an angular component offlavon field are predicted by autonomous behavior of the agent. Our findingresults indicate that the reinforcement learning can be a new method forunderstanding the flavor structure.