Abstract
Global climate models (GCMs) are the main tools for understanding and predicting climate change. However, due to limited numerical resolutions, these models suffer from major structural uncertainties; e.g., they cannot resolve critical processes such as small-scale eddies in atmospheric and oceanic turbulence. Thus, such small-scale processes have to be represented as a function of the resolved scales via closures (parametrization). The accuracy of these closures is particularly important for capturing climate extremes. Traditionally, such closures are based on heuristics and simplifying assumptions about the unresolved physics. Recently, supervised-learned closures, trained offline on high-fidelity data, have been shown to outperform the classical physics-based closures. However, this approach requires a significant amount of high-fidelity training data and can also lead to instabilities. Reinforcement learning is emerging as a potent alternative for developing such closures as it requires only low-order statistics and leads to stable closures. In Scientific Multi-Agent Reinforcement Learning (SMARL) computational elements serve a dual role of discretization points and learning agents. We leverage SMARL and fundamentals of turbulence physics to learn closures for prototypes of atmospheric and oceanic turbulence. The policy is trained using only the enstrophy spectrum, which is nearly invariant and can be estimated from a few high-fidelity samples (these few samples are far from enough for supervised/offline learning). We show that these closures lead to stable low-resolution simulations that, at a fraction of the cost, can reproduce the high-fidelity simulations' statistics, including the tails of the probability density functions. The results demonstrate the high potential of SMARL for closure modeling for GCMs, especially in the regime of scarce data and indirect observations.
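
For readers unfamiliar with the training target named above, the following is a minimal sketch of how an angle-averaged enstrophy spectrum might be estimated from a few vorticity snapshots. It is an illustration under stated assumptions, not the procedure used in this work: the field is assumed to be square and doubly periodic, the function names `enstrophy_spectrum` and `mean_enstrophy_spectrum`, the domain length `L`, and the nearest-integer shell binning are all hypothetical choices made for this example.

```python
import numpy as np


def enstrophy_spectrum(omega, L=2 * np.pi):
    """Angle-averaged enstrophy spectrum of a square, doubly periodic
    2D vorticity field `omega` (shape n x n). Illustrative sketch only."""
    n = omega.shape[0]
    # Normalize so that sum(|omega_hat|^2) equals mean(omega^2) (Parseval).
    omega_hat = np.fft.fft2(omega) / n**2
    dk = 2 * np.pi / L  # spacing between resolved wavenumbers
    k1d = np.fft.fftfreq(n, d=1.0 / n) * dk
    kx, ky = np.meshgrid(k1d, k1d, indexing="ij")
    k_mag = np.sqrt(kx**2 + ky**2)
    # Assumed binning: assign each Fourier mode to the nearest wavenumber shell.
    shell = np.rint(k_mag / dk).astype(int)
    density = 0.5 * np.abs(omega_hat) ** 2
    spectrum = np.bincount(shell.ravel(), weights=density.ravel())
    k = np.arange(spectrum.size) * dk
    return k, spectrum


def mean_enstrophy_spectrum(snapshots, L=2 * np.pi):
    """Average the spectrum over a small set of snapshots."""
    spectra = [enstrophy_spectrum(w, L)[1] for w in snapshots]
    k = enstrophy_spectrum(snapshots[0], L)[0]
    return k, np.mean(spectra, axis=0)


if __name__ == "__main__":
    # Single-mode test field: omega = cos(4x) on a 64 x 64 grid.
    x = np.linspace(0, 2 * np.pi, 64, endpoint=False)
    X, _ = np.meshgrid(x, x, indexing="ij")
    k, Z = enstrophy_spectrum(np.cos(4 * X))
    print(k[np.argmax(Z)], Z.max())
```

With this normalization the spectrum sums to the domain-averaged enstrophy, so the single-mode test field places all of its enstrophy (0.25) in the shell at wavenumber 4.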