Abstract
We present NiceWebRL, a research tool that enables researchers to use machinereinforcement learning (RL) environments for online human subject experiments.NiceWebRL is a Python library that allows any Jax-based environment to betransformed into an online interface, supporting both single-agent andmulti-agent environments. As such, NiceWebRL enables AI researchers to comparetheir algorithms to human performance, cognitive scientists to test MLalgorithms as theories for human cognition, and multi-agent researchers todevelop algorithms for human-AI collaboration. We showcase NiceWebRL with 3case studies that demonstrate its potential to help develop Human-like AI,Human-compatible AI, and Human-assistive AI. In the first case study(Human-like AI), NiceWebRL enables the development of a novel RL model ofcognition. Here, NiceWebRL facilitates testing this model against humanparticipants in both a grid world and Craftax, a 2D Minecraft domain. In oursecond case study (Human-compatible AI), NiceWebRL enables the development of anovel multi-agent RL algorithm that can generalize to human partners in theOvercooked domain. Finally, in our third case study (Human-assistive AI), weshow how NiceWebRL can allow researchers to study how an LLM can assist humanson complex tasks in XLand-Minigrid, an environment with millions ofhierarchical tasks. The library is available athttps://github.com/KempnerInstitute/nicewebrl.