Abstract
Game design hinges on understanding how static rules and content translateinto dynamic player behavior - something modern generative systems that inspectonly a game's code or assets struggle to capture. We present an automateddesign iteration framework that closes this gap by pairing a reinforcementlearning (RL) agent, which playtests the game, with a large multimodal model(LMM), which revises the game based on what the agent does. In each loop the RLplayer completes several episodes, producing (i) numerical play metrics and/or(ii) a compact image strip summarising recent video frames. The LMM designerreceives a gameplay goal and the current game configuration, analyses the playtraces, and edits the configuration to steer future behaviour toward the goal.We demonstrate results that LMMs can reason over behavioral traces supplied byRL agents to iteratively refine game mechanics, pointing toward practical,scalable tools for AI-assisted game design.