Abstract
Many real-world problems (e.g., resource management, autonomous driving, drugdiscovery) require optimizing multiple, conflicting objectives. Multi-objectivereinforcement learning (MORL) extends classic reinforcement learning to handlemultiple objectives simultaneously, yielding a set of policies that capturevarious trade-offs. However, the MORL field lacks complex, realisticenvironments and benchmarks. We introduce a water resource (Nile river basin)management case study and model it as a MORL environment. We then benchmarkexisting MORL algorithms on this task. Our results show that specialized watermanagement methods outperform state-of-the-art MORL approaches, underscoringthe scalability challenges MORL algorithms face in real-world scenarios.