Abstract
Addressing global challenges such as greenhouse gas emissions and resourceinequity demands advanced AI-driven coordination among autonomous agents. Wepropose CH-MARL (Constrained Hierarchical Multiagent Reinforcement Learning), anovel framework that integrates hierarchical decision-making with dynamicconstraint enforcement and fairness-aware reward shaping. CH-MARL employs areal-time constraint-enforcement layer to ensure adherence to global emissioncaps, while incorporating fairness metrics that promote equitable resourcedistribution among agents. Experiments conducted in a simulated maritimelogistics environment demonstrate considerable reductions in emissions, alongwith improvements in fairness and operational efficiency. Beyond thisdomain-specific success, CH-MARL provides a scalable, generalizable solution tomulti-agent coordination challenges in constrained, dynamic settings, thusadvancing the state of the art in reinforcement learning.