Leveraging Organizational Hierarchy to Simplify Reward Design in Cooperative Multi-agent Reinforcement Learning

L Liu, V Ustun, R Kumar - The International FLAIRS Conference …, 2024 - journals.flvc.org
The International FLAIRS Conference Proceedings, 2024journals.flvc.org
The effectiveness of multi-agent reinforcement learning (MARL) hinges largely on the
meticulous arrangement of objectives. Yet, conventional MARL methods might not
completely harness the inherent structures present in environmental states and agent
relationships for goal organization. This study is conducted within the domain of military
training simulations, which are typically characterized by complexity, heterogeneity, non-
stationary and doctrine-driven environments with a clear organizational hierarchy and a top …
Abstract
The effectiveness of multi-agent reinforcement learning (MARL) hinges largely on the meticulous arrangement of objectives. Yet, conventional MARL methods might not completely harness the inherent structures present in environmental states and agent relationships for goal organization. This study is conducted within the domain of military training simulations, which are typically characterized by complexity, heterogeneity, non-stationary and doctrine-driven environments with a clear organizational hierarchy and a top-down chain of command. This research investigates the approximation and integration of the organizational hierarchy into MARL for cooperative training scenarios, with the goal of streamlining the processes of reward engineering and enhancing team coordination. In the preliminary experiments, we employed two-tiered commander-subordinate feudal hierarchical (CSFH) networks to separate the prioritized team goal and individual goals. The empirical results demonstrate that the proposed framework enhances learning efficiency. It guarantees the learning of a prioritized policy for the commander agent and encourages subordinate agents to explore areas of interest more frequently, guided by appropriate soft constraints imposed by the commander.
journals.flvc.org
Showing the best result for this search. See all results