Publications

Learning emergent discrete message communication for cooperative reinforcement learning

February 24, 2021

Conference Paper

Author:

Sheng Li

…

Published in:

37th Conf. on Uncertainty in Artificial Intelligence, UAI 2021, early access, 26-30 July 2021.

Topic:

communications

R&D area:

R&D group:

Homeland Protection Systems

Summary

Communication is an important factor that enables agents to work cooperatively in multi-agent reinforcement learning (MARL). Most previous work uses continuous message communication, whose high representational capacity comes at the expense of interpretability. Allowing agents to learn their own discrete message communication protocols, which can emerge across a variety of domains, can increase interpretability for human designers and other agents. This paper proposes a method to generate discrete messages analogous to human languages and achieves communication through a broadcast-and-listen mechanism based on self-attention. We show that discrete message communication has performance comparable to continuous message communication but with a much smaller vocabulary size. Furthermore, we propose an approach that allows humans to interactively send discrete messages to agents.

Towards a distributed framework for multi-agent reinforcement learning research

September 22, 2020

Conference Paper

Author:

Yutai Zhou

…

Published in:

2020 IEEE High Performance Extreme Computing Conf., HPEC, 22-24 September 2020.

Topic:

high performance computing

R&D area:

R&D group:

Summary

Some of the most important publications in deep reinforcement learning over the last few years have been fueled by access to massive amounts of computation through large-scale distributed systems. The success of these approaches in achieving human-expert-level performance on several complex video-game environments has motivated further exploration into the limits of these approaches as computation increases. In this paper, we present a distributed RL training framework designed for supercomputing infrastructures such as the MIT SuperCloud. We review a collection of challenging learning environments (such as Google Research Football, StarCraft II, and Multi-Agent Mujoco) that are at the frontier of reinforcement learning research. We provide results on these environments that illustrate the current state of the field on these problems. Finally, we quantify and discuss the computational requirements of performing RL research by enumerating all experiments performed on these environments.

Deep implicit coordination graphs for multi-agent reinforcement learning [e-print]

June 19, 2020

Journal Article

Author:

Sheng Li

…

Published in:

https://arxiv.org/abs/2006.11438

Topic:

machine learning

R&D area:

Cyber Security and Information Sciences

Summary

Multi-agent reinforcement learning (MARL) requires coordination to efficiently solve certain tasks. Fully centralized control is often infeasible in such domains due to the size of joint action spaces. Coordination-graph-based formalizations allow reasoning about the joint action based on the structure of interactions. However, they often require domain expertise in their design. This paper introduces the deep implicit coordination graph (DICG) architecture for such scenarios. DICG consists of a module for inferring the dynamic coordination graph structure, which is then used by a graph-neural-network-based module to learn to implicitly reason about the joint actions or values. DICG allows learning the tradeoff between full centralization and decentralization via standard actor-critic methods to significantly improve coordination for domains with a large number of agents. We apply DICG to both centralized-training-centralized-execution and centralized-training-decentralized-execution regimes. We demonstrate that DICG solves the relative overgeneralization pathology in predator-prey tasks and outperforms various MARL baselines on the challenging StarCraft II Multi-agent Challenge (SMAC) and traffic junction environments.
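
The two-stage idea in this summary (infer a graph over agents, then pass information along it) can be illustrated with a rough sketch. This is not the authors' implementation: the use of scaled dot-product scores to infer edge weights, the absence of learned parameters, and the names `infer_graph` and `propagate` are all assumptions made for illustration.

```python
import math

def softmax(row):
    # Numerically stable softmax over one row of scores.
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]

def infer_graph(embeddings):
    # Assumed graph-inference step: edge weights come from scaled
    # dot-product similarity between agent embeddings (hypothetical choice).
    d = len(embeddings[0])
    scores = [
        [sum(a * b for a, b in zip(ei, ej)) / math.sqrt(d) for ej in embeddings]
        for ei in embeddings
    ]
    return [softmax(row) for row in scores]

def propagate(adjacency, embeddings):
    # One message-passing step: each agent's new embedding is the
    # edge-weighted average of all agents' embeddings (no learned weights here).
    d = len(embeddings[0])
    return [
        [sum(w * e[k] for w, e in zip(row, embeddings)) for k in range(d)]
        for row in adjacency
    ]

# Three hypothetical agents with 2-dimensional embeddings.
agents = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]
graph = infer_graph(agents)       # soft adjacency matrix, rows sum to 1
mixed = propagate(graph, agents)  # embeddings after one propagation step
```

In a trained system both steps would carry learned parameters and feed a policy or value head; the sketch only shows how a soft graph can be derived from agent features and then used to mix them.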
