2014-Multi-Agent Machine Learning: A Reinforcement Approach
2018-Deep Multi-Agent Reinforcement Learning
2019-The StarCraft Multi-Agent Challenge-SMAC
2019-CityFlow: A City-Scale Benchmark for Multi-Target Multi-Camera Vehicle Tracking and Re-Identification-CityFlow
2005-An Overview of Cooperative and Competitive Multiagent Learning
2005-Cooperative multi-agent learning: the state of the art
2005-Evolutionary game theory and multi-agent reinforcement learning
2005-From single-agent to multi-agent reinforcement learning: Foundational concepts and methods
2007-If multi-agent learning is the answer, what is the question?
2008-A comprehensive survey of multi-agent reinforcement learning
2010-Multi-agent reinforcement learning: An overview
2012-Game theory and multi-agent reinforcement learning
2015-Evolutionary Dynamics of Multi-Agent Learning: A Survey
2016-Deep Reinforcement Learning Variants of Multi-Agent Learning Algorithms
2019-A survey and critique of multiagent deep reinforcement learning
2019-A review of cooperative multi-agent deep reinforcement learning
2019-Multi-Agent Reinforcement Learning:A Selective Overview of Theories and Algorithms
2019-A Survey on Transfer Learning for Multiagent Reinforcement Learning Systems
2019-Autonomously Reusing Knowledge in Multiagent Reinforcement Learning
2019-多智能体强化学习综述
2019-多智能体深度强化学习研究综述
2020-Deep Reinforcement Learning for Multi-Agent Systems: A Review of Challenges, Solutions and Applications
1997-Multi-agent reinforcement learning: Independent vs. cooperative agents-IQL
2015-Multiagent cooperation and competition with deep reinforcement learning-IDQN
2018-Mean Field Multi-Agent Reinforcement Learning-平均场
2019-Factorized Q-Learning for Large-Scale Multi-Agent Systems
2002-Multiagent Planning with Factored MDPs
2017-Value-decomposition networks for cooperative multi-agent learning-VDN
2018-QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning-QMIX
2019-QTRAN: Learning to Factorize with Transformation for Cooperative Multi-Agent Reinforcement Learning-QTRAN
2020-Learning Nearly Decomposable Value Functions Via Communication Minimization-NDQ
2020-Deep Coordination Graphs-DCG
2020-The Representational Capacity of Action-Value Networks for Multi-Agent Reinforcement Learning
2020-Qatten: A General Framework for Cooperative Multiagent Reinforcement Learning-Qatten
2020-Action Semantics Network: Considering the Effects of Actions in Multiagent Systems-ASN
2020-SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multi-Agent Reinforcement Learning
2020-On the Robustness of Cooperative Multi-Agent Reinforcement Learning
2020-MAVEN: Multi-Agent Variational Exploration-MAVEN
2020-Q-value Path Decomposition for Deep Multiagent Reinforcement Learning
2017-Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments-MADDPG
2018-Actor-Attention-Critic for Multi-Agent Reinforcement Learning-MAAC
2018-Counterfactual Multi-Agent Policy Gradients-COMA
2018-Multiagent Soft Q-Learning
2018-Credit assignment for collective multiagent RL with global rewards
2020-CM3: COOPERATIVE MULTI-GOAL MULTI-STAGE MULTI-AGENT REINFORCEMENT LEARNING
2018-Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning
2016-Learning to communicate with deep multi-agent reinforcement learning
2016-Learning multiagent communication with backpropagation
2016-Learning to communicate to solve riddles with deep distributed recurrent q-networks
2017-Multiagent Bidirectionally-Coordinated Nets for Learning to Play StarCraft Combat Games
2020-Multiagent Value Iteration Algorithms in Dynamic Programming and Reinforcement Learning
2020-Multiagent Rollout Algorithms and Reinforcement Learning
2015-Fictitious Self-Play in Extensive-Form Games
2016-Deep reinforcement learning from self-play in imperfect-information games
2017-A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning
2018-Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments
2019-On the Utility of Learning about Humans for Human-AI Coordination
2017-Accelerating Multiagent Reinforcement Learning through Transfer Learning
2017-Simultaneously Learning and Advising in Multiagent Reinforcement Learning