NmetaQ: An n-agent reinforcement learning algorithm based on meta equilibrium

Yujing Hu; Zhaonan Sun; Xingguo Chen; Yang Gao; Ruili Wang

NmetaQ: An n-agent reinforcement learning algorithm based on meta equilibrium

Yujing Hu, Zhaonan Sun, Xingguo Chen, Yang Gao, Ruili Wang

Research output: Contribution to conference › Paper › peer-review

Abstract

Multi-agent reinforcement learning (MARL) has been widely studied over the last years. In MARL, one approach is to combine game theory with reinforcement learning (RL) to help with selecting actions and updating policies. Markov games are adopted in this approach as the framework and policies are learnt based on equilibrium theories. Several algorithms have been proposed based on this idea, such as minimax-Q, NashQ, FFQ, Correlated-Q and MetaQ. However, some of these algorithms are proposed only for 2-agent problems while the others have difficulty in dealing with problems with more than 2 agents. Since many tasks involve more than 2 agents in the real world, an algorithm which can deal with n-agent (n > 2) problems is needed. In this paper, we propose nMetaQ based on MetaQ. nMetaQ can be applied to a multi-agent environment that has more than 2 agents. Experimental results demonstrate the empirical convergence of nMetaQ and show its satisfactory adaptive performance. The most important advantage of nMetaQ is that it can work efficiently and effectively in an n-agent (n > 2) environment while previous methods may not.

Original language	English
Pages	87-94
Number of pages	8
Publication status	Published - 2012
Externally published	Yes
Event	2012 Workshop on Adaptive and Learning Agents, ALA 2012 - Held in Conjunction with the 11th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2012 - Valencia, Spain Duration: 4 Jun 2012 → 5 Jun 2012

Conference

Conference	2012 Workshop on Adaptive and Learning Agents, ALA 2012 - Held in Conjunction with the 11th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2012
Country/Territory	Spain
City	Valencia
Period	4/06/12 → 5/06/12

Keywords

Game theory
Metagame
Multi-agent reinforcement learning
Nmetaq

ASJC Scopus subject areas

Artificial Intelligence
Software

Cite this

@conference{ad17c5bd82e5472d921099225ca47367,

title = "NmetaQ: An n-agent reinforcement learning algorithm based on meta equilibrium",

abstract = "Multi-agent reinforcement learning (MARL) has been widely studied over the last years. In MARL, one approach is to combine game theory with reinforcement learning (RL) to help with selecting actions and updating policies. Markov games are adopted in this approach as the framework and policies are learnt based on equilibrium theories. Several algorithms have been proposed based on this idea, such as minimax-Q, NashQ, FFQ, Correlated-Q and MetaQ. However, some of these algorithms are proposed only for 2-agent problems while the others have difficulty in dealing with problems with more than 2 agents. Since many tasks involve more than 2 agents in the real world, an algorithm which can deal with n-agent (n > 2) problems is needed. In this paper, we propose nMetaQ based on MetaQ. nMetaQ can be applied to a multi-agent environment that has more than 2 agents. Experimental results demonstrate the empirical convergence of nMetaQ and show its satisfactory adaptive performance. The most important advantage of nMetaQ is that it can work efficiently and effectively in an n-agent (n > 2) environment while previous methods may not.",

keywords = "Game theory, Metagame, Multi-agent reinforcement learning, Nmetaq",

author = "Yujing Hu and Zhaonan Sun and Xingguo Chen and Yang Gao and Ruili Wang",

year = "2012",

language = "English",

pages = "87--94",

note = "2012 Workshop on Adaptive and Learning Agents, ALA 2012 - Held in Conjunction with the 11th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2012 ; Conference date: 04-06-2012 Through 05-06-2012",

}

NmetaQ: An n-agent reinforcement learning algorithm based on meta equilibrium. / Hu, Yujing; Sun, Zhaonan; Chen, Xingguo et al.
2012. 87-94 Paper presented at 2012 Workshop on Adaptive and Learning Agents, ALA 2012 - Held in Conjunction with the 11th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2012, Valencia, Spain.

Research output: Contribution to conference › Paper › peer-review

TY - CONF

T1 - NmetaQ

T2 - 2012 Workshop on Adaptive and Learning Agents, ALA 2012 - Held in Conjunction with the 11th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2012

AU - Hu, Yujing

AU - Sun, Zhaonan

AU - Chen, Xingguo

AU - Gao, Yang

AU - Wang, Ruili

PY - 2012

Y1 - 2012

N2 - Multi-agent reinforcement learning (MARL) has been widely studied over the last years. In MARL, one approach is to combine game theory with reinforcement learning (RL) to help with selecting actions and updating policies. Markov games are adopted in this approach as the framework and policies are learnt based on equilibrium theories. Several algorithms have been proposed based on this idea, such as minimax-Q, NashQ, FFQ, Correlated-Q and MetaQ. However, some of these algorithms are proposed only for 2-agent problems while the others have difficulty in dealing with problems with more than 2 agents. Since many tasks involve more than 2 agents in the real world, an algorithm which can deal with n-agent (n > 2) problems is needed. In this paper, we propose nMetaQ based on MetaQ. nMetaQ can be applied to a multi-agent environment that has more than 2 agents. Experimental results demonstrate the empirical convergence of nMetaQ and show its satisfactory adaptive performance. The most important advantage of nMetaQ is that it can work efficiently and effectively in an n-agent (n > 2) environment while previous methods may not.

AB - Multi-agent reinforcement learning (MARL) has been widely studied over the last years. In MARL, one approach is to combine game theory with reinforcement learning (RL) to help with selecting actions and updating policies. Markov games are adopted in this approach as the framework and policies are learnt based on equilibrium theories. Several algorithms have been proposed based on this idea, such as minimax-Q, NashQ, FFQ, Correlated-Q and MetaQ. However, some of these algorithms are proposed only for 2-agent problems while the others have difficulty in dealing with problems with more than 2 agents. Since many tasks involve more than 2 agents in the real world, an algorithm which can deal with n-agent (n > 2) problems is needed. In this paper, we propose nMetaQ based on MetaQ. nMetaQ can be applied to a multi-agent environment that has more than 2 agents. Experimental results demonstrate the empirical convergence of nMetaQ and show its satisfactory adaptive performance. The most important advantage of nMetaQ is that it can work efficiently and effectively in an n-agent (n > 2) environment while previous methods may not.

KW - Game theory

KW - Metagame

KW - Multi-agent reinforcement learning

KW - Nmetaq

UR - http://www.scopus.com/inward/record.url?scp=84876901520&partnerID=8YFLogxK

M3 - Paper

AN - SCOPUS:84876901520

SP - 87

EP - 94

Y2 - 4 June 2012 through 5 June 2012

ER -

NmetaQ: An n-agent reinforcement learning algorithm based on meta equilibrium

Abstract

Conference

Keywords

ASJC Scopus subject areas

Other files and links

Fingerprint

Cite this