Nash q-learning for general-sum stochastic games
The Journal of Machine Learning Research
Existence of multiagent equilibria with limited agents
Journal of Artificial Intelligence Research
Speeding up learning automata based multi agent systems using the concepts of stigmergy and entropy
Expert Systems with Applications: An International Journal
Hi-index | 0.00 |
Rationality and convergence are two important criterions for multi-agent learning. A novel method called Pareto-Q learning is prompted for cooperative general-sum games, with the Pareto Optimum allowing rationality and social conventions benefiting the convergence. Experiments with the grid game suggest the efficiency of Pareto-Q. Compared with the single-agent Q-learning and Nash agent Q-learning, Pareto-Q learning performs best.