Cooperation learning in Multi-Agent Systems with annotation and reward

  • Authors:
  • Tetsuya Yoshida

  • Affiliations:
  • Graduate School of Information Science and Technology, Hokkaido University, N-14, W-9, Sapporo, Hokkaido 060-0814, Japan. Tel.: +81 11 706 7253 / Fax: +81 11 706 7808 / E-mail: yoshida@meme.hokudai. ...

  • Venue:
  • International Journal of Knowledge-based and Intelligent Engineering Systems
  • Year:
  • 2007

Abstract

This paper proposes a novel approach for enabling agents to learn to cooperate with each other based on annotation and reward in Multi-Agent Systems (MAS). We propose two methods for cooperation learning in MAS: 1) a cooperation method, and 2) a learning method. As for 1), our method enables each agent to interact with other agents by sending its proposal and receiving counterproposals from them. The counterproposals are constructed by modifying the communicated proposal and are used to clarify differences of opinion among agents. Conflict resolution is conducted to reduce these differences and facilitate cooperation. Furthermore, annotation, which acts as a kind of design rationale, is attached to the communicated proposal and counterproposals to facilitate conflict resolution. As for 2), we propose an extension of a reinforcement learning method so that agents can learn appropriate behavior based on the reward given through interaction among agents. We define two kinds of reward for each agent: the reward for the proposal constructed by the agent, and the reward for coherence among agents. Comparative simulation studies of micro-satellite design, a task well suited to cooperative problem solving by MAS, were conducted to evaluate the proposed approach. The results are encouraging and show that this approach is worth pursuing. In particular, they indicate that an appropriate balance between exploration and exploitation, which is important in cooperative problem solving in general, can be learned with the proposed approach.
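
The abstract does not spell out the learning update, so the following is only a minimal sketch of the general idea: a tabular, Q-learning-style agent that combines a reward for its own proposal with a reward for coherence among agents, and balances exploration and exploitation with an epsilon-greedy policy. The class name, the weighting parameter beta, and the reward arguments are hypothetical illustrations, not the paper's actual algorithm.

```python
import random
from collections import defaultdict


class ProposalAgent:
    """Sketch of an agent that learns from two kinds of reward:
    one for its own proposal and one for coherence among agents."""

    def __init__(self, actions, alpha=0.1, gamma=0.9, beta=0.5, epsilon=0.1):
        self.actions = actions        # possible proposal modifications (assumed discrete)
        self.alpha = alpha            # learning rate
        self.gamma = gamma            # discount factor
        self.beta = beta              # assumed weight between the two reward kinds
        self.epsilon = epsilon        # exploration rate (exploration/exploitation balance)
        self.q = defaultdict(float)   # Q-values keyed by (state, action)

    def choose_action(self, state):
        # epsilon-greedy choice over proposal modifications
        if random.random() < self.epsilon:
            return random.choice(self.actions)
        return max(self.actions, key=lambda a: self.q[(state, a)])

    def update(self, state, action, r_proposal, r_coherence, next_state):
        # Combine the reward for the agent's own proposal with the reward
        # for coherence among agents; the linear weighting is an assumption.
        reward = self.beta * r_proposal + (1.0 - self.beta) * r_coherence
        best_next = max(self.q[(next_state, a)] for a in self.actions)
        td_target = reward + self.gamma * best_next
        self.q[(state, action)] += self.alpha * (td_target - self.q[(state, action)])
```

In such a scheme, epsilon controls how often an agent tries untested proposal modifications rather than exploiting the best-known one, which is one plausible way to realize the exploration/exploitation balance the abstract refers to.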