Multiobjective reinforcement learning for traffic signal control using vehicular ad hoc network

Authors:
Duan Houli;Li Zhiheng;Zhang Yi
Affiliations:
Department of Automation, Tsinghua University, Beijing, China;Department of Automation, Tsinghua University, Beijing, China;Department of Automation, Tsinghua University, Beijing, China
Venue:
EURASIP Journal on Advances in Signal Processing - Special title on vehicular ad hoc networks
Year:
2010

Citing 4
Cited 2

Learning to Predict by the Methods of Temporal Differences

Machine Learning
Multi-Agent Reinforcement Leraning for Traffic Light Control

ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Queue Spillovers in Transportation Networks with a Route Choice

Transportation Science
Reinforcement learning: a survey

Journal of Artificial Intelligence Research

Adaptive multi-objective reinforcement learning with hybrid exploration for traffic signal control based on cooperative multi-agent framework

Engineering Applications of Artificial Intelligence
A survey of multi-objective sequential decision-making

Journal of Artificial Intelligence Research

Quantified Score

Hi-index	0.00

Visualization

Abstract

We propose a newmultiobjective control algorithm based on reinforcement learning for urban traffic signal control, namedmulti-RL. A multiagent structure is used to describe the traffic system. A vehicular ad hoc network is used for the data exchange among agents. A reinforcement learning algorithm is applied to predict the overall value of the optimization objective given vehicles' states. The policy which minimizes the cumulative value of the optimization objective is regarded as the optimal one. In order to make the method adaptive to various traffic conditions, we also introduce a multiobjective control scheme in which the optimization objective is selected adaptively to real-time traffic states. The optimization objectives include the vehicle stops, the average waiting time, and the maximum queue length of the next intersection. In addition, we also accommodate a priority control to the buses and the emergency vehicles through ourmodel. The simulation results indicated that our algorithm could performmore efficiently than traditional traffic light control methods.