Introduction to Reinforcement Learning
Introduction to Reinforcement Learning
Partially observable Markov decision processes for spoken dialog systems
Computer Speech and Language
The hidden information state dialogue manager: a real-world POMDP-based system
NAACL-Demonstrations '07 Proceedings of Human Language Technologies: The Annual Conference of the North American Chapter of the Association for Computational Linguistics: Demonstrations
Scaling POMDPs for Spoken Dialog Management
IEEE Transactions on Audio, Speech, and Language Processing
ACM Transactions on Speech and Language Processing (TSLP)
Hi-index | 0.00 |
This is a demonstration of a voice dialer, implemented as a partially observable Markov decision process (POMDP). A real-time graphical display shows the POMDP's probability distribution over different possible dialog states, and shows how system output is generated and selected. The system demonstrated here includes several recent advances, including an action selection mechanism which unifies a hand-crafted controller and reinforcement learning. The voice dialer itself is in use today in AT&T Labs and receives daily calls.