1 Billion Pages = 1 Million Dollars? mining the web to play "who wants to be a millionaire?"

Authors:
Shyong K. Lam;David M. Pennock;Dan Cosley;Steve Lawrence
Affiliations:
Computer Science Dept., University of Minnesota, Minneapolis, MN;Overture Services, Inc., Pasadena, CA;Computer Science Dept., University of Minnesota, Minneapolis, MN;NEC Laboratories America, Princeton, NJ
Venue:
UAI'03 Proceedings of the Nineteenth conference on Uncertainty in Artificial Intelligence
Year:
2002

Citing 20
Cited 5

Probabilistic inference and influence diagrams

Operations Research
Automatic text processing: the transformation, analysis, and retrieval of information by computer

Automatic text processing: the transformation, analysis, and retrieval of information by computer
The development of a world class Othello program

Artificial Intelligence - Special issue on computer chess
A world championship caliber checkers program

Artificial Intelligence
CYC: a large-scale investment in knowledge infrastructure

Communications of the ACM
Deep Blue system overview

ICS '95 Proceedings of the 9th international conference on Supercomputing
Proverb: the probabilistic cruciverbalist

AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
Intelligent agents in computer games

AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
Retrieving descriptive phrases from large amounts of free text

Proceedings of the ninth international conference on Information and knowledge management
Learning search engine specific query transformations for question answering

Proceedings of the 10th international conference on World Wide Web
Exploiting redundancy in question answering

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Computer Go: an AI oriented survey

Artificial Intelligence
The penguin: using the web as a database for descriptive and dynamic grammar and spell checking

CHI '02 Extended Abstracts on Human Factors in Computing Systems
Probabilistic question answering on the web

Proceedings of the 11th international conference on World Wide Web
Introduction to Bayesian Networks

Introduction to Bayesian Networks
On the MSE robustness of batching estimators

Proceedings of the 33nd conference on Winter simulation
Context and Page Analysis for Improved Web Search

IEEE Internet Computing
Scaling Reinforcement Learning toward RoboCup Soccer

ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
A Real World Implementation of Answer Extraction

DEXA '98 Proceedings of the 9th International Workshop on Database and Expert Systems Applications
Answer extraction

ANLC '00 Proceedings of the sixth conference on Applied natural language processing

Is question answering an acquired skill?

Proceedings of the 13th international conference on World Wide Web
"Language Is the Skin of My Thought": Integrating Wikipedia and AI to Support a Guillotine Player

AI*IA '09: Proceedings of the XIth International Conference of the Italian Association for Artificial Intelligence Reggio Emilia on Emergent Perspectives in Artificial Intelligence
Using syntactic and semantic structural kernels for classifying definition questions in Jeopardy!

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Web-Based multiple choice question answering for english and arabic questions

ECIR'06 Proceedings of the 28th European conference on Advances in Information Retrieval
Besting the quiz master: crowdsourcing incremental classification games

EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning

Quantified Score

Hi-index	0.00

Visualization

Abstract

We exploit the redundancy and volume of information on the web to build a computerized player for the ABC TV game show "Who Wants To Be A Millionaire?". The player consists of a question-answering module and a decision-making module. The question-answering module utilizes question transformation techniques, natural language parsing, multiple information retrieval algorithms, and multiple search engines; results are combined in the spirit of ensemble learning using an adaptive weighting scheme. Empirically, the system correctly answers about 75% of questions from the Millionaire CD-ROM, 3rd edition--general-interest trivia questions often about popular culture and common knowledge. The decision-making module chooses from allowable actions in the game in order to maximize expected risk-adjusted winnings, where the estimated probability of answering correctly is a function of past performance and confidence in correctly answering the current question. When given a six question head start (i.e., when starting from the $2,000 level), we find that the system performs about as well on average as humans starting at the beginning. Our system demonstrates the potential of simple but well-chosen techniques for mining answers from unstructured information such as the web.