User simulations for evaluating answers to question series

Authors:
Jimmy Lin
Affiliations:
College of Information Studies, Department of Computer Science, Institute for Advanced Computer Studies, University of Maryland, College Park, MD 20742, USA
Venue:
Information Processing and Management: an International Journal
Year:
2007

Abstract

Recently, question series have become one focus of research in question answering. These series are composed of individual factoid, list, and "other" questions organized around a central topic, and represent abstractions of user-system dialogs. Existing evaluation methodologies have yet to catch up with this richer task model, as they fail to take into account contextual dependencies and different user behaviors. This paper presents a novel simulation-based methodology for evaluating answers to question series that addresses some of these shortcomings. Using this methodology, we examine two different behavior models: a "QA-styled" user and an "IR-styled" user. Results suggest that an off-the-shelf document retrieval system is competitive with state-of-the-art QA systems in this task. Advantages and limitations of evaluations based on user simulations are also discussed.