The use of MMR, diversity-based reranking for reordering documents and producing summaries
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Foundations of statistical natural language processing
Foundations of statistical natural language processing
Snowball: extracting relations from large plain-text collections
DL '00 Proceedings of the fifth ACM conference on Digital libraries
Automatic segmentation of text into structured records
SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Evaluation of DEFINDER: a system to mine definitions from consumer-oriented medical text
Proceedings of the 1st ACM/IEEE-CS joint conference on Digital libraries
Maximum Entropy Markov Models for Information Extraction and Segmentation
ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Mining topic-specific concepts and definitions on the web
WWW '03 Proceedings of the 12th international conference on World Wide Web
Sentence reduction for automatic text summarization
ANLC '00 Proceedings of the sixth conference on Applied natural language processing
Unsupervised learning of soft patterns for generating definitions from online news
Proceedings of the 13th international conference on World Wide Web
Web-scale information extraction in knowitall: (preliminary results)
Proceedings of the 13th international conference on World Wide Web
Evaluation of an extraction-based approach to answering definitional questions
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Introduction to the special issue on temporal information processing
ACM Transactions on Asian Language Information Processing (TALIP) - Special Issue on Temporal Information Processing
Centroid-based summarization of multiple documents
Information Processing and Management: an International Journal
Answering what-is questions by Virtual Annotation
HLT '01 Proceedings of the first international conference on Human language technology research
Producing biographical summaries: combining linguistic knowledge with corpus statistics
ACL '01 Proceedings of the 39th Annual Meeting on Association for Computational Linguistics
Learning surface text patterns for a Question Answering system
ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Automatic evaluation of summaries using N-gram co-occurrence statistics
NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
Evaluating answers to definition questions
NAACL-Short '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology: companion volume of the Proceedings of HLT-NAACL 2003--short papers - Volume 2
An improved extraction pattern representation model for automatic IE pattern acquisition
ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
Generic soft pattern models for definitional question answering
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Learning extraction patterns for subjective expressions
EMNLP '03 Proceedings of the 2003 conference on Empirical methods in natural language processing
Building a terminological database from heterogeneous definitional sources
dg.o '03 Proceedings of the 2003 annual national conference on Digital government research
Cascading use of soft and hard matching pattern rules for weakly supervised information extraction
COLING '04 Proceedings of the 20th international conference on Computational Linguistics
HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Automatically evaluating answers to definition questions
HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Will pyramids built of nuggets topple over?
HLT-NAACL '06 Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics
Hierarchical hidden Markov models for information extraction
IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Efficiently inducing features of conditional random fields
UAI'03 Proceedings of the Nineteenth conference on Uncertainty in Artificial Intelligence
ACM Transactions on Asian Language Information Processing (TALIP)
From text question-answering to multimedia QA on web-scale media resources
LS-MMRM '09 Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining
Retrieving good, better, and best answers to questions in advertisements
Proceedings of the eleventh international workshop on Web information and data management
Answering opinion questions with random walks on graphs
ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2 - Volume 2
Finding short definitions of terms on web pages
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3 - Volume 3
BioSnowball: automated population of Wikis
Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Extracting glosses to disambiguate word senses
HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Learning word-class lattices for definition and hypernym extraction
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
An automatic definition extraction in Arabic language
NLDB'10 Proceedings of the Natural language processing and information systems, and 15th international conference on Applications of natural language to information systems
Multimedia answering: enriching text QA with media information
Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
How to extract Arabic definitions from the web? Arabic definition question answering system
NLDB'11 Proceedings of the 16th international conference on Natural language processing and information systems
Contextual Language Models For Ranking Answers To Natural Language Definition Questions
Computational Intelligence
Learning regular expressions to template-based FAQ retrieval systems
Knowledge-Based Systems
Hi-index | 0.00 |
We explore probabilistic lexico-syntactic pattern matching, also known as soft pattern matching, in a definitional question answering system. Most current systems use regular expression-based hard matching patterns to identify definition sentences. Such rigid surface matching often fares poorly when faced with language variations. We propose two soft matching models to address this problem: one based on bigrams and the other on the Profile Hidden Markov Model (PHMM). Both models provide a theoretically sound method to model pattern matching as a probabilistic process that generates token sequences. We demonstrate the effectiveness of the models on definition sentence retrieval for definitional question answering. We show that both models significantly outperform the state-of-the-art manually constructed hard matching patterns on recent TREC data. A critical difference between the two models is that the PHMM has a more complex topology. We experimentally show that the PHMM can handle language variations more effectively but requires more training data to converge. While we evaluate soft pattern models only on definitional question answering, we believe that both models are generic and can be extended to other areas where lexico-syntactic pattern matching can be applied.