MURAX: a robust linguistic approach for question answering using an on-line encyclopedia
SIGIR '93 Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval
Scaling question answering to the web
ACM Transactions on Information Systems (TOIS)
TnT: a statistical part-of-speech tagger
ANLC '00 Proceedings of the sixth conference on Applied natural language processing
ANLC '00 Proceedings of the sixth conference on Applied natural language processing
New Directions in Question Answering
New Directions in Question Answering
Probabilistic question answering on the Web: Research Articles
Journal of the American Society for Information Science and Technology
Learning surface text patterns for a Question Answering system
ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Lightweight web-based fact repositories for textual question answering
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Clustering and exploring search results using timeline constructions
Proceedings of the 18th ACM conference on Information and knowledge management
Towards recency ranking in web search
Proceedings of the third ACM international conference on Web search and data mining
From information to knowledge: harvesting entities and relationships from web sources
Proceedings of the twenty-ninth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Harvesting facts from textual web sources by constrained label propagation
Proceedings of the 20th ACM international conference on Information and knowledge management
Chapter 3: search for knowledge
Search Computing
Handling temporal information in web search engines
ACM SIGMOD Record
YAGO2: A spatially and temporally enhanced knowledge base from Wikipedia
Artificial Intelligence
Hi-index | 0.00 |
This paper shifts the focus of Web search towards finding and exploiting small text nuggets, rather than full-length documents, assumming that the type of targeted information (e.g., date) is specified in the queries. Each nugget is a document sentence fragment that encodes open-domain factual information associated to some entity. The entities are dates (e.g., 1971) when the events captured by the nuggets must have occurred. The nuggets are extracted from the unstructured text of Web documents, based on lightweight text processing. Since per-document time stamps are available only in restricted collections such as news articles, but not in arbitrary Web documents, the extraction of the relevant dates relies on simple, local text analysis. The resulting time-stamped text nuggets have immediate applications in Web search, as direct results for queries asking about the date of an event explicitly (e.g., "When was the transistor invented") or implicitly (e.g., "Golden Gate Bridge built").