Searching the wikipedia with contextual information

Authors:
Antti Ukkonen;Carlos Castillo;Debora Donato;Aristides Gionis
Affiliations:
Helsinki University of Technology, Helsinki, Finland;Yahoo! Research, Barcelona, Spain;Yahoo! Research, Barcelona, Spain;Yahoo! Research, Barcelona, Spain
Venue:
Proceedings of the 17th ACM conference on Information and knowledge management
Year:
2008

Citing 6
Cited 5

The anatomy of a large-scale hypertextual Web search engine

WWW7 Proceedings of the seventh international conference on World Wide Web 7
Topic-sensitive PageRank

Proceedings of the 11th international conference on World Wide Web
Optimizing search engines using clickthrough data

Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
The webgraph framework I: compression techniques

Proceedings of the 13th international conference on World Wide Web
On spectral graph drawing

COCOON'03 Proceedings of the 9th annual international conference on Computing and combinatorics
Terrier information retrieval platform

ECIR'05 Proceedings of the 27th European conference on Advances in Information Retrieval Research

Fast shortest path distance estimation in large networks

Proceedings of the 18th ACM conference on Information and knowledge management
SPRINT: ranking search results by paths

Proceedings of the 14th International Conference on Extending Database Technology
Shortest-path queries for complex networks: exploiting low tree-width outside the core

Proceedings of the 15th International Conference on Extending Database Technology
Fast exact shortest-path distance queries on large networks by pruned landmark labeling

Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
Dynamic and historical shortest-path distance queries on large evolving networks by pruned landmark labeling

Proceedings of the 23rd international conference on World wide web

Quantified Score

Hi-index	0.00

Visualization

Abstract

We propose a framework for searching the Wikipedia with contextual information. Our framework extends the typical keyword search, by considering queries of the type (q,p), where q is a set of terms (as in classical Web search), and p is a source Wikipedia document. The query terms q represent the information that the user is interested in finding, and the document p provides the context of the query. The task is to rank other documents in Wikipedia with respect to their relevance to the query terms q given the context document p. By associating a context to the query terms, the search results of a search initiated in a particular page can be made more relevant. We suggest a number of features that extend the classical query-search model so that the context document p is considered. We then use RankSVM (Joachims 2002) to learn weights for the individual features given suitably constructed training data. Documents are ranked at query time using the inner product of the feature and the weight vectors. The experiments indicate that the proposed method considerably improves results obtained by a more traditional approach that does not take the context into account.