GPX: Ad-Hoc Queries and Automated Link Discovery in the Wikipedia

Authors:
Shlomo Geva
Affiliations:
Faculty of IT, Queensland University of Technology, Brisbane, Australia
Venue:
Focused Access to XML Documents
Year:
2008

Citing 9
Cited 3

On the measurement of inter-linker consistency and retrieval effectiveness in hypertext databases

SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
Building hypertext using information retrieval

Information Processing and Management: an International Journal - Special issue: methods and tools for the automatic construction of hypertext
Automated link generation: can we do better than term repetition?

WWW7 Proceedings of the seventh international conference on World Wide Web 7
Automatic link generation

ACM Computing Surveys (CSUR)
Building Hypertext Links By Computing Semantic Similarity

IEEE Transactions on Knowledge and Data Engineering
From Keywords to Links: an Automatic Approach

ITCC '04 Proceedings of the International Conference on Information Technology: Coding and Computing (ITCC'04) Volume 2 - Volume 2
Discovering missing links in Wikipedia

Proceedings of the 3rd international workshop on Link discovery
The Wikipedia XML corpus

ACM SIGIR Forum
Comparative Evaluation of XML Information Retrieval Systems: 5th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2006, Dagstuhl Castle, Germany, December 17-20, 2006, Revised and Selected Papers

Comparative Evaluation of XML Information Retrieval Systems: 5th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2006, Dagstuhl Castle, Germany, December 17-20, 2006, Revised and Selected Papers

Automatic generation of inter-passage links based on semantic similarity

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Overview of the INEX 2010 link the wiki track

INEX'10 Proceedings of the 9th international conference on Initiative for the evaluation of XML retrieval: comparative evaluation of focused retrieval
University of Otago at INEX 2010

INEX'10 Proceedings of the 9th international conference on Initiative for the evaluation of XML retrieval: comparative evaluation of focused retrieval

Quantified Score

Hi-index	0.00

Visualization

Abstract

The INEX 2007 evaluation was based on the Wikipedia collection. In this paper we describe some modifications to the GPX search engine and the approach taken in the Ad-hoc and the Link-the-Wiki tracks. In earlier version of GPX scores were recursively propagated from text containing nodes, through ancestors, all the way to the document root of the XML tree. In this paper we describe a simplification whereby the score of each node is computed directly, doing away with the score propagation mechanism. Results indicate slightly improved performance. The GPX search engine was used in the Link-the-Wiki track to identify prospective incoming links to new Wikipedia pages. We also describe a simple and efficient approach to the identification of prospective outgoing links in new Wikipedia pages. We present and discuss evaluation results.