An evaluation framework for cross-lingual link discovery

Authors:
Ling-Xiang Tang;Shlomo Geva;Andrew Trotman;Yue Xu;Kelly Y. Itakura
Affiliations:
Science and Engineering Faculty, Queensland University of Technology, Brisbane, Australia;Science and Engineering Faculty, Queensland University of Technology, Brisbane, Australia;Department of Computer Science, University of Otago, Dunedin, New Zealand;Science and Engineering Faculty, Queensland University of Technology, Brisbane, Australia;National Institute of Informatics, Japan
Venue:
Information Processing and Management: an International Journal
Year:
2014

Abstract

Cross-Lingual Link Discovery (CLLD) is a new problem in Information Retrieval. The aim is to automatically identify meaningful and relevant hypertext links between documents in different languages. This is particularly helpful in knowledge discovery when a multi-lingual knowledge base is sparse in one language or another, or when the topical coverage differs across languages, as is the case with Wikipedia. Techniques for identifying new and topically relevant cross-lingual links are a current topic of interest at NTCIR, where the CrossLink task has been running since NTCIR-9 in 2011. This paper presents the evaluation framework used to benchmark cross-lingual link discovery algorithms in the context of NTCIR-9. The framework includes topics, document collections, assessments, metrics, and a toolkit for pooling, assessment, and evaluation. The assessments are divided into two separate sets: manual assessments performed by human assessors, and automatic assessments based on links extracted from Wikipedia itself. Using this framework, we show that manual assessment is more robust than automatic assessment in the context of cross-lingual link discovery.