Cross-Language Latent Relational Search between Japanese and English Languages Using a Web Corpus

Authors:
Nguyen Tuan Duc;Danushka Bollegala;Mitsuru Ishizuka
Affiliations:
The University of Tokyo;The University of Tokyo;The University of Tokyo
Venue:
ACM Transactions on Asian Language Information Processing (TALIP)
Year:
2012

Citing 32
Cited 0

High-level perception, representation, and analogy: a critique of artificial intelligence methodology

Journal of Experimental & Theoretical Artificial Intelligence
Fluid concepts and creative analogies: computer models of the fundamental mechanisms of thought

Fluid concepts and creative analogies: computer models of the fundamental mechanisms of thought
Scaling question answering to the Web

Proceedings of the 10th international conference on World Wide Web
On the MSE robustness of batching estimators

Proceedings of the 33nd conference on Winter simulation
Message Understanding Conference-6: a brief history

COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 1
Evaluating high accuracy retrieval techniques

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Language identification: a solved problem suitable for undergraduate instruction

Journal of Computing Sciences in Colleges
A multilingual usage consultation tool based on internet searching: more than a search engine, less than QA

WWW '05 Proceedings of the 14th international conference on World Wide Web
Learning surface text patterns for a Question Answering system

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Example-based machine translation using DP-matching between word sequences

DMMT '01 Proceedings of the workshop on Data-driven methods in machine translation - Volume 14
Similarity of Semantic Relations

Computational Linguistics
Espresso: leveraging generic patterns for automatically harvesting semantic relations

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Preemptive information extraction using unrestricted relation discovery

HLT-NAACL '06 Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics
Multilingual phrase-based concordance generation in real-time

Information Retrieval
Hits on the web: how does it compare?

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
A simple and efficient sampling method for estimating AP and NDCG

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Introduction to Information Retrieval

Introduction to Information Retrieval
Measuring the similarity between implicit semantic relations using web search engines

Proceedings of the Second ACM International Conference on Web Search and Data Mining
Measuring the similarity between implicit semantic relations from the web

Proceedings of the 18th international conference on World wide web
Moses: open source toolkit for statistical machine translation

ACL '07 Proceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions
A uniform approach to analogies, synonyms, antonyms, and associations

COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Exploiting Wikipedia and EuroWordNet to solve Cross-Lingual Question Answering

Information Sciences: an International Journal
The latent relation mapping engine: algorithm and experiments

Journal of Artificial Intelligence Research
Open information extraction from the web

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Measuring semantic similarity by latent relational analysis

IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Query by analogical example: relational search using web search engine indices

Proceedings of the 18th ACM conference on Information and knowledge management
Relational duality: unsupervised extraction of semantic relations between entities on the web

Proceedings of the 19th international conference on World wide web
Cross-language text classification using structural correspondence learning

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Automated translation of semantic relationships

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Exploiting symmetry in relational similarity for ranking relational search results

PRICAI'10 Proceedings of the 11th Pacific Rim international conference on Trends in artificial intelligence
Overview of ResPubliQA 2009: question answering evaluation over European legislation

CLEF'09 Proceedings of the 10th cross-language evaluation forum conference on Multilingual information access evaluation: text retrieval experiments
Using Relational Similarity between Word Pairs for Latent Relational Search on the Web

WI-IAT '10 Proceedings of the 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 01

Quantified Score

Hi-index	0.00

Visualization

Abstract

Latent relational search is a novel entity retrieval paradigm based on the proportional analogy between two entity pairs. Given a latent relational search query {(Japan, Tokyo), (France, ?)}, a latent relational search engine is expected to retrieve and rank the entity “Paris” as the first answer in the result list. A latent relational search engine extracts entities and relations between those entities from a corpus, such as the Web. Moreover, from some supporting sentences in the corpus, (e.g., “Tokyo is the capital of Japan” and “Paris is the capital and biggest city of France”), the search engine must recognize the relational similarity between the two entity pairs. In cross-language latent relational search, the entity pairs as well as the supporting sentences of the first entity pair and of the second entity pair are in different languages. Therefore, the search engine must recognize similar semantic relations across languages. In this article, we study the problem of cross-language latent relational search between Japanese and English using Web data. To perform cross-language latent relational search in high speed, we propose a multi-lingual indexing method for storing entities and lexical patterns that represent the semantic relations extracted from Web corpora. We then propose a hybrid lexical pattern clustering algorithm to capture the semantic similarity between lexical patterns across languages. Using this algorithm, we can precisely measure the relational similarity between entity pairs across languages, thereby achieving high precision in the task of cross-language latent relational search. Experiments show that the proposed method achieves an MRR of 0.605 on Japanese-English cross-language latent relational search query sets and it also achieves a reasonable performance on the INEX Entity Ranking task.