Wikipedia has become one of the largest knowledge bases on the Web, attracting 513 million page views per day in January 2012. One critical issue, however, is that its coverage is highly unbalanced across languages. For example, English Wikipedia has reached 3.8 million articles, while Chinese Wikipedia still has fewer than half a million, and only 217 thousand cross-lingual links connect articles in the two languages. At the same time, Baidu Baike and Hudong.com, two popular Chinese encyclopedias, together hold more than 3.9 million Chinese wiki articles. An important question is therefore how to link knowledge entries that are distributed across different knowledge bases. Such links would greatly enrich the information in online knowledge bases and benefit many applications. In this paper, we study the problem of cross-lingual knowledge linking and present a linkage factor graph model, with features defined according to several observations about the data. Experiments on the Wikipedia data set show that our approach achieves a precision of 85.8% with a recall of 88.1%. The approach found 202,141 new cross-lingual links between English Wikipedia and Baidu Baike.
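To make the reported figures concrete, the following is a minimal sketch of how precision and recall are typically computed for a set of predicted cross-lingual links against known-correct links. The function name, the toy article pairs, and the F1 calculation are illustrative assumptions, not taken from the paper's evaluation code; only the 85.8% and 88.1% values come from the abstract.

```python
def precision_recall_f1(predicted, gold):
    """Score predicted links against a set of known-correct (gold) links."""
    predicted, gold = set(predicted), set(gold)
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Hypothetical toy data: (English title, Chinese title) pairs.
gold = {("Beijing", "北京"), ("Physics", "物理学"), ("Tea", "茶"), ("Moon", "月球")}
predicted = {("Beijing", "北京"), ("Physics", "物理学"), ("Tea", "茶叶"), ("Moon", "月球")}

p, r, f = precision_recall_f1(predicted, gold)

# Harmonic mean (F1) of the precision and recall reported in the abstract.
reported_f1 = 2 * 0.858 * 0.881 / (0.858 + 0.881)
```

In the toy data, three of the four predicted pairs match the gold set, so precision and recall are both 0.75. Applying the same harmonic-mean formula to the reported precision and recall gives an F1 of roughly 0.87.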