Modern Information Retrieval
IEEE Intelligent Systems
Proceedings of the 15th international conference on World Wide Web
Learning to link with wikipedia
Proceedings of the 17th ACM conference on Information and knowledge management
NNexus: An Automatic Linker for Collaborative Web-Based Corpora
IEEE Transactions on Knowledge and Data Engineering
Hi-index | 0.00 |
Collaborative online encyclopedias or knowledge bases such as Wikipedia and PlanetMath are becoming increasingly popular. We demonstrate NNexus, a generalization of the automatic linking engine of PlanetMath.org and the first system that automates the process of linking disparate "encyclopedia" entries into a fully-connected conceptual network. The main challenges of this problem space include: 1) linking quality (correctly identifying which terms to link and which entry to link to with minimal effort on the part of users), 2) efficiency and scalability, and 3) generalization to multiple knowledge bases and web-based information environment. We present NNexus that utilizes subject classification and other metadata to address these challenges and demonstrate its effectiveness and efficiency through multiple real world corpora.