Zhishi.me: weaving chinese linking open data

  • Authors:
  • Xing Niu;Xinruo Sun;Haofen Wang;Shu Rong;Guilin Qi;Yong Yu

  • Affiliations:
  • APEX Data & Knowledge Management Lab, Shanghai Jiao Tong University;APEX Data & Knowledge Management Lab, Shanghai Jiao Tong University;APEX Data & Knowledge Management Lab, Shanghai Jiao Tong University;APEX Data & Knowledge Management Lab, Shanghai Jiao Tong University;Southeast University;APEX Data & Knowledge Management Lab, Shanghai Jiao Tong University

  • Venue:
  • ISWC'11 Proceedings of the 10th international conference on The semantic web - Volume Part II
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Linking Open Data (LOD) has become one of the most important community efforts to publish high-quality interconnected semantic data. Such data has been widely used in many applications to provide intelligent services like entity search, personalized recommendation and so on. While DBpedia, one of the LOD core data sources, contains resources described in multilingual versions and semantic data in English is proliferating, there is very few work on publishing Chinese semantic data. In this paper, we present Zhishi.me, the first effort to publish large scale Chinese semantic data and link them together as a Chinese LOD (CLOD). More precisely, we identify important structural features in three largest Chinese encyclopedia sites (i.e., Baidu Baike, Hudong Baike, and Chinese Wikipedia) for extraction and propose several data-level mapping strategies for automatic link discovery. As a result, the CLOD has more than 5 million distinct entities and we simply link CLOD with the existing LOD based on the multilingual characteristic of Wikipedia. Finally, we also introduce three Web access entries namely SPARQL endpoint, lookup interface and detailed data view, which conform to the principles of publishing data sources to LOD.