CCE: a chinese concept encyclopedia incorporating the expert-edited chinese concept dictionary with online cyclopedias

  • Authors:
  • Jiazhen Nian;Shan Jiang;Congrui Huang;Yan Zhang

  • Affiliations:
  • Department of Machine Intelligence, Peking University, Beijing, P.R. China;Department of Machine Intelligence, Peking University, Beijing, P.R. China;Department of Machine Intelligence, Peking University, Beijing, P.R. China;Department of Machine Intelligence, Peking University, Beijing, P.R. China

  • Venue:
  • ADMA'11 Proceedings of the 7th international conference on Advanced Data Mining and Applications - Volume Part I
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Bag-of-words is the most common-used method in text mining tasks and many other applications. However, this method has some obvious shortcomings, such as ignoring semantic information. While in document analysis, semantic information always plays a more important role than individual words. To tackle this problem, we need to borrow semantic information from ontologies to learn the text information better. An expert-edited ontology is usually well structured and is more authoritative than an online cyclopedia. On the other hand, due to the costly editing, it is rather difficult for expert-edited ontologies to keep up with a deluge of new words. In this paper, we propose a method to construct a Chinese ontology to keep the carefully-designed structure of an expert-edited ontology, meanwhile embody new vocabulary from an online cyclopedia. We name the enhanced ontology as Chinese Concept Encyclopedia (CCE) and employ it in some text mining applications. The experimental results show that CCE outperforms the expert-edited ontology Chinese Concept Dictionary (CCD).