A semantic matching of information segments for tolerating error chinese words

  • Authors:
  • Maoyuan Zhang;Chunyan Zou;Zhengding Lu;Zhigang Wang

  • Affiliations:
  • Department of Computer Science and Technology, HuaZhong University of Science and Technology, Wuhan, P.R. China;School of Foreign Languages, HuaZhong Normal University, Wuhan, P.R. China;Department of Computer Science and Technology, HuaZhong University of Science and Technology, Wuhan, P.R. China;Department of Computer Science and Technology, HuaZhong University of Science and Technology, Wuhan, P.R. China

  • Venue:
  • WISE'06 Proceedings of the 7th international conference on Web Information Systems
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

There exist new words and error words in Chinese information of web pages. In this paper, we introduce our definition of semantic similarity between sememes and their theorems. On the base of proving the theorems, the influence of the parameter is analyzed. Moreover, this paper presents a novel definition of the word similarity based on the sememe similarity, which can be used to match the new Chinese words with the existing Chinese words and match the error Chinese words with correct Chinese words. And also, based on the novel word similarity, a matching method of information segments is presented to recognize the category of Chinese web information segments, in which new words and error words occur. In addition, the experiment of the matching methods is presented. Therefore, the novel matching method is an efficient method both in theory and from experimental results.