A semantic matching of information segments for tolerating error chinese words

Authors:
Maoyuan Zhang;Chunyan Zou;Zhengding Lu;Zhigang Wang
Affiliations:
Department of Computer Science and Technology, HuaZhong University of Science and Technology, Wuhan, P.R. China;School of Foreign Languages, HuaZhong Normal University, Wuhan, P.R. China;Department of Computer Science and Technology, HuaZhong University of Science and Technology, Wuhan, P.R. China;Department of Computer Science and Technology, HuaZhong University of Science and Technology, Wuhan, P.R. China
Venue:
WISE'06 Proceedings of the 7th international conference on Web Information Systems
Year:
2006

Citing 9
Cited 0

Determining Semantic Similarity among Entity Classes from Different Ontologies

IEEE Transactions on Knowledge and Data Engineering
Ontology Based Semantic Similarity Comparison of Documents

DEXA '03 Proceedings of the 14th International Workshop on Database and Expert Systems Applications
Ontology acquisition and semantic retrieval from semantic annotated chinese poetry

Proceedings of the 4th ACM/IEEE-CS joint conference on Digital libraries
Efficient Semantic-Based Content Search in P2P Network

IEEE Transactions on Knowledge and Data Engineering
A Chinese word segmentation based on language situation in processing ambiguous words

Information Sciences: an International Journal
A Fuzzy Classification Based on Feature Selection for Web Pages

WI '04 Proceedings of the 2004 IEEE/WIC/ACM International Conference on Web Intelligence
An Ontology Search Engine Based on Semantic Analysis

ICITA '05 Proceedings of the Third International Conference on Information Technology and Applications (ICITA'05) Volume 2 - Volume 02
XML application schema matching using similarity measure and relaxation labeling

Information Sciences: an International Journal
Integrating Element and Term Semantics for Similarity-Based XML Document Clustering

WI '05 Proceedings of the 2005 IEEE/WIC/ACM International Conference on Web Intelligence

Quantified Score

Hi-index	0.00

Visualization

Abstract

There exist new words and error words in Chinese information of web pages. In this paper, we introduce our definition of semantic similarity between sememes and their theorems. On the base of proving the theorems, the influence of the parameter is analyzed. Moreover, this paper presents a novel definition of the word similarity based on the sememe similarity, which can be used to match the new Chinese words with the existing Chinese words and match the error Chinese words with correct Chinese words. And also, based on the novel word similarity, a matching method of information segments is presented to recognize the category of Chinese web information segments, in which new words and error words occur. In addition, the experiment of the matching methods is presented. Therefore, the novel matching method is an efficient method both in theory and from experimental results.