Enterprise data classification using semantic web technologies
ISWC'10 Proceedings of the 9th international semantic web conference on The semantic web - Volume Part II
Feature annotation for text categorization
Proceedings of the CUBE International Information Technology Conference
Current classification methods are based on the “Bag of Words” (BOW) representation, which accounts only for term frequency in the documents and ignores important semantic relationships between key terms. In this paper, we propose a system that uses ontologies and Natural Language Processing techniques to index texts. The traditional BOW matrix is replaced by a “Bag of Concepts” (BOC) matrix. For this purpose, we developed fully automated methods for mapping keywords to their corresponding ontology concepts. A Support Vector Machine (SVM), a successful machine learning technique, is used for classification. Experimental results show that the proposed method significantly improves text classification performance.
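The difference between the two representations can be illustrated with a minimal sketch. It is not the system described in the abstract: the ontology here is a hypothetical hand-written dictionary (`TOY_ONTOLOGY`), the function names are invented for illustration, and the keyword-to-concept mapping is a plain lookup rather than the automated NLP-based mapping the paper refers to.

```python
import re
from collections import Counter

# Hypothetical toy ontology (an assumption for illustration only):
# each surface keyword maps to one concept label.
TOY_ONTOLOGY = {
    "car": "Vehicle",
    "automobile": "Vehicle",
    "truck": "Vehicle",
    "physician": "Doctor",
    "doctor": "Doctor",
}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def bag_of_words(text):
    """Count raw term frequencies (the BOW representation)."""
    return Counter(tokenize(text))


def bag_of_concepts(text, ontology=TOY_ONTOLOGY):
    """Count concept frequencies (a simplified BOC representation).

    Tokens with no entry in the ontology are ignored in this sketch.
    """
    concepts = Counter()
    for token in tokenize(text):
        concept = ontology.get(token)
        if concept is not None:
            concepts[concept] += 1
    return concepts


doc_a = "The physician bought a car."
doc_b = "A doctor drove an automobile."

# The two documents share only the word "a" under BOW,
# but they have identical concept counts under BOC.
shared_words = set(bag_of_words(doc_a)) & set(bag_of_words(doc_b))
print(shared_words)
print(bag_of_concepts(doc_a))
print(bag_of_concepts(doc_b))
```

In a full pipeline, the concept counts would be arranged into a document-by-concept matrix and passed to a classifier such as an SVM in place of the document-by-term matrix.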