Term-weighting approaches in automatic text retrieval
Information Processing and Management: an International Journal
Enabling technology for knowledge sharing
AI Magazine
C4.5: programs for machine learning
C4.5: programs for machine learning
The nature of statistical learning theory
The nature of statistical learning theory
Machine Learning
Machine learning in automated text categorization
ACM Computing Surveys (CSUR)
Machine Learning
Guest Editors' Introduction: Ontologies
IEEE Intelligent Systems
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features
ECML '98 Proceedings of the 10th European Conference on Machine Learning
Text categorization based on k-nearest neighbor approach for web site classification
Information Processing and Management: an International Journal
Web page feature selection and classification using neural networks
Information Sciences—Informatics and Computer Science: An International Journal - Special issue: Informatics and computer science intelligent systems applications
RCV1: A New Benchmark Collection for Text Categorization Research
The Journal of Machine Learning Research
Neighbor-weighted K-nearest neighbor for unbalanced text corpus
Expert Systems with Applications: An International Journal
An overview of statistical learning theory
IEEE Transactions on Neural Networks
Multilayer SOM with tree-structured data for efficient document retrieval and plagiarism detection
IEEE Transactions on Neural Networks
Ontology-based document profile for vulnerability relevancy analysis
ACS'10 Proceedings of the 10th WSEAS international conference on Applied computer science
Expert Systems with Applications: An International Journal
Hi-index | 0.00 |
This paper proposes a new method for document categorization, based on support vector machine (SVM) using a concept vector model (CVM). The traditional document classification usually ignores the semantic relations among the keywords or documents. To effectively solve the semantic problem, the domain ontology is used to capture the semantic information among different terms or keywords in the documents. Using the concept vector model, domain-related semantic information more exactly from documents can be extracted. In the model, concept vector is extracted from a document by the matching method. According to concept features of the documents, documents are classified into a suitable category by SVM. The experimental results show that our CVM method yields higher accuracy compared to the traditional term-based vector space model (VSM) methods.