A probabilistic description-oriented approach for categorizing web documents
Proceedings of the eighth international conference on Information and knowledge management
Hierarchically Classifying Documents Using Very Few Words
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Improving Text Classification by Shrinkage in a Hierarchy of Classes
ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Use of a Weighted Topic Hierarchy for Document Classification
TSD '99 Proceedings of the Second International Workshop on Text, Speech and Dialogue
Knowledge-based automatic topic identification
ACL '95 Proceedings of the 33rd annual meeting on Association for Computational Linguistics
Grasping related words of unknown word for automatic extension of lexical dictionary
Proceedings of the 1st international conference on Forensic applications and techniques in telecommunications, information, and multimedia and workshop
Hi-index | 0.00 |
This paper proposes a topic selection method for web documents using ontology hierarchy. The idea of this approach is to utilize the ontology structure in order to determine a topic in a web document. In this paper, we propose an approach for improving the performance of document clustering as we select the topic efficiently based on domain ontology. We preprocess the web documents for keywords extraction using Term Frequency formula and we build domain ontology as we branch off the partial hierarchy from WordNet using an automatic domain ontology building tool in preprocessing step. And we select a topic for the web documents based on domain ontology structure. Finally we realized that our approach contributes the efficient document clustering.