Information categorization in web pages and sites
Web Intelligence and Agent Systems
Web page classification: Features and algorithms
ACM Computing Surveys (CSUR)
MCS'03 Proceedings of the 4th international conference on Multiple classifier systems
Core: a search and browsing tool for semantic instances of web sites
APWeb'05 Proceedings of the 7th Asia-Pacific web conference on Web Technologies Research and Development
On the utility of incremental feature selection for the classification of textual data streams
PCI'05 Proceedings of the 10th Panhellenic conference on Advances in Informatics
Automatic web pages hierarchical classification using dynamic domain ontologies
International Journal of Knowledge and Web Intelligence
Ontia iJADE: an intelligent ontology-based agent framework for semantic web service
KES'06 Proceedings of the 10th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part II
DEXA'07 Proceedings of the 18th international conference on Database and Expert Systems Applications
Hi-index | 0.00 |
Automatic classification of web pages is an effectiveway to deal with the difficulty of retrieving informationfrom the Internet. Although there are many automaticclassification algorithms and systems that have beenproposed, most of them ignore the conflict between thefixed number of categories and the growing number ofweb pages going into the system. They also requiresearching through all existing categories to make anyclassification. We propose a dynamic and hierarchicalclassification system that is capable of adding newcategories as required, organizing the web pages into atree structure, and classifying web pages by searchingthrough only one path of the tree structure. Our testresults show that our proposed single-path searchtechnique reduces the search complexity and increasesthe accuracy by 6% comparing to related algorithms. Ourdynamic-category expansion technique also achievessatisfying results on adding new categories into oursystem as required.