Genetic programming: on the programming of computers by means of natural selection
Genetic programming: on the programming of computers by means of natural selection
A probabilistic description-oriented approach for categorizing web documents
Proceedings of the eighth international conference on Information and knowledge management
A Study of Two Sampling Methods for Analyzing Large Datasets with ILP
Data Mining and Knowledge Discovery
From Searching to Browsing through Multimodal Documents Linking
ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
Journal of the American Society for Information Science and Technology
Relevance feedback based on genetic programming for image retrieval
Pattern Recognition Letters
A relevance feedback method based on genetic programming for classification of remote sensing images
Information Sciences: an International Journal
Exploiting reference section to classify paper's topics
Proceedings of the International Conference on Management of Emergent Digital EcoSystems
Multimodal retrieval with relevance feedback based on genetic programming
Multimedia Tools and Applications
Hi-index | 0.00 |
This paper discusses how citation-based information and structural content (e.g., title, abstract) can be combined to improve classification of text documents into predefined categories. We evaluate different measures of similarity derived from the citation structure and the structural content of the collection, and determine how they can be fused to improve classification effectiveness. To discover the best fusion framework, we apply Genetic Programming (GP) techniques. Our empirical experiments using documents from the ACM Digital Library and the ACM Computing Classification System show that we can discover similarity functions that work better than using evidence in isolation and whose combined performance through a simple majority voting is comparable to that of Support Vector Machine classifiers.