TCS: a shell for content-based text categorization
Proceedings of the sixth conference on Artificial intelligence applications
Genetic programming: on the programming of computers by means of natural selection
Genetic programming: on the programming of computers by means of natural selection
Automated learning of decision rules for text categorization
ACM Transactions on Information Systems (TOIS)
Interactive volumetric information visualization for document corpus management
Proceedings of the conference on Graphics interface '97
Autonomous document classification for business
AGENTS '97 Proceedings of the first international conference on Autonomous agents
Proceedings of the 5th international conference on Intelligent user interfaces
Adaptive information filtering using evolutionary computation
Information Sciences: an International Journal - Special issue on frontiers in evolutionary algorithms
Enlarging the Margins in Perceptron Decision Trees
Machine Learning
Machine learning in automated text categorization
ACM Computing Surveys (CSUR)
Information Retrieval
Introduction to Modern Information Retrieval
Introduction to Modern Information Retrieval
The use of bigrams to enhance text categorization
Information Processing and Management: an International Journal
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features
ECML '98 Proceedings of the 10th European Conference on Machine Learning
PKDD '98 Proceedings of the Second European Symposium on Principles of Data Mining and Knowledge Discovery
Strongly typed genetic programming
Evolutionary Computation
Genetic programming for protein related text classification
Proceedings of the 11th Annual conference on Genetic and evolutionary computation
EuroGP'08 Proceedings of the 11th European conference on Genetic programming
User behavior analysis of the open-ended document classification system
AI'06 Proceedings of the 19th Australian joint conference on Artificial Intelligence: advances in Artificial Intelligence
Hi-index | 0.00 |
We describe a novel method for using Genetic Programming to create compact classification rules based on combinations of N-Grams (character strings). Genetic programs acquire fitness by producing rules that are effective classifiers in terms of precision and recall when evaluated against a set of training documents. We describe a set of functions and terminals and provide results from a classification task using the Reuters 21578 dataset. We also suggest that because the induced rules are meaningful to a human analyst they may have a number of other uses beyond classification and provide a basis for text mining applications.