TCS: a shell for content-based text categorization
Proceedings of the sixth conference on Artificial intelligence applications
Genetic programming: on the programming of computers by means of natural selection
Genetic programming: on the programming of computers by means of natural selection
Automated learning of decision rules for text categorization
ACM Transactions on Information Systems (TOIS)
Automatic text decomposition using text segments and text themes
Proceedings of the the seventh ACM conference on Hypertext
Autonomous document classification for business
AGENTS '97 Proceedings of the first international conference on Autonomous agents
Proceedings of the 5th international conference on Intelligent user interfaces
Enlarging the Margins in Perceptron Decision Trees
Machine Learning
Machine learning in automated text categorization
ACM Computing Surveys (CSUR)
Information Retrieval
The use of bigrams to enhance text categorization
Information Processing and Management: an International Journal
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features
ECML '98 Proceedings of the 10th European Conference on Machine Learning
Text Document Categorization by Term Association
ICDM '02 Proceedings of the 2002 IEEE International Conference on Data Mining
An analysis of the relative hardness of Reuters-21578 subsets: Research Articles
Journal of the American Society for Information Science and Technology
Strongly typed genetic programming
Evolutionary Computation
AAAI'96 Proceedings of the thirteenth national conference on Artificial intelligence - Volume 1
TRIPPER: rule learning using taxonomies
PAKDD'06 Proceedings of the 10th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining
A survey on the application of genetic programming to classification
IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
Using feature construction to avoid large feature spaces in text classification
Proceedings of the 12th annual conference on Genetic and evolutionary computation
Sentiment classification using automatically extracted subgraph features
CAAGET '10 Proceedings of the NAACL HLT 2010 Workshop on Computational Approaches to Analysis and Generation of Emotion in Text
Hi-index | 0.01 |
We describe a method for generating accurate, compact, human understandable text classifiers. Text datasets are indexed using Apache Lucene and Genetic Programs are used to construct Lucene search queries. Genetic programs acquire fitness by producing queries that are effective binary classifiers for a particular category when evaluated against a set of training documents. We describe a set of functions and terminals and provide results from classification tasks.