Inferring decision trees using the minimum description length principle
Information and Computation
The nature of statistical learning theory
The nature of statistical learning theory
Mining Text Using Keyword Distributions
Journal of Intelligent Information Systems
Websom for Textual Data Mining
Artificial Intelligence Review - Special issue on data mining on the Internet
Introduction to Modern Information Retrieval
Introduction to Modern Information Retrieval
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features
ECML '98 Proceedings of the 10th European Conference on Machine Learning
Acquisition of a Knowledge Dictionary from Training Examples Including Multiple Values
ISMIS '02 Proceedings of the 13th International Symposium on Foundations of Intelligent Systems
Inductive Learning of a Knowledge Dictionary for a Text Mining System
Proceedings of the 14th International conference on Industrial and engineering applications of artificial intelligence and expert systems: engineering of intelligent systems
Combining clustering and co-training to enhance text classification using unlabelled data
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Proceedings of the 2011 ACM Symposium on Applied Computing
A pattern-based voting approach for concept discovery on the web
APWeb'05 Proceedings of the 7th Asia-Pacific web conference on Web Technologies Research and Development
Analysis of Textual Data Based on Inductive Learning Techniques
International Journal of Information Retrieval Research
Hi-index | 0.00 |
This paper proposes a new method for discovering rules from textual data. The method decomposes textual data into word sets by using lexical analysis, generates training examples from both key phrase relations extracted from the word sets by using key phrase patterns and text classes given by the user, and acquires key phrase relation rules from the examples by using a fuzzy inductive learning algorithm. The method is also able to deal with textual data that requires word segmentation, such as Japanese text. This paper reports on the application of the method to e-mail analysis tasks for a customer center. The e-mails are written in Japanese and have two analytical criteria: a product criterion and a contents criterion. We evaluate the acquired rules in each criterion.