Rule discovery from textual data based on key phrase patterns

Authors:
Shigeaki Sakurai;Akihiro Suyama
Affiliations:
Toshiba Corporation, 1, Komukai Toshiba-cho Saiwai-ku, Kawasaki, Japan;Toshiba Corporation, 1, Komukai Toshiba-cho Saiwai-ku, Kawasaki, Japan
Venue:
Proceedings of the 2004 ACM Symposium on Applied Computing
Year:
2004

Abstract

This paper proposes a new method for discovering rules from textual data. The method works in three steps. First, it decomposes the textual data into word sets by lexical analysis. Second, it extracts key phrase relations from the word sets by using key phrase patterns and combines them with text classes given by the user to generate training examples. Third, it acquires key phrase relation rules from the examples by using a fuzzy inductive learning algorithm. The method can also handle textual data that requires word segmentation, such as Japanese text. This paper reports on the application of the method to e-mail analysis tasks for a customer center. The e-mails are written in Japanese and are analyzed under two criteria: a product criterion and a contents criterion. We evaluate the rules acquired for each criterion.
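
The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The pattern format, the function names, and the sample e-mails are all hypothetical. Whitespace splitting stands in for lexical analysis (Japanese text would need a morphological analyzer instead), and a simple majority-class count stands in for the fuzzy inductive learning algorithm, which the abstract does not specify.

```python
from collections import Counter, defaultdict

# Hypothetical key phrase patterns: each names a relation and lists the
# key phrases that must all occur in a text for the relation to hold.
PATTERNS = {
    "battery_trouble": {"battery", "drain"},
    "screen_trouble": {"screen", "crack"},
    "price_question": {"price", "how"},
}

def lexical_analysis(text):
    """Decompose a text into a word set (whitespace split as a stand-in)."""
    return {w.strip(".,!?").lower() for w in text.split()}

def extract_relations(words, patterns=PATTERNS):
    """Return the names of all patterns whose key phrases all occur in words."""
    return frozenset(name for name, keys in patterns.items() if keys <= words)

def make_examples(labeled_texts):
    """Pair each text's key phrase relations with its user-given class."""
    return [(extract_relations(lexical_analysis(t)), c) for t, c in labeled_texts]

def learn_rules(examples):
    """Stand-in learner (NOT fuzzy inductive learning): map each relation
    to its majority class and the fraction of examples supporting it."""
    counts = defaultdict(Counter)
    for relations, cls in examples:
        for r in relations:
            counts[r][cls] += 1
    rules = {}
    for r, c in counts.items():
        cls, n = c.most_common(1)[0]
        rules[r] = (cls, n / sum(c.values()))
    return rules

# Invented sample data, for illustration only.
emails = [
    ("The battery seems to drain quickly.", "complaint"),
    ("Battery drain again after update!", "complaint"),
    ("How is the price decided?", "inquiry"),
]
rules = learn_rules(make_examples(emails))
```

Here `rules` maps each observed relation to a class and a support ratio. To mirror the two analytical criteria in the abstract, the same examples could be relabeled and the learner run once per criterion.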