Analysis of Textual Data Based on Inductive Learning Techniques

Authors:
Shigeaki Sakurai
Affiliations:
IT Research and Development Center, Toshiba Solutions Corporation, Tokyo, Japan
Venue:
International Journal of Information Retrieval Research
Year:
2013

Citing 13
Cited 0

C4.5: programs for machine learning

C4.5: programs for machine learning
The nature of statistical learning theory

The nature of statistical learning theory
A re-examination of text categorization methods

Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Introduction to Modern Information Retrieval

Introduction to Modern Information Retrieval
Mining Sequential Patterns

ICDE '95 Proceedings of the Eleventh International Conference on Data Engineering
PrefixSpan: Mining Sequential Patterns by Prefix-Projected Growth

Proceedings of the 17th International Conference on Data Engineering
Optimized Substructure Discovery for Semi-structured Data

PKDD '02 Proceedings of the 6th European Conference on Principles of Data Mining and Knowledge Discovery
Fast Algorithms for Mining Association Rules in Large Databases

VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
Rule discovery from textual data based on key phrase patterns

Proceedings of the 2004 ACM symposium on Applied computing
A quickstart in frequent structure mining can make a difference

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Semantic Web Technologies: Trends and Research in Ontology-based Systems

Semantic Web Technologies: Trends and Research in Ontology-based Systems
Wikipedia mining for an association web thesaurus construction

WISE'07 Proceedings of the 8th international conference on Web information systems engineering
A Discovery Method of Trend Rules from Complex Sequential Data

WAINA '12 Proceedings of the 2012 26th International Conference on Advanced Information Networking and Applications Workshops

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper introduces knowledge discovery methods based on inductive learning techniques from textual data. The author argues three methods extracting features of the textual data. First one activates a key concept dictionary, second one does a key phrase pattern dictionary, and third one does a named entity extractor. These features are used in order to generate rules representing relationships between the features and text classes. The rules are described in the format of a fuzzy decision tree. Also, these features are used in order to acquire a classification model based on SVM Support Vector Machine. The model can classify new textual data into the text classes with high classification accuracy. Lastly, this paper introduces two application tasks based on these methods and verifies the effect of the methods.