Mining soft-matching rules from textual data

Authors:
Un Yong Nahm;Raymond J. Mooney
Affiliations:
Department of Computer Sciences, University of Texas, Austin, TX;Department of Computer Sciences, University of Texas, Austin, TX
Venue:
IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
Year:
2001

Citing 11
Cited 8

Automatic text processing: the transformation, analysis, and retrieval of information by computer

Automatic text processing: the transformation, analysis, and retrieval of information by computer
C4.5: programs for machine learning

C4.5: programs for machine learning
Unifying instance-based and rule-based induction

Machine Learning
Providing database-like access to the Web using queries based on textual similarity

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Mining the most interesting rules

KDD '99 Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
The String-to-String Correction Problem

Journal of the ACM (JACM)
An Evaluation of Statistical Approaches to Text Categorization

Information Retrieval
Fast Algorithms for Mining Association Rules in Large Databases

VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
A Mutually Beneficial Integration of Data Mining and Information Extraction

Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on Innovative Applications of Artificial Intelligence
Applying Data Mining Techniques for Descriptive Phrase Extraction in Digital Document Collections

ADL '98 Proceedings of the Advances in Digital Libraries Conference
Untangling text data mining

ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics

Evaluating the novelty of text-mined rules using lexical knowledge

Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
Mining Text Data: Special Features and Patterns

Proceedings of the ESF Exploratory Workshop on Pattern Detection and Discovery
Unsupervised learning of soft patterns for generating definitions from online news

Proceedings of the 13th international conference on World Wide Web
Mining knowledge from text using information extraction

ACM SIGKDD Explorations Newsletter - Natural language processing and text mining
Cascading use of soft and hard matching pattern rules for weakly supervised information extraction

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Introducing structure management in automatic reference resolution: An XML-based approach

Information Processing and Management: an International Journal
An automatic reply to customers' e-mail queries model with Chinese text mining approach

ACOS'07 Proceedings of the 6th Conference on WSEAS International Conference on Applied Computer Science - Volume 6
A propositional approach to textual case indexing

PKDD'05 Proceedings of the 9th European conference on Principles and Practice of Knowledge Discovery in Databases

Quantified Score

Hi-index	0.00

Visualization

Abstract

Text mining concerns the discovery of knowledge from unstructured textual data. One important task is the discovery of rules that relate specific words and phrases. Although existing methods for this task learn traditional logical rules, soft-matching methods that utilize word-frequency information generally work better for textual data. This paper presents a rule induction system, TEXTRISE, that allows for partial matching of text-valued features by combining rule-based and instance-based learning. We present initial experiments applying TEXTRISE to corpora of book descriptions and patent documents retrieved from the web and compare its results to those of traditional rule and instance based methods.