RDRCE: combining machine learning and knowledge acquisition

Authors:
Han Xu;Achim Hoffmann
Affiliations:
School of Computer Science and Engineering, University of New South Wales, Sydney, Australia;School of Computer Science and Engineering, University of New South Wales, Sydney, Australia
Venue:
PKAW'10 Proceedings of the 11th international conference on Knowledge management and acquisition for smart systems and services
Year:
2010


Abstract

We present RDRCE (RDR Case Explorer), a new interactive workbench that facilitates the combination of Machine Learning and manual Knowledge Acquisition for Natural Language Processing problems. We show how to take the results of Brill's well-regarded transformation-based learning approach and convert them into a Ripple Down Rules (RDR) tree. RDRCE then strongly guides the systematic inspection of the generated RDR tree so that it can be further refined and improved by manually adding more rules. RDRCE also helps in quickly recognising potential noise in the training data and allows that noise to be dealt with effectively. Finally, we present a first study using RDRCE to build a high-quality Part-of-Speech tagger for English. After some 60 hours of manual knowledge acquisition, we already slightly exceed the state-of-the-art performance on unseen benchmark test data, that is, the fruits of some 15 years of further research in learning methods for Part-of-Speech taggers.