Taxonomic semantic indexing for textual case-based reasoning

Authors:
Juan A. Recio-Garcia; Nirmalie Wiratunga
Affiliations:
Universidad Complutense de Madrid, Spain; Robert Gordon University, Aberdeen, United Kingdom
Venue:
ICCBR'10 Proceedings of the 18th international conference on Case-Based Reasoning Research and Development
Year:
2010

Abstract

Case-Based Reasoning (CBR) solves problems by reusing past problem-solving experiences maintained in a casebase. The casebase is therefore the key CBR knowledge container. However, further containers, such as similarity, reuse and revision knowledge, are also crucial. Automated acquisition approaches are particularly attractive for discovering knowledge for such containers. The majority of research in this area focuses on introspective algorithms that extract knowledge from within the casebase. However, the rapid increase in Web applications has resulted in large volumes of user-generated experiential content, which forms a valuable source of background knowledge for CBR system development. In this paper we present a novel approach to acquiring knowledge from Web pages. The primary knowledge structure is a dynamically generated taxonomy which, once created, can be used during the retrieve and reuse stages of the CBR cycle. Importantly, this taxonomy is pruned according to a clustering-based sense disambiguation heuristic that uses similarity over the solution vocabulary of cases. The algorithms presented in the paper are applied to several online FAQ systems consisting of textual problem-solving cases. The quality of the generated taxonomies is evidenced by improved semantic comparison of text: successful sense disambiguation results in higher retrieval accuracy. Our results show significant improvements over standard text comparison alternatives.
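
The abstract does not spell out the algorithms, so the following is only a rough sketch of the general idea that a taxonomy can improve semantic comparison of text over plain term matching. It is not the method from the paper: the taxonomy here is a hand-built toy (the paper generates its taxonomy from Web pages and prunes it by sense disambiguation, which this sketch omits), and all names (`PARENT`, `expand`, `plain_similarity`, `taxonomic_similarity`) are hypothetical.

```python
from math import sqrt

# Hypothetical toy taxonomy (child term -> parent concept), hand-built
# for illustration only; it is NOT generated from Web pages.
PARENT = {
    "laptop": "computer",
    "desktop": "computer",
    "computer": "device",
    "printer": "device",
}


def expand(tokens):
    """Return the tokens plus all of their ancestors in the toy taxonomy."""
    expanded = set()
    for tok in tokens:
        while tok is not None:
            expanded.add(tok)
            tok = PARENT.get(tok)  # None once the root is passed
    return expanded


def cosine(a, b):
    """Cosine similarity between two term sets treated as binary vectors."""
    if not a or not b:
        return 0.0
    return len(a & b) / sqrt(len(a) * len(b))


def plain_similarity(query, case):
    """Baseline: compare the raw terms only."""
    return cosine(set(query.split()), set(case.split()))


def taxonomic_similarity(query, case):
    """Compare terms after adding their taxonomy ancestors as index terms."""
    return cosine(expand(query.split()), expand(case.split()))


if __name__ == "__main__":
    q, c = "laptop broken", "desktop broken"
    print(plain_similarity(q, c))      # only "broken" is shared
    print(taxonomic_similarity(q, c))  # "computer" and "device" are shared too
```

In this toy setting, "laptop" and "desktop" share no surface term but share the ancestors "computer" and "device", so the taxonomy-expanded comparison scores the two texts as more similar than the plain one does.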