Acquiring Word Similarities with Higher Order Association Mining

Authors:
Sutanu Chakraborti;Nirmalie Wiratunga;Robert Lothian;Stuart Watt
Affiliations:
School of Computing, The Robert Gordon University, Aberdeen AB25 1HG, Scotland, UK;School of Computing, The Robert Gordon University, Aberdeen AB25 1HG, Scotland, UK;School of Computing, The Robert Gordon University, Aberdeen AB25 1HG, Scotland, UK;School of Computing, The Robert Gordon University, Aberdeen AB25 1HG, Scotland, UK
Venue:
ICCBR '07 Proceedings of the 7th international conference on Case-Based Reasoning: Case-Based Reasoning Research and Development
Year:
2007

Citing 13
Cited 3

Machine Learning

Machine Learning
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features

ECML '98 Proceedings of the 10th European Conference on Machine Learning
CBR for Document Retrieval: The FALLQ Project

ICCBR '97 Proceedings of the Second International Conference on Case-Based Reasoning Research and Development
Choosing the word most typical in context using a lexical co-occurrence network

ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Frequency estimates for statistical word similarity measures

NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
Scalable collaborative filtering using cluster-based smoothing

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
WordNet-based User Profiles for Neighborhood Formation in Hybrid Recommender Systems

HIS '05 Proceedings of the Fifth International Conference on Hybrid Intelligent Systems
Supervised latent semantic indexing using adaptive sprinkling

IJCAI'07 Proceedings of the 20th international joint conference on Artificial intelligence
Extracting keyphrases to represent relations in social networks from web

IJCAI'07 Proceedings of the 20th international joint conference on Artificial intelligence
Sophia: a novel approach for textual case-based reasoning

IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
A framework for understanding Latent Semantic Indexing (LSI) performance

Information Processing and Management: an International Journal - Special issue: Formal methods for information retrieval
A propositional approach to textual case indexing

PKDD'05 Proceedings of the 9th European conference on Principles and Practice of Knowledge Discovery in Databases
Unsupervised feature selection for text data

ECCBR'06 Proceedings of the 8th European conference on Advances in Case-Based Reasoning

Recognition of higher-order relations among features in textual cases using random indexing

ICCBR'10 Proceedings of the 18th international conference on Case-Based Reasoning Research and Development
Term similarity and weighting framework for text representation

ICCBR'11 Proceedings of the 19th international conference on Case-Based Reasoning Research and Development
Supervised word sense disambiguation using semantic diffusion kernel

Engineering Applications of Artificial Intelligence

Abstract

We present a novel approach to mining word similarity in Textual Case-Based Reasoning. We exploit indirect associations between words, in addition to direct ones, to estimate their similarity. If word A co-occurs with word B, we say that A and B share a first-order association. If A co-occurs with B in some documents, and B with C in some others, then A and C are said to share a second-order co-occurrence via B. Higher orders of co-occurrence may be defined similarly. In this paper we present algorithms for mining higher-order co-occurrences. A weighted linear model is used to combine the contributions of these higher orders into a word similarity model. Our experimental results demonstrate significant improvements over similarity models based on first-order co-occurrences alone. Our approach also outperforms state-of-the-art techniques such as SVM and LSI in classification tasks of varying complexity.
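
The definitions of first- and second-order co-occurrence in the abstract can be illustrated with a small sketch. This is not the paper's mining algorithm, which the abstract does not specify. It assumes a binary document-term matrix, counts second-order associations as the number of intermediary words, and uses hypothetical weights (1.0 and 0.5) for the linear combination. The function names `cooccurrence_orders` and `similarity` are illustrative only.

```python
import numpy as np

def cooccurrence_orders(doc_term):
    """Return (first, second) word-by-word co-occurrence matrices.

    first[i, j]  = number of documents containing both word i and word j.
    second[i, j] = number of intermediary words k that co-occur with
                   both i and j (an assumption made for this sketch).
    """
    X = (np.asarray(doc_term) > 0).astype(int)   # binary presence matrix
    first = X.T @ X                              # document-level co-occurrence counts
    np.fill_diagonal(first, 0)                   # ignore a word paired with itself
    adj = (first > 0).astype(int)                # 1 if two words ever co-occur
    second = adj @ adj                           # two-step links through some word k
    np.fill_diagonal(second, 0)
    return first, second

def similarity(doc_term, weights=(1.0, 0.5)):
    """Weighted linear combination of the two orders (weights are hypothetical)."""
    first, second = cooccurrence_orders(doc_term)
    return weights[0] * first + weights[1] * second

# Vocabulary: A, B, C. Document 1 contains A and B; document 2 contains B and C.
docs = [[1, 1, 0],
        [0, 1, 1]]
first, second = cooccurrence_orders(docs)
sim = similarity(docs)
```

In this toy corpus A and C never appear in the same document, so their first-order count is zero, yet they receive a non-zero similarity through the shared neighbour B.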