Machine learning algorithms are often only as good as the data they learn from. Enormous amounts of unlabeled data are readily available, and the ability to exploit such data efficiently holds significant promise for improving the performance of many learning tasks. We consider the task of supervised domain adaptation and present a self-taught-learning-based framework that uses the K-SVD algorithm to learn a sparse representation of the data in an unsupervised manner. To the best of our knowledge, this is the first work that integrates the K-SVD algorithm into the self-taught learning framework. K-SVD iteratively alternates between sparse coding of the instances with respect to the current dictionary and an update of the dictionary atoms to better fit the data, so as to achieve a sparse representation under strict sparsity constraints. Using the learned dictionary, a rich feature representation of the few labeled instances is obtained and fed, together with the class labels, to a classifier to build the model. We evaluate the framework on domain adaptation for sentiment classification, performing both self-domain classification (requiring very few domain-specific training instances) and cross-domain classification (requiring no labeled instances of the target domain and very few labeled instances of the source domain). Empirical comparison of the self-domain and cross-domain results establishes the efficacy of the proposed framework.
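The two-stage alternation described in the abstract — sparse coding under the current dictionary, then a per-atom SVD update of the dictionary — can be sketched in a few dozen lines of numpy. This is a minimal illustrative sketch of standard K-SVD with OMP for the coding stage, not the authors' implementation; all function names and parameters (`omp`, `ksvd`, `n_atoms`, `k`) are our own choices.

```python
import numpy as np

def omp(D, x, k):
    """Orthogonal Matching Pursuit: greedily sparse-code x over dictionary D
    with at most k nonzero coefficients."""
    residual = x.copy()
    idx = []
    coef = np.zeros(D.shape[1])
    for _ in range(k):
        # Pick the atom most correlated with the current residual.
        j = int(np.argmax(np.abs(D.T @ residual)))
        if j not in idx:
            idx.append(j)
        # Re-fit coefficients on the selected atoms and update the residual.
        sol, *_ = np.linalg.lstsq(D[:, idx], x, rcond=None)
        residual = x - D[:, idx] @ sol
    coef[idx] = sol
    return coef

def ksvd(X, n_atoms, k, n_iter=10, seed=0):
    """K-SVD: alternate sparse coding (OMP) with per-atom rank-1 SVD updates.
    X is a (n_features, n_samples) data matrix; returns dictionary D and codes A."""
    rng = np.random.default_rng(seed)
    D = rng.standard_normal((X.shape[0], n_atoms))
    D /= np.linalg.norm(D, axis=0)          # unit-norm atoms
    for _ in range(n_iter):
        # Stage 1: sparse-code every sample under the current dictionary.
        A = np.column_stack([omp(D, X[:, i], k) for i in range(X.shape[1])])
        # Stage 2: update each atom (and its coefficients) to best fit the
        # residual of the samples that actually use it.
        for j in range(n_atoms):
            users = np.nonzero(A[j, :])[0]
            if users.size == 0:
                continue                     # atom unused this round
            E = X[:, users] - D @ A[:, users] + np.outer(D[:, j], A[j, users])
            U, s, Vt = np.linalg.svd(E, full_matrices=False)
            D[:, j] = U[:, 0]
            A[j, users] = s[0] * Vt[0, :]
    return D, A
```

In the framework described above, `ksvd` would be run on the unlabeled corpus; the few labeled instances would then be encoded with `omp` against the learned dictionary, and those sparse codes would serve as the feature representation handed to the classifier.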