Domain adaptation by constraining inter-domain variability of latent feature representation

Authors:
Ivan Titov
Affiliations:
Saarland University, Saarbruecken, Germany
Venue:
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Year:
2011

Citing 20
Cited 2

Information processing in dynamical systems: foundations of harmony theory

Parallel distributed processing: explorations in the microstructure of cognition, vol. 1
Factorial Hidden Markov Models

Machine Learning - Special issue on learning with probabilistic representations
Neural Networks for Pattern Recognition

Neural Networks for Pattern Recognition
Training products of experts by minimizing contrastive divergence

Neural Computation
Thumbs up?: sentiment classification using machine learning techniques

EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
Reranking and self-training for parser adaptation

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Discriminative learning for differing training and test distributions

Proceedings of the 24th international conference on Machine learning
A unified architecture for natural language processing: deep neural networks with multitask learning

Proceedings of the 25th international conference on Machine learning
Topic-bridged PLSA for cross-domain text classification

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Domain Adaptation of Conditional Probability Models Via Feature Subsetting

PKDD 2007 Proceedings of the 11th European conference on Principles and Practice of Knowledge Discovery in Databases
A latent variable model of synchronous syntactic-semantic parsing for multiple languages

CoNLL '09 Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task
Multi-conditional learning: generative/discriminative training for clustering and classification

AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Structural correspondence learning for parse disambiguation

EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics: Student Research Workshop
Domain adaptation with structural correspondence learning

EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Domain adaptation for statistical classifiers

Journal of Artificial Intelligence Research
Mean field theory for sigmoid belief networks

Journal of Artificial Intelligence Research
Distributional representations for handling sparsity in supervised sequence-labeling

ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1 - Volume 1
A theory of learning from different domains

Machine Learning
Generalized Expectation Criteria for Semi-Supervised Learning with Weakly Labeled Data

The Journal of Machine Learning Research
Posterior Regularization for Structured Latent Variable Models

The Journal of Machine Learning Research

A partially supervised cross-collection topic model for cross-domain text classification

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Bootstrapping polarity classifiers with rule-based classification

Language Resources and Evaluation

Quantified Score

Hi-index	0.00

Visualization

Abstract

We consider a semi-supervised setting for domain adaptation where only unlabeled data is available for the target domain. One way to tackle this problem is to train a generative model with latent variables on the mixture of data from the source and target domains. Such a model would cluster features in both domains and ensure that at least some of the latent variables are predictive of the label on the source domain. The danger is that these predictive clusters will consist of features specific to the source domain only and, consequently, a classifier relying on such clusters would perform badly on the target domain. We introduce a constraint enforcing that marginal distributions of each cluster (i.e., each latent variable) do not vary significantly across domains. We show that this constraint is effective on the sentiment classification task (Pang et al., 2002), resulting in scores similar to the ones obtained by the structural correspondence methods (Blitzer et al., 2007) without the need to engineer auxiliary tasks.