Domain adaptation with structural correspondence learning

Authors:
John Blitzer;Ryan McDonald;Fernando Pereira
Affiliations:
University of Pennsylvania, Philadelphia, PA;University of Pennsylvania, Philadelphia, PA;University of Pennsylvania, Philadelphia, PA
Venue:
EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Year:
2006

Citing 18
Cited 142


Abstract

Discriminative learning methods are widely used in natural language processing. These methods work best when their training and test data are drawn from the same distribution. For many NLP tasks, however, we are confronted with new domains in which labeled data is scarce or non-existent. In such cases, we seek to adapt existing models from a resource-rich source domain to a resource-poor target domain. We introduce structural correspondence learning to automatically induce correspondences among features from different domains. We test our technique on part-of-speech tagging and show performance gains for varying amounts of source and target training data, as well as improvements in target-domain parsing accuracy using our improved tagger.
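
The abstract does not spell out how the feature correspondences are induced, so the following is only a rough sketch of the recipe commonly associated with structural correspondence learning: choose pivot features that occur often in both domains, use unlabeled data to predict each pivot from the remaining features, and take a low-rank basis of the resulting weight matrix as shared features. The names scl_projection, augment, pivot_idx, k, and ridge are hypothetical, the data is random, and ridge regression is used here as a simple stand-in for the pivot classifiers; it should not be read as the authors' implementation.

```python
import numpy as np

def scl_projection(X_unlabeled, pivot_idx, k, ridge=1.0):
    """Learn a k-dimensional SCL-style projection from unlabeled data.

    X_unlabeled: (n, d) binary feature matrix pooled from both domains.
    pivot_idx:   indices of pivot features (assumed frequent in both domains).
    Returns theta with shape (k, d); columns for pivot features are zero.
    """
    n, d = X_unlabeled.shape
    pivot_idx = np.asarray(pivot_idx)
    nonpivot = np.setdiff1d(np.arange(d), pivot_idx)
    if k > min(len(nonpivot), len(pivot_idx)):
        raise ValueError("k cannot exceed the rank bound of the weight matrix")

    A = X_unlabeled[:, nonpivot]                # inputs: non-pivot features only
    Y = 2.0 * X_unlabeled[:, pivot_idx] - 1.0   # targets: +1 if pivot fires, else -1

    # Assumption: one ridge-regression predictor per pivot, solved jointly.
    G = A.T @ A + ridge * np.eye(len(nonpivot))
    W = np.linalg.solve(G, A.T @ Y)             # shape (num_nonpivot, num_pivots)

    # Top-k left singular vectors span the shared low-dimensional structure.
    U, _, _ = np.linalg.svd(W, full_matrices=False)
    theta = np.zeros((k, d))
    theta[:, nonpivot] = U[:, :k].T
    return theta

def augment(X, theta):
    """Append the projected features theta @ x to each original row x."""
    return np.hstack([X, X @ theta.T])

# Toy usage on random binary features (illustrative data only).
rng = np.random.default_rng(0)
X = (rng.random((200, 30)) < 0.2).astype(float)
theta = scl_projection(X, pivot_idx=np.arange(5), k=3)
X_aug = augment(X, theta)
print(X_aug.shape)  # (200, 33)
```

In this sketch a tagger would then be trained on the augmented source-domain features and applied to target-domain data augmented with the same theta, so that features that behave alike with respect to the pivots share weights across domains.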