Constructing informative priors using transfer learning

Authors:
Rajat Raina;Andrew Y. Ng;Daphne Koller
Affiliations:
Stanford University, CA;Stanford University, CA;Stanford University, CA
Venue:
ICML '06 Proceedings of the 23rd international conference on Machine learning
Year:
2006

Citing 6
Cited 41

WordNet: a lexical database for English

Communications of the ACM
A Bayesian/Information Theoretic Model of Learning to Learn viaMultiple Task Sampling

Machine Learning - Special issue on inductive transfer
Multitask Learning

Machine Learning - Special issue on inductive transfer
Learning to learn with the informative vector machine

ICML '04 Proceedings of the twenty-first international conference on Machine learning
Learning Gaussian processes from multiple tasks

ICML '05 Proceedings of the 22nd international conference on Machine learning
A Framework for Learning Predictive Structures from Multiple Tasks and Unlabeled Data

The Journal of Machine Learning Research

Hierarchical maximum entropy density estimation

Proceedings of the 24th international conference on Machine learning
Learning a meta-level prior for feature relevance from multiple related tasks

Proceedings of the 24th international conference on Machine learning
Self-taught clustering

Proceedings of the 25th international conference on Machine learning
Iterative Reinforcement Cross-Domain Text Classification

ADMA '08 Proceedings of the 4th international conference on Advanced Data Mining and Applications
Transfer learning from multiple source domains via consensus regularization

Proceedings of the 17th ACM conference on Information and knowledge management
Search advertising using web relevance feedback

Proceedings of the 17th ACM conference on Information and knowledge management
A framework for classifier adaptation and its applications in concept detection

MIR '08 Proceedings of the 1st ACM international conference on Multimedia information retrieval
Knowledge Supervised Text Classification with No Labeled Documents

PRICAI '08 Proceedings of the 10th Pacific Rim International Conference on Artificial Intelligence: Trends in Artificial Intelligence
EigenTransfer: a unified framework for transfer learning

ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
2009 Special Issue: Predictive learning with structured (grouped) data

Neural Networks
Case-Based Reasoning in Transfer Learning

ICCBR '09 Proceedings of the 8th International Conference on Case-Based Reasoning: Case-Based Reasoning Research and Development
Online methods for multi-domain learning and adaptation

EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Relaxed Transfer of Different Classes via Spectral Partition

ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part II
Multi-task Feature Selection Using the Multiple Inclusion Criterion (MIC)

ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part I
Transferring naive bayes classifiers for text classification

AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 1
Mapping and revising Markov logic networks for transfer learning

AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 1
Importance of semantic representation: dataless classification

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
Transfer learning from minimal target data by mapping across relational domains

IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Transfer Learning beyond Text Classification

ACML '09 Proceedings of the 1st Asian Conference on Machine Learning: Advances in Machine Learning
Supervised self-taught learning: actively transferring knowledge from unlabeled data

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
When Is There a Representer Theorem? Vector Versus Matrix Regularizers

The Journal of Machine Learning Research
Bayesian multitask learning with latent hierarchies

UAI '09 Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence
Three challenges in data mining

Frontiers of Computer Science in China
Improved natural language learning via variance-regularization support vector machines

CoNLL '10 Proceedings of the Fourteenth Conference on Computational Natural Language Learning
Minimum Description Length Penalization for Group and Multi-Task Sparse Learning

The Journal of Machine Learning Research
Towards semantic knowledge propagation from text corpus to web images

Proceedings of the 20th international conference on World wide web
Improving accuracy of microarray classification by a simple multi-task feature selection filter

International Journal of Data Mining and Bioinformatics
Activity knowledge transfer in smart environments

Pervasive and Mobile Computing
Part-based transfer learning

ISNN'11 Proceedings of the 8th international conference on Advances in neural networks - Volume Part III
Transfer learning with adaptive regularizers

ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part III
Transferring topical knowledge from auxiliary long texts for short text clustering

Proceedings of the 20th ACM international conference on Information and knowledge management
Transferring knowledge of activity recognition across sensor networks

Pervasive'10 Proceedings of the 8th international conference on Pervasive Computing
Source-selection-free transfer learning

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
Cross-Guided Clustering: Transfer of Relevant Supervision across Tasks

ACM Transactions on Knowledge Discovery from Data (TKDD)
Co-transfer learning via joint transition probability graph based method

Proceedings of the 1st International Workshop on Cross Domain Knowledge Discovery in Web and Social Network Mining
Transfer Learning from Unlabeled Data via Neural Networks

Neural Processing Letters
Effective fuzzy semantic clustering scheme for decentralised network through multi-domain ontology model

International Journal of Metadata, Semantics and Ontologies
What makes a good detector? --- structured priors for learning from few examples

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part V
Transfer defect learning

Proceedings of the 2013 International Conference on Software Engineering
Scalable supervised dimensionality reduction using clustering

Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining
Effective fuzzy semantic clustering scheme for decentralised network through multi-domain ontology model

International Journal of Metadata, Semantics and Ontologies

Quantified Score

Hi-index	0.01

Visualization

Abstract

Many applications of supervised learning require good generalization from limited labeled data. In the Bayesian setting, we can try to achieve this goal by using an informative prior over the parameters, one that encodes useful domain knowledge. Focusing on logistic regression, we present an algorithm for automatically constructing a multivariate Gaussian prior with a full covariance matrix for a given supervised learning task. This prior relaxes a commonly used but overly simplistic independence assumption, and allows parameters to be dependent. The algorithm uses other "similar" learning problems to estimate the covariance of pairs of individual parameters. We then use a semidefinite program to combine these estimates and learn a good prior for the current learning task. We apply our methods to binary text classification, and demonstrate a 20 to 40% test error reduction over a commonly used prior.