Self-taught learning: transfer learning from unlabeled data

  • Authors: Rajat Raina; Alexis Battle; Honglak Lee; Benjamin Packer; Andrew Y. Ng
  • Affiliation: Stanford University, CA (all authors)
  • Venue: Proceedings of the 24th International Conference on Machine Learning (ICML)
  • Year: 2007

Abstract

We present a new machine learning framework called "self-taught learning" for using unlabeled data in supervised classification tasks. We do not assume that the unlabeled data follows the same class labels or generative distribution as the labeled data. Thus, we would like to use a large number of unlabeled images (or audio samples, or text documents) randomly downloaded from the Internet to improve performance on a given image (or audio, or text) classification task. Such unlabeled data is significantly easier to obtain than in typical semi-supervised or transfer learning settings, making self-taught learning widely applicable to many practical learning problems. We describe an approach to self-taught learning that uses sparse coding to construct higher-level features using the unlabeled data. These features form a succinct input representation and significantly improve classification performance. When using an SVM for classification, we further show how a Fisher kernel can be learned for this representation.
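The pipeline the abstract outlines can be summarized in three steps: learn sparse-coding bases from unlabeled data, encode each labeled example by its sparse activations over those bases, and train a standard classifier on the codes. The sketch below is a minimal illustration of that idea, not the authors' implementation; it substitutes scikit-learn's DictionaryLearning and LinearSVC for their solvers, and the synthetic arrays (X_unlabeled, X_train, y_train) are placeholders for real data.

```python
# Minimal self-taught-learning sketch (illustrative only, not the paper's code):
# 1) sparse-code an over-complete basis from unlabeled data,
# 2) encode labeled data with that basis, 3) train a supervised classifier.
import numpy as np
from sklearn.decomposition import DictionaryLearning
from sklearn.svm import LinearSVC

rng = np.random.default_rng(0)
X_unlabeled = rng.normal(size=(500, 64))   # stand-in for random unlabeled patches
X_train = rng.normal(size=(100, 64))       # stand-in for labeled task examples
y_train = rng.integers(0, 2, size=100)     # stand-in for task labels

# Step 1: learn a dictionary of basis vectors by sparse coding on the unlabeled set.
coder = DictionaryLearning(n_components=128, alpha=1.0, max_iter=20,
                           transform_algorithm="lasso_lars", random_state=0)
coder.fit(X_unlabeled)

# Step 2: represent each labeled example by its sparse activations over the basis.
features = coder.transform(X_train)

# Step 3: train an ordinary SVM on the higher-level features.
clf = LinearSVC().fit(features, y_train)
```

In the paper's setting the unlabeled inputs need not share labels or a generative distribution with the labeled task; only the learned basis is transferred. The Fisher-kernel extension mentioned above is not shown here.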