80 Million Tiny Images: A Large Data Set for Nonparametric Object and Scene Recognition

Authors:
Antonio Torralba; Rob Fergus; William T. Freeman
Affiliations:
MIT, Cambridge; New York University, New York; MIT, Cambridge
Venue:
IEEE Transactions on Pattern Analysis and Machine Intelligence
Year:
2008

Citing 0
Cited 137

Learning tag relevance by neighbor voting for social image retrieval

MIR '08 Proceedings of the 1st ACM international conference on Multimedia information retrieval
Deep learning from temporal coherence in video

ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
A descriptor for large scale image retrieval based on sketched feature lines

Proceedings of the 6th Eurographics Symposium on Sketch-Based Interfaces and Modeling
Experimental Analysis of Insertion Costs in a Naïve Dynamic MDF-Tree

IbPRIA '09 Proceedings of the 4th Iberian Conference on Pattern Recognition and Image Analysis
Concept-Based Video Retrieval

Foundations and Trends in Information Retrieval
Making Archetypal Analysis Practical

Proceedings of the 31st DAGM Symposium on Pattern Recognition
Removing image artifacts due to dirty camera lenses and thin occluders

ACM SIGGRAPH Asia 2009 papers
Canonical contextual distance for large-scale image annotation and retrieval

LS-MMRM '09 Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining
Image categorization combining neighborhood methods and boosting

LS-MMRM '09 Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining
Leveraging large-scale weakly-tagged images to train inter-related classifiers for multi-label annotation

LS-MMRM '09 Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining
Using large-scale web data to facilitate textual query based retrieval of consumer photos

MM '09 Proceedings of the 17th ACM international conference on Multimedia
Descriptive visual words and visual phrases for image applications

MM '09 Proceedings of the 17th ACM international conference on Multimedia
Enhancing semantic and geographic annotation of web images via logistic canonical correlation regression

MM '09 Proceedings of the 17th ACM international conference on Multimedia
Inferring semantic concepts from community-contributed images and noisy tags

MM '09 Proceedings of the 17th ACM international conference on Multimedia
Evaluation of GIST descriptors for web-scale image search

Proceedings of the ACM International Conference on Image and Video Retrieval
NUS-WIDE: a real-world web image database from National University of Singapore

Proceedings of the ACM International Conference on Image and Video Retrieval
Ontologies and semantic mining for bio-technology and chemistry data and patents

Proceedings of the 2nd international workshop on Patent information retrieval
Knowledge discovery over community-sharing media: from signal to intelligence

ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
Gathering and ranking photos of named entities with high precision, high recall, and diversity

Proceedings of the third ACM international conference on Web search and data mining
Learning social tag relevance by neighbor voting

IEEE Transactions on Multimedia
Scalable learning for object detection with GPU hardware

IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
Autonomous indoor helicopter flight using a single onboard camera

IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
Incremental indexing and distributed image search using shared randomized vocabularies

Proceedings of the international conference on Multimedia information retrieval
New trends and ideas in visual concept detection: the MIR flickr retrieval evaluation initiative

Proceedings of the international conference on Multimedia information retrieval
Web-scale computer vision using MapReduce for multimedia data mining

Proceedings of the Tenth International Workshop on Multimedia Data Mining
The accuracy and value of machine-generated image tags: design and user evaluation of an end-to-end image tagging system

Proceedings of the ACM International Conference on Image and Video Retrieval
Structured max-margin learning for multi-label image annotation

Proceedings of the ACM International Conference on Image and Video Retrieval
Emotion related structures in large image databases

Proceedings of the ACM International Conference on Image and Video Retrieval
Automatic tag expansion using visual similarity for photo sharing websites

Multimedia Tools and Applications
Object Recognition in 3D Point Clouds Using Web Data and Domain Adaptation

International Journal of Robotics Research
Technical Section: An evaluation of descriptors for large-scale image retrieval from sketched feature lines

Computers and Graphics
Detecting activities from body-worn accelerometers via instance-based algorithms

Pervasive and Mobile Computing
Yes we can: simplex volume maximization for descriptive web-scale matrix factorization

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Image segmentation with patch-pair density priors

Proceedings of the international conference on Multimedia
S3MKL: scalable semi-supervised multiple kernel learning for image data mining

Proceedings of the international conference on Multimedia
Image tag refinement towards low-rank, content-tag prior and error sparsity

Proceedings of the international conference on Multimedia
Image retagging

Proceedings of the international conference on Multimedia
Vicept: link visual features to concepts for large-scale image understanding

Proceedings of the international conference on Multimedia
Keep moving!: revisiting thumbnails for mobile video retrieval

Proceedings of the international conference on Multimedia
Auto-tagging of images in non-english languages using tag language conversion

Proceedings of the international conference on Multimedia
Nearest-neighbor classification using unlabeled data for real world image application

Proceedings of the international conference on Multimedia
Understanding multimedia content using web scale social media data

Proceedings of the international conference on Multimedia
Size does matter: improving object recognition and 3D reconstruction with cross-media analysis of image clusters

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part I
Semantic label sharing for learning with many categories

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part I
Continuous visual codebooks with a limited branching tree growing neural gas

ICANN'10 Proceedings of the 20th international conference on Artificial neural networks: Part III
Improving the fisher kernel for large-scale image classification

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Efficiently scaling up video annotation with crowdsourced marketplaces

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
What does classifying more than 10,000 image categories tell us?

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
Modeling and analysis of dynamic behaviors of web image collections

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
Superparsing: scalable nonparametric image parsing with superpixels

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
Supervised label transfer for semantic segmentation of street scenes

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
Efficient region-aware large graph construction towards scalable multi-label propagation

Pattern Recognition
Image annotation by kNN-sparse graph-based label propagation over noisily tagged web images

ACM Transactions on Intelligent Systems and Technology (TIST)
Determining the best suited semantic events for cognitive surveillance

Expert Systems with Applications: An International Journal
A disk-aware algorithm for time series motif discovery

Data Mining and Knowledge Discovery
Measuring and Predicting Object Importance

International Journal of Computer Vision
Personalization in multimedia retrieval: A survey

Multimedia Tools and Applications
Sorting through photos

Communications of the ACM
Size matters! how thumbnail number, size, and motion influence mobile video retrieval

MMM'11 Proceedings of the 17th international conference on Advances in multimedia modeling - Volume Part II
Exploiting Textons distributions on spatial hierarchy for scene classification

Journal on Image and Video Processing - Special issue on selected papers from multimedia modeling conference 2009
Massive character recognition with a large ground-truthed database

Proceedings of the 2011 ACM Symposium on Applied Computing
Social negative bootstrapping for visual categorization

Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Lost in binarization: query-adaptive ranking for similar image search with compact codes

Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Tall and skinny QR factorizations in MapReduce architectures

Proceedings of the second international workshop on MapReduce and its applications
Data-driven visual similarity for cross-domain image matching

Proceedings of the 2011 SIGGRAPH Asia Conference
Efficient relative camera orientation detection for mobile applications

Proceedings of the 1st international workshop on Mobile location-based service
Optimization of robust loss functions for weakly-labeled image taxonomies: an imagenet case study

EMMCVPR'11 Proceedings of the 8th international conference on Energy minimization methods in computer vision and pattern recognition
Finding images of difficult entities in the long tail

Proceedings of the 20th ACM international conference on Information and knowledge management
Learning "verb-object" concepts for semantic image annotation

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Autonomous exploration using rapid perception of low-resolution image information

Autonomous Robots
Boosting web video categorization with contextual information from social web

World Wide Web
An MMSE approach to nonlocal image denoising: Theory and practical implementation

Journal of Visual Communication and Image Representation
Leveraging social media for scalable object detection

Pattern Recognition
Evaluation of fast 2d and 3d medical image retrieval approaches based on image miniatures

MCBR-CDS'11 Proceedings of the Second MICCAI international conference on Medical Content-Based Retrieval for Clinical Decision Support
A unified approach to learning task-specific bit vector representations for fast nearest neighbor search

Proceedings of the 21st international conference on World Wide Web
A log square average case algorithm to make insertions in fast similarity search

Pattern Recognition Letters
Swift: reducing the effects of latency in online video scrubbing

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Epitomize your photos

International Journal of Computer Games Technology
A pilot study for mood-based classification of TV programmes

Proceedings of the 27th Annual ACM Symposium on Applied Computing
WSABIE: scaling up to large vocabulary image annotation

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Three
SUPER: towards real-time event recognition in internet videos

Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
Quantity versus quality: the role of layout and interaction complexity in thumbnail-based video retrieval interfaces

Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
Assistive tagging: A survey of multimedia tagging with human-computer joint exploration

ACM Computing Surveys (CSUR)
Nearest-neighbor method using multiple neighborhood similarities for social media data mining

Neurocomputing
Learning by expansion: Exploiting social media for image classification with few training examples

Neurocomputing
Web image prediction using multivariate point processes

Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
Manhattan hashing for large-scale image retrieval

SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Video annotation and navigation on mobile devices

Proceedings of the 18th Brazilian symposium on Multimedia and the web
Harvesting visual concepts for image search with complex queries

Proceedings of the 20th ACM international conference on Multimedia
Query-driven iterated neighborhood graph search for large scale indexing

Proceedings of the 20th ACM international conference on Multimedia
On shape and the computability of emotions

Proceedings of the 20th ACM international conference on Multimedia
Joint statistical analysis of images and keywords with applications in semantic image enhancement

Proceedings of the 20th ACM international conference on Multimedia
Efficient image annotation for automatic sentence generation

Proceedings of the 20th ACM international conference on Multimedia
Similar image search with a tiny bag-of-delegates representation

Proceedings of the 20th ACM international conference on Multimedia
Towards indexing representative images on the web

Proceedings of the 20th ACM international conference on Multimedia
Semi-supervised spectral hashing for fast similarity search

Neurocomputing
Annotating images with suggestions: user study of a tagging system

ACIVS'12 Proceedings of the 14th international conference on Advanced Concepts for Intelligent Vision Systems
Undoing the damage of dataset bias

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part I
Unsupervised discovery of mid-level discriminative patches

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part II
Constrained semi-supervised learning using attributes and comparative attributes

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part III
Supervised geodesic propagation for semantic label transfer

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part III
Displacement template with divide-&-conquer algorithm for significantly improving descriptor based face recognition approaches

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part V
Sequential spectral learning to hash with multiple representations

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part V
Dating historical color images

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part VI
Connecting missing links: object discovery from sparse observations using 5 million product images

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part VI
Extracting minimalistic corridor geometry from low-resolution images

ICIRA'12 Proceedings of the 5th international conference on Intelligent Robotics and Applications - Volume Part I
Measuring image distances via embedding in a semantic manifold

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part IV
Matrix factorization as search

ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II
Time-sensitive web image ranking and retrieval via dynamic multi-task regression

Proceedings of the sixth ACM international conference on Web search and data mining
Superparsing

International Journal of Computer Vision
Efficiently Scaling up Crowdsourced Video Annotation

International Journal of Computer Vision
VISCERAL: towards large data in medical imaging -- challenges and directions

MCBR-CDS'12 Proceedings of the Third MICCAI international conference on Medical Content-Based Retrieval for Clinical Decision Support
Effective transfer tagging from image to video

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Tag completion based on belief theory and neighbor voting

Proceedings of the 3rd ACM International Conference on Multimedia Retrieval
OpenSurfaces: a richly annotated catalog of surface appearance

ACM Transactions on Graphics (TOG) - SIGGRAPH 2013 Conference Proceedings
Swifter: improved online video scrubbing

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
MLRank: Multi-correlation Learning to Rank for image annotation

Pattern Recognition
Image search—from thousands to billions in 20 years

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP) - Special Sections on the 20th Anniversary of ACM International Conference on Multimedia, Best Papers of ACM Multimedia 2012
Improving tag-based image search by using linked open data

Proceedings of the 10th Conference on Open Research Areas in Information Retrieval
Enabling low bitrate mobile visual recognition: a performance versus bandwidth evaluation

Proceedings of the 21st ACM international conference on Multimedia
Indexing billions of images for sketch-based retrieval

Proceedings of the 21st ACM international conference on Multimedia
Clickage: towards bridging semantic and intent gaps via mining click logs of search engines

Proceedings of the 21st ACM international conference on Multimedia
Nonparametric guidance of autoencoder representations using label information

The Journal of Machine Learning Research
Efficient hierarchical clustering of large high dimensional datasets

Proceedings of the 22nd ACM international conference on Information & knowledge management
Content-based annotation and classification framework: a general multi-purpose approach

Proceedings of the 17th International Database Engineering & Applications Symposium
A new method of image classification based on local appearance and context information

Neurocomputing
Object class detection: A survey

ACM Computing Surveys (CSUR)
Memory-efficient groupby-aggregate using compressed buffer trees

Proceedings of the 4th annual Symposium on Cloud Computing
Using objective ground-truth labels created by multiple annotators for improved video classification: A comparative study

Computer Vision and Image Understanding
Multimedia search reranking: A literature survey

ACM Computing Surveys (CSUR)
Smart hashing update for fast response

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
A spatio-temporal Long-term Memory approach for visual place recognition in mobile robotic navigation

Robotics and Autonomous Systems
World-wide scale geotagged image dataset for automatic image annotation and reverse geotagging

Proceedings of the 5th ACM Multimedia Systems Conference
ObjectPatchNet: Towards scalable and semantic image annotation and retrieval

Computer Vision and Image Understanding
Image categorization using a semantic hierarchy model with sparse set of salient regions

Frontiers of Computer Science: Selected Publications from Chinese Universities
QuMinS: Fast and scalable querying, mining and summarizing multi-modal databases

Information Sciences: an International Journal
C2TAM: A Cloud framework for cooperative tracking and mapping

Robotics and Autonomous Systems

Quantified Score

Hi-index	0.18

Abstract

With the advent of the Internet, billions of images are now freely available online and constitute a dense sampling of the visual world. Using a variety of nonparametric methods, we explore this world with the aid of a large dataset of 79,302,017 images collected from the Internet. Motivated by psychophysical results showing the remarkable tolerance of the human visual system to degradations in image resolution, the images in the dataset are stored as 32 × 32 color images. Each image is loosely labeled with one of the 75,062 non-abstract nouns in English, as listed in the WordNet lexical database. Hence, the image database gives comprehensive coverage of all object categories and scenes. The semantic information from WordNet can be used in conjunction with nearest-neighbor methods to perform object classification over a range of semantic levels, minimizing the effects of labeling noise. For certain classes that are particularly prevalent in the dataset, such as people, we are able to demonstrate a recognition performance comparable to class-specific Viola-Jones style detectors.
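
The idea of combining nearest-neighbor search with a semantic hierarchy can be illustrated with a small sketch. This is not the authors' implementation: the toy hypernym table, the four-value "images", the sum-of-squared-differences distance, and the function names are all assumptions chosen for illustration. Each of the k nearest neighbors votes for its own label and for every ancestor of that label, so neighbors whose noisy leaf labels disagree can still agree at a coarser semantic level.

```python
from collections import Counter

# Hypothetical toy hypernym table (child -> parent), a stand-in for WordNet.
HYPERNYM = {
    "politician": "person",
    "scientist": "person",
    "person": "organism",
    "dog": "animal",
    "animal": "organism",
    "organism": "entity",
    "car": "vehicle",
    "vehicle": "entity",
}


def ancestors(label):
    """Return the label followed by all of its hypernyms up to the root."""
    chain = [label]
    while chain[-1] in HYPERNYM:
        chain.append(HYPERNYM[chain[-1]])
    return chain


def ssd(a, b):
    """Sum of squared differences between two equal-length pixel vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def knn_hierarchy_votes(query, dataset, k):
    """Each of the k nearest neighbors votes for its label and all hypernyms."""
    nearest = sorted(dataset, key=lambda item: ssd(query, item[0]))[:k]
    votes = Counter()
    for _, label in nearest:
        votes.update(ancestors(label))
    return votes


# Toy data: 4-value vectors standing in for flattened 32 x 32 color images.
DATASET = [
    ([0.9, 0.8, 0.1, 0.1], "politician"),
    ([0.8, 0.9, 0.2, 0.1], "scientist"),
    ([0.7, 0.9, 0.1, 0.2], "dog"),
    ([0.1, 0.1, 0.9, 0.9], "car"),
    ([0.2, 0.1, 0.8, 0.9], "car"),
]

votes = knn_hierarchy_votes([0.85, 0.85, 0.1, 0.1], DATASET, k=3)
for level in ("politician", "person", "organism"):
    print(level, votes[level])
```

In this toy run the three nearest neighbors carry three different leaf labels, so no leaf label receives more than one vote, while "person" collects two votes and "organism" collects three. Reading the vote counts at a chosen level of the hierarchy is one way to trade label specificity against robustness to noisy labels.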