Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope

Authors:
Aude Oliva;Antonio Torralba
Affiliations:
Harvard Medical School and the Brigham and Women's Hospital, 221 Longwood Ave., Boston, MA 02115. oliva@search.bwh.harvard.edu;Department of Brain and Cognitive Sciences, MIT, 45 Carleton Street, Cambridge, MA 02139. torralba@ai.mit.edu
Venue:
International Journal of Computer Vision
Year:
2001

Citing 14
Cited 419

What does the retina know about natural scenes?

Neural Computation
Identifying high level features of texture perception

CVGIP: Graphical Models and Image Processing
What is the goal of sensory coding?

Neural Computation
Using Discriminant Eigenfeatures for Image Retrieval

IEEE Transactions on Pattern Analysis and Machine Intelligence
Probabilistic Visual Learning for Object Representation

IEEE Transactions on Pattern Analysis and Machine Intelligence
Structure driven image database retrieval

NIPS '97 Proceedings of the 1997 conference on Advances in neural information processing systems 10
Classification of scene photographs from local orientations features

Pattern Recognition Letters - Selected papers from the 11th scandinavian conference on image analysis
Pattern Recognition and Neural Networks

Pattern Recognition and Neural Networks
Indoor-Outdoor Image Classification

CAIVD '98 Proceedings of the 1998 International Workshop on Content-Based Access of Image and Video Databases (CAIVD '98)
Region-Based Image Querying

CAIVL '97 Proceedings of the 1997 Workshop on Content-Based Access of Image and Video Libraries (CBAIVL '97)
Configuration based scene classification and image indexing

CVPR '97 Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97)
Semantic Organization of Scenes Using Discriminant Structural Templates

ICCV '99 Proceedings of the International Conference on Computer Vision-Volume 2 - Volume 2
Content-Based Hierarchical Classification of Vacation Images

ICMCS '99 Proceedings of the IEEE International Conference on Multimedia Computing and Systems - Volume 2
Global semantic classification of scenes using power spectrum templates

IM'99 Proceedings of the 1999 international conference on Challenge of Image Retrieval

A New Pattern Representation Scheme Using Data Compression

IEEE Transactions on Pattern Analysis and Machine Intelligence
Depth Estimation from Image Structure

IEEE Transactions on Pattern Analysis and Machine Intelligence
Contextual Priming for Object Detection

International Journal of Computer Vision
Scene-Centered Description from Spatial Envelope Properties

BMCV '02 Proceedings of the Second International Workshop on Biologically Motivated Computer Vision
Manhattan world: orientation and outlier detection by Bayesian inference

Neural Computation
ERIC7: an experimental tool for Content-Based Image encoding and Retrieval under the MPEG-7 standard

WISICT '04 Proceedings of the winter international synposium on Information and communication technologies
Image retrieval and perceptual similarity

ACM Transactions on Applied Perception (TAP)
Categorization of natural scenes: local vs. global information

APGV '06 Proceedings of the 3rd symposium on Applied perception in graphics and visualization
Behavioral and Neuroimaging Evidence for a Contribution of Color and Texture Information to Scene Classification in a Patient with Visual Form Agnosia

Journal of Cognitive Neuroscience
A psychophysically plausible model for typicality ranking of natural scenes

ACM Transactions on Applied Perception (TAP)
Rapid Biologically-Inspired Scene Classification Using Features Shared with Visual Attention

IEEE Transactions on Pattern Analysis and Machine Intelligence
Semantic Modeling of Natural Scenes for Content-Based Image Retrieval

International Journal of Computer Vision
Segmentation and description of natural outdoor scenes

Image and Vision Computing
Review: Which is the best way to organize/classify images by content?

Image and Vision Computing
Categorization of natural scenes: Local versus global information and the role of color

ACM Transactions on Applied Perception (TAP)
Image annotation: which approach for realistic databases?

Proceedings of the 6th ACM international conference on Image and video retrieval
Multi-level local descriptor quantization for bag-of-visterms image representation

Proceedings of the 6th ACM international conference on Image and video retrieval
Recovering Surface Layout from an Image

International Journal of Computer Vision
Limits of Event-related Potential Differences in Tracking Object Processing Speed

Journal of Cognitive Neuroscience
Color conceptualization

Proceedings of the 15th international conference on Multimedia
LabelMe: A Database and Web-Based Tool for Image Annotation

International Journal of Computer Vision
Perception of complex aggregates

ACM SIGGRAPH 2008 papers
Unsupervised segmentation of natural images via lossy data compression

Computer Vision and Image Understanding
Scene modeling in global-local view for scene classification

CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
Continuous visual vocabulary modelsfor pLSA-based scene recognition

CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
Semantic spaces revisited: investigating the performance of auto-annotation and semantic retrieval using semantic spaces

CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
Semantic representation of multimedia content: Knowledge representation and semantic indexing

Multimedia Tools and Applications
Effectiveness of global features for automatic medical image classification and retrieval - The experiences of OHSU at ImageCLEFmed

Pattern Recognition Letters
Comparing Local Feature Descriptors in pLSA-Based Image Models

Proceedings of the 30th DAGM symposium on Pattern Recognition
An exploratory study on joint analysis of visual classification in narrow domains and the discriminative power of tags

MS '08 Proceedings of the 2nd ACM workshop on Multimedia semantics
Semantic object classes in video: A high-definition ground truth database

Pattern Recognition Letters
Modeling and Recognition of Landmark Image Collections Using Iconic Scene Graphs

ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part I
Context First

SSPR & SPR '08 Proceedings of the 2008 Joint IAPR International Workshop on Structural, Syntactic, and Statistical Pattern Recognition
Natural Versus Artificial Scene Classification by Ordering Discrete Fourier Power Spectra

SSPR & SPR '08 Proceedings of the 2008 Joint IAPR International Workshop on Structural, Syntactic, and Statistical Pattern Recognition
Using visual and text features for direct marketing on multimedia messaging services domain

Multimedia Tools and Applications
SemanGist: A Local Semantic Image Representation

PCM '08 Proceedings of the 9th Pacific Rim Conference on Multimedia: Advances in Multimedia Information Processing
Spatial Hierarchy of Textons Distributions for Scene Classification

MMM '09 Proceedings of the 15th International Multimedia Modeling Conference on Advances in Multimedia Modeling
Adaptively Combining Local with Global Information for Natural Scenes Categorization

IEICE - Transactions on Information and Systems
Mapping the world's photos

Proceedings of the 18th international conference on World wide web
Integrating Visual Context and Object Detection within a Probabilistic Framework

Attention in Cognitive Systems
Relative Influence of Bottom-Up and Top-Down Attention

Attention in Cognitive Systems
Recent Advances in Large Scale Image Search

Emerging Trends in Visual Computing
A Nonparametric Bayesian Learning Model: Application to Text and Image Categorization

PAKDD '09 Proceedings of the 13th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
ConVeS: a context verification framework for object recognition system

Proceedings of the 2009 conference on Information Science, Technology and Applications
Multi-view object representation with inverse difference pyramid decomposition

WAV'09 Proceedings of the 3rd WSEAS international symposium on Wavelets theory and applications in applied mathematics, signal processing & modern science
A descriptor for large scale image retrieval based on sketched feature lines

Proceedings of the 6th Eurographics Symposium on Sketch-Based Interfaces and Modeling
Unsupervised modeling of objects and their hierarchical contextual interactions

Journal on Image and Video Processing - Special issue on patches in vision
Content Based Image Retrieval Using Adaptive Inverse Pyramid Representation

Proceedings of the Symposium on Human Interface 2009 on Human Interface and the Management of Information. Information and Interaction. Part II: Held as part of HCI International 2009
Global Context Extraction for Object Recognition Using a Combination of Range and Visual Features

Dyn3D '09 Proceedings of the DAGM 2009 Workshop on Dynamic 3D Imaging
Scene classification using pLSA with visterm spatial location

IMCE '09 Proceedings of the 1st international workshop on Interactive multimedia for consumer electronics
Enhancing semantic and geographic annotation of web images via logistic canonical correlation regression

MM '09 Proceedings of the 17th ACM international conference on Multimedia
Event recognition from photo collections via PageRank

MM '09 Proceedings of the 17th ACM international conference on Multimedia
Personal photo album summarization

MM '09 Proceedings of the 17th ACM international conference on Multimedia
Dense sampling low-level statistics of local features

Proceedings of the ACM International Conference on Image and Video Retrieval
Evaluation of GIST descriptors for web-scale image search

Proceedings of the ACM International Conference on Image and Video Retrieval
Jointly optimising relevance and diversity in image retrieval

Proceedings of the ACM International Conference on Image and Video Retrieval
Subspace learning-based dimensionality reduction in building recognition

Neurocomputing
Tree detection from aerial imagery

Proceedings of the 17th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems
Parsimonious reduction of Gaussian mixture models with a variational-Bayes approach

Pattern Recognition
A first glimpse of cryptography's Holy Grail

Communications of the ACM
Using the forest to see the trees: exploiting context for visual object detection and localization

Communications of the ACM
Natural Scene Retrieval Based on Graph Semantic Similarity for Adaptive Scene Classification

ICCCI '09 Proceedings of the 1st International Conference on Computational Collective Intelligence. Semantic Web, Social Networks and Multiagent Systems
Randomized Probabilistic Latent Semantic Analysis for Scene Recognition

CIARP '09 Proceedings of the 14th Iberoamerican Conference on Pattern Recognition: Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications
A Simulated User Study of Image Browsing Using High-Level Classification

SAMT '09 Proceedings of the 4th International Conference on Semantic and Digital Media Technologies: Semantic Multimedia
Scene Categorization by Introducing Contextual Information to the Visual Words

ISVC '09 Proceedings of the 5th International Symposium on Advances in Visual Computing: Part I
Grouping and Summarizing Scene Images from Web Collections

ISVC '09 Proceedings of the 5th International Symposium on Advances in Visual Computing: Part II
Hierarchical System for Content Based Categorization and Orientation of Consumer Images

PReMI '09 Proceedings of the 3rd International Conference on Pattern Recognition and Machine Intelligence
Connecting people in photo-sharing sites by photo content and user annotations

ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
View-invariant object category learning, attention, recognition, search, and scene understanding

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Ranking canonical views for tourist attractions

Multimedia Tools and Applications
Improving Bag-of-Features for Large Scale Image Search

International Journal of Computer Vision
An Approach to the Parameterization of Structure for Fast Categorization

International Journal of Computer Vision
Visual place categorization: problem, dataset, and algorithm

IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
Parallel scene perception on various blurry images

Proceedings of the First International Conference on Internet Multimedia Computing and Service
Biased discriminant euclidean embedding for content-based image retrieval

IEEE Transactions on Image Processing
Scene categorization via contextual visual words

Pattern Recognition
Progressive randomization: Seeing the unseen

Computer Vision and Image Understanding
Image annotation with tagprop on the MIRFLICKR set

Proceedings of the international conference on Multimedia information retrieval
Beyond pixels: Exploiting camera metadata for photo classification

Pattern Recognition
Natural scene classification using overcomplete ICA

Pattern Recognition
Comparing compact codebooks for visual categorization

Computer Vision and Image Understanding
Learning natural scene categories by selective multi-scale feature extraction

Image and Vision Computing
Novel Gaussianized vector representation for improved natural scene categorization

Pattern Recognition Letters
LWDOS: language for writing descriptors of outline shapes

SCIA'03 Proceedings of the 13th Scandinavian conference on Image analysis
Social group suggestion from user image collections

Proceedings of the 19th international conference on World wide web
Deriving a priori co-occurrence probability estimates for object recognition from social networks and text processing

ISVC'07 Proceedings of the 3rd international conference on Advances in visual computing - Volume Part II
Model-based subspace clustering of non-Gaussian data

Neurocomputing
A framework for visual-context-aware object detection in still images

Computer Vision and Image Understanding
Speeding up top-down attention control learning by using full observation knowledge

CIRA'09 Proceedings of the 8th IEEE international conference on Computational intelligence in robotics and automation
Unsupervised multi-feature tag relevance learning for social image retrieval

Proceedings of the ACM International Conference on Image and Video Retrieval
Shadow edge detection using geometric and photometric features

ICIP'09 Proceedings of the 16th IEEE international conference on Image processing
Automatic discovery of image families: global vs. local features

ICIP'09 Proceedings of the 16th IEEE international conference on Image processing
Learning contextual rules for priming object categories in images

ICIP'09 Proceedings of the 16th IEEE international conference on Image processing
Mean shift feature space warping for relevance feedback

ICIP'09 Proceedings of the 16th IEEE international conference on Image processing
Seed image selection in interactive cosegmentation

ICIP'09 Proceedings of the 16th IEEE international conference on Image processing
Biologically inspired feature manifold for scene classification

IEEE Transactions on Image Processing
Scalable similarity search with optimized kernel hashing

Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Horizon estimation: perceptual and computational experiments

Proceedings of the 7th Symposium on Applied Perception in Graphics and Visualization
What your design looks like to peripheral vision

Proceedings of the 7th Symposium on Applied Perception in Graphics and Visualization
Interactively browsing large image collections

ACM SIGGRAPH 2010 Talks
Photo zoom: high resolution from unordered image collections

Proceedings of Graphics Interface 2010
Technical Section: An evaluation of descriptors for large-scale image retrieval from sketched feature lines

Computers and Graphics
Multi-view object representation with modified 2-layer IDP decomposition

WSEAS Transactions on Signal Processing
Mitosis sequence detection using hidden conditional random fields

ISBI'10 Proceedings of the 2010 IEEE international conference on Biomedical imaging: from nano to Macro
Multi-model classification method in heterogeneous image databases

Pattern Recognition
Cosaliency: where people look when comparing images

UIST '10 Proceedings of the 23nd annual ACM symposium on User interface software and technology
Near-duplicate keyframe retrieval by semi-supervised learning and nonrigid image matching

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Scene classification using multiple features in a two-stage probabilistic classification framework

Neurocomputing
Movie genre classification via scene categorization

Proceedings of the international conference on Multimedia
Boosting-based multiple kernel learning for image re-ranking

Proceedings of the international conference on Multimedia
Learning contextual metrics for automatic image annotation

PCM'10 Proceedings of the 11th Pacific Rim conference on Advances in multimedia information processing: Part I
Scene categorization using boosted back-propagation neural networks

PCM'10 Proceedings of the 11th Pacific Rim conference on Advances in multimedia information processing: Part I
Object, scene and actions: combining multiple features for human action recognition

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part I
Image-to-class distance metric learning for image classification

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part I
Extracting structures in image collections for object recognition

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part I
Efficient structure from motion by graph optimization

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part II
Why did the person cross the road (there)? scene understanding using probabilistic logic models and common sense reasoning

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part II
A data-driven approach for event prediction

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part II
Location recognition using prioritized feature matching

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part II
An eye fixation database for saliency detection in images

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Building Rome on a cloudless day

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Localizing objects while learning their appearance

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Efficiently scaling up video annotation with crowdsourced marketplaces

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Improving local descriptors by embedding global and local spatial information

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
What does classifying more than 10,000 image categories tell us?

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
Non-local characterization of scenery images: statistics, 3D reasoning, and a generative model

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
Supervised label transfer for semantic segmentation of street scenes

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
Constrained spectral clustering via exhaustive and efficient constraint propagation

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part VI
Learning pre-attentive driving behaviour from holistic visual features

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part VI
Predicting facial beauty without landmarks

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part VI
Bayesian fusion of camera metadata cues in semantic scene classification

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
Collection-based sparse label propagation and its application on social group suggestion from photos

ACM Transactions on Intelligent Systems and Technology (TIST)
Characteristic pattern discovery in videos

Proceedings of the Seventh Indian Conference on Computer Vision, Graphics and Image Processing
Multiple kernel learning for image indexing

Proceedings of the Seventh Indian Conference on Computer Vision, Graphics and Image Processing
Weakly supervised landmark labeling in searched data

ICIMCS '10 Proceedings of the Second International Conference on Internet Multimedia Computing and Service
Context modeling in computer vision: techniques, implications, and applications

Multimedia Tools and Applications
Geotagging in multimedia and computer vision--a survey

Multimedia Tools and Applications
VIRaL: Visual Image Retrieval and Localization

Multimedia Tools and Applications
Detection of visual concepts and annotation of images using ensembles of trees for hierarchical multi-label classification

ICPR'10 Proceedings of the 20th International conference on Recognizing patterns in signals, speech, images, and videos
Bayesian hybrid generative discriminative learning based on finite Liouville mixture models

Pattern Recognition
A unified context assessing model for object categorization

Computer Vision and Image Understanding
Sewing photos: smooth transition between photos

MMM'11 Proceedings of the 17th international conference on Advances in multimedia modeling - Volume Part I
Bottom-up saliency detection model based on amplitude spectrum

MMM'11 Proceedings of the 17th international conference on Advances in multimedia modeling - Volume Part I
Exploiting Textons distributions on spatial hierarchy for scene classification

Journal on Image and Video Processing - Special issue on selected papers from multimedia modeling conference 2009
Video summarization via transferrable structured learning

Proceedings of the 20th international conference on World wide web
Man-made structure detection in natural images using a causal multiscale random field

CVPR'03 Proceedings of the 2003 IEEE computer society conference on Computer vision and pattern recognition
Online learning for PLSA-based visual recognition

ACCV'10 Proceedings of the 10th Asian conference on Computer vision - Volume Part II
Indoor scene classification using combined 3D and gist features

ACCV'10 Proceedings of the 10th Asian conference on Computer vision - Volume Part II
Multi-class leveraged k-NN for image classification

ACCV'10 Proceedings of the 10th Asian conference on Computer vision - Volume Part III
Image classification using spatial pyramid coding and visual word reweighting

ACCV'10 Proceedings of the 10th Asian conference on Computer vision - Volume Part III
Improved spatial pyramid matching for image classification

ACCV'10 Proceedings of the 10th Asian conference on Computer vision - Volume Part IV
Region Contextual Visual Words for scene categorization

Expert Systems with Applications: An International Journal
Integrated image representation based natural scene classification

Expert Systems with Applications: An International Journal
Summarization of personal photologs using multidimensional content and context

Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Lost in binarization: query-adaptive ranking for similar image search with compact codes

Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Saliency moments for image categorization

Proceedings of the 1st ACM International Conference on Multimedia Retrieval
A polynomial characterization of hypergraphs using the Ihara zeta function

Pattern Recognition
Content-based image retrieval with relevance feedback using random walks

Pattern Recognition
Gabor descriptors for aerial image classification

ICANNGA'11 Proceedings of the 10th international conference on Adaptive and natural computing algorithms - Volume Part II
Improved learning of I2C distance and accelerating the neighborhood search for image classification

Pattern Recognition
PLBP: An effective local binary patterns texture descriptor with pyramid representation

Pattern Recognition
Semantics extraction from images

Knowledge-driven multimedia information extraction and ontology evolution
Real-time detection of landscape scenes

SCIA'11 Proceedings of the 17th Scandinavian conference on Image analysis
Malware images: visualization and automatic classification

Proceedings of the 8th International Symposium on Visualization for Cyber Security
Nonlinear discriminative embedding for clustering via spectral regularization

PAKDD'11 Proceedings of the 15th Pacific-Asia conference on Advances in knowledge discovery and data mining - Volume Part I
An iterated graph laplacian approach for ranking on manifolds

Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Feature selection for unlabeled data

ICSI'11 Proceedings of the Second international conference on Advances in swarm intelligence - Volume Part II
perception-based design for tele-presence

PReMI'11 Proceedings of the 4th international conference on Pattern recognition and machine intelligence
Hierarchical spatial matching kernel for image categorization

ICIAR'11 Proceedings of the 8th international conference on Image analysis and recognition - Volume Part I
Building global image features for scene recognition

Pattern Recognition
Iconizer: a framework to identify and create effective representations for visual information encoding

SG'11 Proceedings of the 11th international conference on Smart graphics
Supervised visual vocabulary with category information

ACIVS'11 Proceedings of the 13th international conference on Advanced concepts for intelligent vision systems
A fuzzy set approach for shape-based image annotation

WILF'11 Proceedings of the 9th international conference on Fuzzy logic and applications
Modeling and Recognition of Landmark Image Collections Using Iconic Scene Graphs

International Journal of Computer Vision
Multiple region categorization for scenery images

ICIAP'11 Proceedings of the 16th international conference on Image analysis and processing: Part I
Sum-of-superellipses: a low parameter model for amplitude spectra of natural images

ICIAP'11 Proceedings of the 16th international conference on Image analysis and processing: Part I
Evaluation of global descriptors for large scale image retrieval

ICIAP'11 Proceedings of the 16th international conference on Image analysis and processing: Part I
Exploiting depth information for indoor-outdoor scene classification

ICIAP'11 Proceedings of the 16th international conference on Image analysis and processing - Volume Part II
Towards a universal and limited visual vocabulary

ISVC'11 Proceedings of the 7th international conference on Advances in visual computing - Volume Part II
Evaluating feature combination in object classification

ISVC'11 Proceedings of the 7th international conference on Advances in visual computing - Volume Part II
Judging a site by its content: learning the textual, structural, and visual features of malicious web pages

Proceedings of the 4th ACM workshop on Security and artificial intelligence
A comparative assessment of malware classification using binary texture analysis and dynamic analysis

Proceedings of the 4th ACM workshop on Security and artificial intelligence
A hierarchical latent topic model based on sparse coding

Neurocomputing
Efficient approximate nearest neighbor search with integrated binary codes

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Hybrid image summarization

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Combining latent semantic learning and reduced hypergraph learning for semi-supervised image categorization

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Exploring self-similarities of bag-of-features for image classification

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Marginal-based visual alphabets for local image descriptors aggregation

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Bag-of-colors for improved image search

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Asymmetric hamming embedding: taking the best of our bits for large scale image search

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Spatial pooling for transformation invariant image representation

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Automatic sentence generation from images

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Active query sensing for mobile location search

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Personalizing automated image annotation using cross-entropy

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Bilinear deep learning for image classification

MM '11 Proceedings of the 19th ACM international conference on Multimedia
IM2MAP: deriving maps from georeferenced community contributed photo collections

WSM '11 Proceedings of the 3rd ACM SIGMM international workshop on Social media
Around the world in 80 seconds

SIGGRAPH Asia 2011 Posters
New color image histogram-based detectors

IVIC'11 Proceedings of the Second international conference on Visual informatics: sustaining research and innovations - Volume Part I
Using eye-tracking to assess different image retargeting methods

Proceedings of the ACM SIGGRAPH Symposium on Applied Perception in Graphics and Visualization
Tms to the lateral occipital cortex disrupts object processing but facilitates scene processing

Journal of Cognitive Neuroscience
Fusion of region and image-based techniques for automatic image annotation

MMM'07 Proceedings of the 13th international conference on Multimedia Modeling - Volume Part I
Scene classification via pLSA

ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part IV
Automated image annotation using global features and robust nonparametric density estimation

CIVR'05 Proceedings of the 4th international conference on Image and Video Retrieval
Feature fusion within local region using localized maximum-margin learning for scene categorization

Pattern Recognition
Sorting unorganized photo sets for urban reconstruction

Graphical Models
The rapid extraction of gist-early neural correlates of high-level visual processing

Journal of Cognitive Neuroscience
The Visual Extent of an Object

International Journal of Computer Vision
Corpus-guided sentence generation of natural images

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Graph transduction as a noncooperative game

Neural Computation
Incorporating spatial correlogram into bag-of-features model for scene categorization

ACCV'09 Proceedings of the 9th Asian conference on Computer Vision - Volume Part I
Image label completion by pursuing contextual decomposability

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Estimating size fraction categories of coal particles on conveyor belts using image texture modeling methods

Expert Systems with Applications: An International Journal
Scene location guide by image-based retrieval

MMM'10 Proceedings of the 16th international conference on Advances in Multimedia Modeling
Which tags are related to visual content?

MMM'10 Proceedings of the 16th international conference on Advances in Multimedia Modeling
A novel retrieval framework using classification, feature selection and indexing structure

MMM'10 Proceedings of the 16th international conference on Advances in Multimedia Modeling
Scene gist: a holistic generative model of natural image

ACCV'09 Proceedings of the 9th Asian conference on Computer Vision - Volume Part II
Image classification based on weighted topics

ICONIP'11 Proceedings of the 18th international conference on Neural Information Processing - Volume Part II
Histogram of Oriented Uniform Patterns for robust place recognition and categorization

International Journal of Robotics Research
Semantic parsing of street scenes from video

International Journal of Robotics Research
Spatial color image segmentation based on finite non-Gaussian mixture models

Expert Systems with Applications: An International Journal
Sketch-based shape retrieval

ACM Transactions on Graphics (TOG) - SIGGRAPH 2012 Conference Proceedings
Automatic context analysis for image classification and retrieval

ICIC'11 Proceedings of the 7th international conference on Advanced Intelligent Computing
A unified approach to learning task-specific bit vector representations for fast nearest neighbor search

Proceedings of the 21st international conference on World Wide Web
Visual vocabulary optimization with spatial context for image annotation and classification

MMM'12 Proceedings of the 18th international conference on Advances in Multimedia Modeling
Double fusion for multimedia event detection

MMM'12 Proceedings of the 18th international conference on Advances in Multimedia Modeling
A model for the qualitative description of images based on visual and spatial features

Computer Vision and Image Understanding
A local spectral distribution approach to face recognition

Computer Vision and Image Understanding
Modulating Shape Features by Color Attention for Object Recognition

International Journal of Computer Vision
Real-time estimation of 3D scene geometry from a single image

Pattern Recognition
Scene categorization based on integrated feature description and local weighted feature mapping

Computers and Electrical Engineering
Halfway through the semantic gap: Prosemantic features for image retrieval

Information Sciences: an International Journal
Multi-scale gist feature manifold for building recognition

Neurocomputing
Nearest-Neighbor based Metric Functions for indoor scene recognition

Computer Vision and Image Understanding
Global localization with non-quantized local image features

Robotics and Autonomous Systems
3D Material Style Transfer

Computer Graphics Forum
SUPER: towards real-time event recognition in internet videos

Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
Color CENTRIST: a color descriptor for scene categorization

Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
Multimodal feature generation framework for semantic image classification

Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
Learning by expansion: Exploiting social media for image classification with few training examples

Neurocomputing
Fast shared boosting for large-scale concept detection

Multimedia Tools and Applications
Fast semi-supervised clustering with enhanced spectral embedding

Pattern Recognition
Energy Conservation for Image Retrieval on Mobile Systems

ACM Transactions on Embedded Computing Systems (TECS)
Manhattan hashing for large-scale image retrieval

SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Active query sensing: Suggesting the best query view for mobile visual search

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP) - Special section of best papers of ACM multimedia 2011, and special section on 3D mobile multimedia
Biologically motivated local contextual modulation improves low-level visual feature representations

ICIAR'12 Proceedings of the 9th international conference on Image Analysis and Recognition - Volume Part I
Compact and adaptive spatial pyramids for scene recognition

Image and Vision Computing
Scene classification using a multi-resolution bag-of-features model

Pattern Recognition
Automatically characterizing places with opportunistic crowdsensing using smartphones

Proceedings of the 2012 ACM Conference on Ubiquitous Computing
Kinect image classification using LLC

Proceedings of the 4th International Conference on Internet Multimedia Computing and Service
Point-context descriptor based region search for logo recognition

Proceedings of the 4th International Conference on Internet Multimedia Computing and Service
Boosting k-NN for Categorization of Natural Scenes

International Journal of Computer Vision
Weakly Supervised Localization and Learning with Generic Knowledge

International Journal of Computer Vision
Feature space optimization for content-based image retrieval

ACM SIGAPP Applied Computing Review
Hierarchical Classifiers for Robust Topological Robot Localization

Journal of Intelligent and Robotic Systems
Face identification using reference-based features with message passing model

Neurocomputing
Distributional semantics in technicolor

ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
Sense beauty via face, dressing, and/or voice

Proceedings of the 20th ACM international conference on Multimedia
Submodular video hashing: a unified framework towards video pooling and indexing

Proceedings of the 20th ACM international conference on Multimedia
Efficient image annotation for automatic sentence generation

Proceedings of the 20th ACM international conference on Multimedia
DLMSearch: diversified landmark search by photo

Proceedings of the 20th ACM international conference on Multimedia
Sketch-based image retrieval on a large scale database

Proceedings of the 20th ACM international conference on Multimedia
Hierarchical Narrative Collage For Digital Photo Album

Computer Graphics Forum
Biologically inspired task oriented gist model for scene classification

Computer Vision and Image Understanding
Image collection summarization via dictionary learning for sparse representation

Pattern Recognition
Bag of spatio-visual words for context inference in scene classification

Pattern Recognition
A dictionary learning approach for classification: separating the particularity and the commonality

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part I
Leafsnap: a computer vision system for automatic plant species identification

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part II
Query specific fusion for image retrieval

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part II
Scene aligned pooling for complex video recognition

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part II
Randomized spatial partition for scene recognition

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part II
Negative evidences and co-occurences in image retrieval: the benefit of PCA and whitening

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part II
Nested pictorial structures

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part II
Dynamic eye movement datasets and learnt saliency models for visual action recognition

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part II
Annotation propagation in large image databases via dense image correspondence

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part III
Attributes for classifier feedback

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part III
Ensemble partitioning for unsupervised image categorization

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part III
Supervised geodesic propagation for semantic label transfer

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part III
Learning hybrid part filters for scene recognition

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part V
Latent pyramidal regions for recognizing scenes

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part V
A new biologically inspired color image descriptor

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part V
Local expert forest of score fusion for video event classification

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part V
PATCHMATCHGRAPH: building a graph of dense patch correspondences for label transfer

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part V
Sequential spectral learning to hash with multiple representations

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part V
Depth extraction from video using non-parametric sampling

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part V
Contextual object detection using set-based classification

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part VI
Dating historical color images

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part VI
Spring lattice counting grids: scene recognition using deformable positional constraints

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part VI
Efficient mining of repetitions in large-scale TV streams with product quantization hashing

ECCV'12 Proceedings of the 12th international conference on Computer Vision - Volume Part I
An efficient parallel strategy for matching visual self-similarities in large image databases

ECCV'12 Proceedings of the 12th international conference on Computer Vision - Volume Part I
Identification of illustrators

ECCV'12 Proceedings of the 12th international conference on Computer Vision - Volume Part I
Scene recognition on the semantic manifold

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part IV
Recognizing complex events using large margin joint low-level event model

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part IV
Beyond spatial pyramids: a new feature extraction framework with dense spatial sampling for image classification

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part IV
Fast tiered labeling with topological priors

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part IV
Unsupervised learning of discriminative relative visual attributes

ECCV'12 Proceedings of the 12th international conference on Computer Vision - Volume Part III
Explicit performance metric optimization for fusion-based video retrieval

ECCV'12 Proceedings of the 12th international conference on Computer Vision - Volume Part III
Unsupervised classemes

ECCV'12 Proceedings of the 12th international conference on Computer Vision - Volume Part III
Enhancing semantic features with compositional analysis for scene recognition

ECCV'12 Proceedings of the 12th international conference on Computer Vision - Volume Part III
Instant scene recognition on mobile platform

ECCV'12 Proceedings of the 12th international conference on Computer Vision - Volume Part III
ISABoost: A weak classifier inner structure adjusting based AdaBoost algorithm-ISABoost based application in scene categorization

Neurocomputing
SIFT match verification by geometric coding for large-scale partial-duplicate web image search

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Neti Neti: in search of deity

Proceedings of the Eighth Indian Conference on Computer Vision, Graphics and Image Processing
Novel color Gabor-LBP-PHOG (GLP) descriptors for object and scene image classification

Proceedings of the Eighth Indian Conference on Computer Vision, Graphics and Image Processing
Feature set reduction for image matching in large scale environments

Proceedings of the 27th Conference on Image and Vision Computing New Zealand
Scene classification based on category-specific representations created through prototype feature selection

Proceedings of the 27th Conference on Image and Vision Computing New Zealand
Gabor-Based novel local, shape and color features for image classification

ICONIP'12 Proceedings of the 19th international conference on Neural Information Processing - Volume Part III
A jensen-shannon kernel for hypergraphs

SSPR'12/SPR'12 Proceedings of the 2012 Joint IAPR international conference on Structural, Syntactic, and Statistical Pattern Recognition
Novel Gabor-PHOG features for object and scene image classification

SSPR'12/SPR'12 Proceedings of the 2012 Joint IAPR international conference on Structural, Syntactic, and Statistical Pattern Recognition
Location matters, especially for non-salient features-An eye-tracking study on the effects of web object placement on different types of websites

International Journal of Human-Computer Studies
Translating related words to videos and back through latent topics

Proceedings of the sixth ACM international conference on Web search and data mining
Fast organization of large photo collections using CUDA

ECCV'10 Proceedings of the 11th European conference on Trends and Topics in Computer Vision - Volume Part II
Objects as attributes for scene classification

ECCV'10 Proceedings of the 11th European conference on Trends and Topics in Computer Vision - Volume Part I
A generic model to compose vision modules for holistic scene understanding

ECCV'10 Proceedings of the 11th European conference on Trends and Topics in Computer Vision - Volume Part I
Self organizing natural scene image retrieval

Expert Systems with Applications: An International Journal
Efficiently Scaling up Crowdsourced Video Annotation

International Journal of Computer Vision
Hashing with cauchy graph

PCM'12 Proceedings of the 13th Pacific-Rim conference on Advances in Multimedia Information Processing
Multimedia event detection using segment-based approach for motion feature

PCM'12 Proceedings of the 13th Pacific-Rim conference on Advances in Multimedia Information Processing
Improving image distance metric learning by embedding semantic relations

PCM'12 Proceedings of the 13th Pacific-Rim conference on Advances in Multimedia Information Processing
Location and route tracking in university from photos without GPS information

PCM'12 Proceedings of the 13th Pacific-Rim conference on Advances in Multimedia Information Processing
Instance-Level landmark labeling via multi-layer superpixels

PCM'12 Proceedings of the 13th Pacific-Rim conference on Advances in Multimedia Information Processing
An interactive semi-supervised approach for automatic image annotation

PCM'12 Proceedings of the 13th Pacific-Rim conference on Advances in Multimedia Information Processing
Learning image-to-class distance metric for image classification

ACM Transactions on Intelligent Systems and Technology (TIST) - Special section on agent communication, trust in multiagent systems, intelligent tutoring and coaching systems
Learning saliency-based visual attention: A review

Signal Processing
Boosted key-frame selection and correlated pyramidal motion-feature representation for human action recognition

Pattern Recognition
Learning dictionary on manifolds for image classification

Pattern Recognition
Image region description using orthogonal combination of local binary patterns enhanced with color information

Pattern Recognition
Heterogeneous bag-of-features for object/scene recognition

Applied Soft Computing
How do image complexity, task demands and looking biases influence human gaze behavior?

Pattern Recognition Letters
Autonomous place naming system using opportunistic crowdsensing and knowledge from crowdsourcing

Proceedings of the 12th international conference on Information processing in sensor networks
Direct modeling of image keypoints distribution through copula-based image signatures

Proceedings of the 3rd ACM conference on International conference on multimedia retrieval
Clothing-to-words mapping using word separation method

Computers and Electrical Engineering
Dual local consistency hashing with discriminative projections selection

Signal Processing
Residual enhanced visual vector as a compact signature for mobile visual search

Signal Processing
Webzeitgeist: design mining the web

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
LSH-based large scale chinese calligraphic character recognition

Proceedings of the 13th ACM/IEEE-CS joint conference on Digital libraries
Transcranial magnetic stimulation to the transverse occipital sulcus affects scene but not object processing

Journal of Cognitive Neuroscience
Beyond dataset bias: multi-task unaligned shared knowledge transfer

ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part I
Cross-Database transfer learning via learnable and discriminant error-correcting output codes

ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part I
Tell me what you like and i'll tell you what you are: discriminating visual preferences on flickr data

ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part I
Local hypersphere coding based on edges between visual words

ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part I
Semi-Supervised learning on a budget: scaling up to large datasets

ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part I
Relative forest for attribute prediction

ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part I
Quadra-Embedding: binary code embedding with low quantization error

ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part II
A picture is worth a thousand tags: automatic web based image tag expansion

ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part II
Hierarchical space tiling for scene modeling

ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part II
Classifying images at scene level: comparing global and local descriptors

AMR'11 Proceedings of the 9th international conference on Adaptive Multimedia Retrieval: large-scale multimedia retrieval and evaluation
Neighbourhood preserving quantisation for LSH

Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Image search—from thousands to billions in 20 years

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP) - Special Sections on the 20th Anniversary of ACM International Conference on Multimedia, Best Papers of ACM Multimedia 2012
WaveLBP based hierarchical features for image classification

Pattern Recognition Letters
Bubble space and place representation in topological maps

International Journal of Robotics Research
A semi-supervised feature selection method using a non-parametric technique with pairwise instance constraints

Journal of Information Science
Towards social imagematics: sentiment analysis in social multimedia

Proceedings of the Thirteenth International Workshop on Multimedia Data Mining
Towards decrypting attractiveness via multi-modality cues

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Attribit: content creation with semantic attributes

Proceedings of the 26th annual ACM symposium on User interface software and technology
Sentribute: image sentiment analysis from a mid-level perspective

Proceedings of the Second International Workshop on Issues of Sentiment Discovery and Opinion Mining
Topology preserving hashing for similarity search

Proceedings of the 21st ACM international conference on Multimedia
Building holistic descriptors for scene recognition: a multi-objective genetic programming approach

Proceedings of the 21st ACM international conference on Multimedia
Enabling low bitrate mobile visual recognition: a performance versus bandwidth evaluation

Proceedings of the 21st ACM international conference on Multimedia
Static saliency vs. dynamic saliency: a comparative study

Proceedings of the 21st ACM international conference on Multimedia
Segmental multi-way local pooling for video recognition

Proceedings of the 21st ACM international conference on Multimedia
Superpixel segmentation based structural scene recognition

Proceedings of the 21st ACM international conference on Multimedia
Relative spatial features for image memorability

Proceedings of the 21st ACM international conference on Multimedia
AdVisual: a visual-based advertising system

Proceedings of the 21st ACM international conference on Multimedia
Unveiling the multimedia unconscious: implicit cognitive processes and multimedia content analysis

Proceedings of the 21st ACM international conference on Multimedia
Large-scale visual sentiment ontology and detectors using adjective noun pairs

Proceedings of the 21st ACM international conference on Multimedia
Accurate and efficient cross-domain visual matching leveraging multiple feature representations

The Visual Computer: International Journal of Computer Graphics
Towards metric fusion on multi-view data: a cross-view based graph random walk approach

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
A novel unsupervised approach for multilevel image clustering from unordered image collection

Frontiers of Computer Science: Selected Publications from Chinese Universities
Indoor scene recognition by a mobile robot through adaptive object detection

Robotics and Autonomous Systems
Predicting retweet count using visual cues

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Human-inspired features for natural scene classification

Pattern Recognition Letters
PatchNet: a patch-based image representation for interactive library-driven image editing

ACM Transactions on Graphics (TOG)
Projective analysis for 3D shape segmentation

ACM Transactions on Graphics (TOG)
Exploiting socially-generated side information in dimensionality reduction

Proceedings of the 2nd international workshop on Socially-aware multimedia
Malware analysis method using visualization of binary files

Proceedings of the 2013 Research in Adaptive and Convergent Systems
Learning from contextual information of geo-tagged web photos to rank personalized tourism attractions

Neurocomputing
Hypergraph Spectral Hashing for image retrieval with heterogeneous social contexts

Neurocomputing
Scene image retrieval via re-ranking semantic and packed dense interestpoints

Neurocomputing
A new method of image classification based on local appearance and context information

Neurocomputing
Object class detection: A survey

ACM Computing Surveys (CSUR)
SigMal: a static signal processing based malware triage

Proceedings of the 29th Annual Computer Security Applications Conference
A novel ensemble algorithm for tumor classification

ISNN'13 Proceedings of the 10th international conference on Advances in Neural Networks - Volume Part II
An efficient access method for multimodal video retrieval

Proceedings of the 19th Brazilian symposium on Multimedia and the web
Line image signature for scene understanding with a wearable vision system

Proceedings of the 4th International SenseCam & Pervasive Imaging Conference
Using objective ground-truth labels created by multiple annotators for improved video classification: A comparative study

Computer Vision and Image Understanding
An experimental study on the universality of visual vocabularies

Journal of Visual Communication and Image Representation
Saliency-Based region log covariance feature for image copy detection

IWDW'12 Proceedings of the 11th international conference on Digital Forensics and Watermaking
Content-based diversifying leaf image retrieval

ICIC'13 Proceedings of the 9th international conference on Intelligent Computing Theories and Technology
A mass spectra-based compound-identification approach with a reduced reference library

ICIC'13 Proceedings of the 9th international conference on Intelligent Computing Theories and Technology
Mesh saliency via spectral processing

ACM Transactions on Graphics (TOG)
Scene classification using multi-resolution low-level feature combination

Neurocomputing
Large-scale image retrieval based on boosting iterative quantization hashing with query-adaptive reranking

Neurocomputing
Recognizing architecture styles by hierarchical sparse coding of blocklets

Information Sciences: an International Journal
Active learning with multi-label SVM classification

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Learning descriptive visual representation by semantic regularized matrix factorization

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Multi-view K-means clustering on big data

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Mixed image-keyword query adaptive hashing over multilabel images

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
A spatio-temporal Long-term Memory approach for visual place recognition in mobile robotic navigation

Robotics and Autonomous Systems
A classification-oriented dictionary learning model: Explicitly learning the particularity and commonality across categories

Pattern Recognition
Multiple feature kernel hashing for large-scale visual search

Pattern Recognition
Classifying web videos using a global video descriptor

Machine Vision and Applications
Object segmentation and classification using 3-D range camera

Journal of Visual Communication and Image Representation
Contextual object category recognition for RGB-D scene labeling

Robotics and Autonomous Systems
Automated image analysis framework for high-throughput determination of grapevine berry sizes using conditional random fields

Computers and Electronics in Agriculture
Low-cost prioritization of image blocks in wireless sensor networks for border surveillance

Journal of Network and Computer Applications
What makes an image popular?

Proceedings of the 23rd international conference on World wide web
Globality and locality incorporation in distance metric learning

Neurocomputing
Multimedia event detection with multimodal feature fusion and temporal concept localization

Machine Vision and Applications
Multimedia Event Detection Using Segment-Based Approach for Motion Feature

Journal of Signal Processing Systems
New color GPHOG descriptors for object and scene image classification

Machine Vision and Applications
Branch&Rank for Efficient Object Detection

International Journal of Computer Vision
Film segmentation and indexing using autoassociative neural networks

International Journal of Speech Technology
Object Bank: An Object-Level Image Representation for High-Level Visual Recognition

International Journal of Computer Vision
Multi-objective optimization based color constancy

Applied Soft Computing
Unsupervised manifold learning using Reciprocal kNN Graphs in image re-ranking and rank aggregation tasks

Image and Vision Computing
A Multi-View Embedding Space for Modeling Internet Images, Tags, and Their Semantics

International Journal of Computer Vision
Enhancing K-Means using class labels

Intelligent Data Analysis
Efficient binary code indexing with pivot based locality sensitive clustering

Multimedia Tools and Applications
HWVP: hierarchical wavelet packet descriptors and their applications in scene categorization and semantic concept retrieval

Multimedia Tools and Applications

Quantified Score

Hi-index	0.04

Visualization

Abstract

In this paper, we propose a computational model of the recognition of real world scenes that bypasses the segmentation and the processing of individual objects or regions. The procedure is based on a very low dimensional representation of the scene, that we term the Spatial Envelope. We propose a set of perceptual dimensions (naturalness, openness, roughness, expansion, ruggedness) that represent the dominant spatial structure of a scene. Then, we show that these dimensions may be reliably estimated using spectral and coarsely localized information. The model generates a multidimensional space in which scenes sharing membership in semantic categories (e.g., streets, highways, coasts) are projected closed together. The performance of the spatial envelope model shows that specific information about object shape or identity is not a requirement for scene categorization and that modeling a holistic representation of the scene informs about its probable semantic category.