Mining GPS traces and visual words for event classification

  • Authors:
  • Junsong Yuan; Jiebo Luo; Henry Kautz; Ying Wu

  • Affiliations:
  • Northwestern University, Evanston, IL, USA; Kodak Labs, Rochester, NY, USA; University of Rochester, Rochester, NY, USA; Northwestern University, Evanston, IL, USA

  • Venue:
  • MIR '08 Proceedings of the 1st ACM international conference on Multimedia information retrieval
  • Year:
  • 2008

Abstract

It is of great interest to recognize semantic events (e.g., hiking, skiing, party), in particular when given a collection of personal photos in which each photo is tagged with a timestamp and GPS (Global Positioning System) coordinates at capture time. We address this emerging multiclass classification problem by mining informative features derived from traces of GPS coordinates and from a bag of visual words, both based on the entire collection rather than on individual photos. Considering that semantic events are best characterized by a compositional description of the visual content in terms of the co-occurrence of objects and scenes, we focus on mining compositional features (equivalent to word combinations in the "bag-of-words" method) that have better discriminative and descriptive abilities than individual features. To handle the combinatorial complexity of discovering such compositional features, we apply a data mining method based on frequent itemset mining (FIM). Complementary features are also derived from GPS traces and mined to characterize the underlying movement patterns of various event types. After compositional feature mining, we perform multiclass AdaBoost to solve the multiclass problem. On a dataset of eight event classes comprising more than 3000 geotagged images from 88 events, experimental results using leave-one-out cross validation demonstrate the synergy of all of the components in the proposed approach to event classification.
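The compositional-feature mining step rests on frequent itemset mining: each event's photo set can be treated as a "transaction" of visual-word IDs, and word combinations that recur in a sufficient fraction of transactions become candidate compositional features. As a rough illustration only (not the paper's implementation; the function name, toy transactions, and support threshold below are hypothetical), an Apriori-style miner can be sketched as:

```python
from itertools import combinations

def frequent_itemsets(transactions, min_support):
    """Apriori-style frequent itemset mining.

    transactions: iterable of collections of visual-word IDs (one per event).
    Returns a dict mapping each frequent itemset (frozenset) to its support,
    i.e., the fraction of transactions that contain it.
    """
    transactions = [frozenset(t) for t in transactions]
    n = len(transactions)

    def support(s):
        return sum(s <= t for t in transactions) / n

    # Level 1: frequent single visual words.
    singles = {frozenset([w]) for t in transactions for w in t}
    current = [s for s in singles if support(s) >= min_support]
    result = {}
    k = 1
    while current:
        result.update({s: support(s) for s in current})
        # Join frequent k-itemsets to form (k+1)-item candidates,
        # then prune by support (the anti-monotonicity of support
        # guarantees no frequent itemset is missed).
        candidates = {a | b for a, b in combinations(current, 2)
                      if len(a | b) == k + 1}
        current = [c for c in candidates if support(c) >= min_support]
        k += 1
    return result

# Toy example: 4 "events", visual words coded as integers.
patterns = frequent_itemsets([{1, 2, 3}, {1, 2}, {2, 3}, {1, 2, 3}],
                             min_support=0.5)
```

Word pairs and triples surviving the support threshold (e.g. `{1, 2}` above, present in 3 of 4 transactions) play the role of the compositional features fed to the downstream classifier.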
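On the GPS side, the abstract states only that trace-derived features characterize movement patterns of event types, without specifying the descriptors. As an illustrative assumption (not the paper's feature set), simple movement statistics such as total path length, net displacement, and mean speed can be computed from timestamped coordinates using the haversine great-circle distance:

```python
from math import radians, sin, cos, asin, sqrt

def haversine_km(p, q):
    """Great-circle distance in km between two (lat, lon) points in degrees."""
    lat1, lon1, lat2, lon2 = map(radians, (p[0], p[1], q[0], q[1]))
    a = (sin((lat2 - lat1) / 2) ** 2
         + cos(lat1) * cos(lat2) * sin((lon2 - lon1) / 2) ** 2)
    return 2 * 6371.0 * asin(sqrt(a))  # mean Earth radius ~6371 km

def trace_features(trace):
    """trace: list of (lat, lon, unix_time) tuples sorted by time.

    Returns hypothetical movement descriptors: total path length (km),
    net start-to-end displacement (km), and mean speed (km/h).
    A hiking event would show high path length at low speed; a party
    would show near-zero displacement.
    """
    path = sum(haversine_km(trace[i][:2], trace[i + 1][:2])
               for i in range(len(trace) - 1))
    net = haversine_km(trace[0][:2], trace[-1][:2])
    hours = max((trace[-1][2] - trace[0][2]) / 3600.0, 1e-9)  # avoid /0
    return {"path_km": path, "net_km": net, "mean_kmh": path / hours}
```

Such per-event scalars complement the visual compositional features and can be discretized and mined, or appended directly to the feature vector given to the multiclass booster.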