Combining image descriptors to effectively retrieve events from visual lifelogs

  • Authors:
  • Aiden R. Doherty; Ciarán Ó Conaire; Michael Blighe; Alan F. Smeaton; Noel E. O'Connor

  • Affiliations:
  • Dublin City University, Dublin, Ireland (all authors)

  • Venue:
  • MIR '08: Proceedings of the 1st ACM International Conference on Multimedia Information Retrieval
  • Year:
  • 2008

Abstract

The SenseCam is a wearable camera that passively captures approximately 3,000 images per day, or almost one million images per year. It creates a personal visual record of the wearer's life and generates information that can serve as a human memory aid. For such a large amount of visual information to be of any use, it is generally accepted that it should be structured into "events", of which there are about 8,000 in a wearer's average year. Once SenseCam images have been automatically segmented into events, it becomes useful for wearers to locate other events similar to a given one, e.g. "what other times was I walking in the park?" or "show me other events when I was in a restaurant". On two datasets of 240k and 1.8M images, containing topics with a variety of information needs, we evaluate the fusion of MPEG-7, SIFT, and SURF content-based retrieval techniques to address this event search problem. We find that our proposed fusion of MPEG-7 and SURF offers an improvement over using either of those sources, or SIFT, individually, and we also show that how a lifelog event is modeled has a large effect on retrieval performance.
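The abstract does not spell out the fusion mechanism, but a common way to combine heterogeneous descriptors such as MPEG-7 and SURF is score-level fusion: each descriptor produces a per-event similarity score against the query event, the scores are normalised to a common range, and a weighted sum determines the final ranking. The sketch below is illustrative only; the function names, min-max normalisation, and equal weights are assumptions, not the authors' published method.

```python
import numpy as np

def min_max_normalise(scores):
    """Rescale a vector of similarity scores to [0, 1]."""
    scores = np.asarray(scores, dtype=float)
    lo, hi = scores.min(), scores.max()
    if hi == lo:
        return np.zeros_like(scores)
    return (scores - lo) / (hi - lo)

def fuse_event_scores(mpeg7_scores, surf_scores, w_mpeg7=0.5, w_surf=0.5):
    """Combine per-event similarity scores from two descriptors by a
    normalised weighted sum and return event indices, best match first."""
    fused = (w_mpeg7 * min_max_normalise(mpeg7_scores)
             + w_surf * min_max_normalise(surf_scores))
    return np.argsort(-fused)

# Example: rank 5 candidate events against a query event.
mpeg7 = [0.12, 0.80, 0.35, 0.60, 0.05]   # e.g. colour/texture similarity
surf  = [3.0, 45.0, 10.0, 52.0, 1.0]     # e.g. matched keypoint counts
print(fuse_event_scores(mpeg7, surf))    # ranking of event indices
```

Normalisation matters here because descriptor scores live on very different scales (e.g. distances in [0, 1] versus raw keypoint-match counts); without it, one descriptor would dominate the fused ranking regardless of the weights chosen.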