Designing LoL@, a Mobile Tourist Guide for UMTS
Mobile HCI '02 Proceedings of the 4th International Symposium on Mobile Human-Computer Interaction
A Visual Vocabulary for Flower Classification
CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Searching the web with mobile images for location recognition
CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
Photo-based question answering
MM '08 Proceedings of the 16th ACM international conference on Multimedia
Proxima: a mobile augmented-image search system
MM '08 Proceedings of the 16th ACM international conference on Multimedia
Versatile question answering systems: seeing in synthesis
International Journal of Intelligent Information and Database Systems
Finding experts in tag based knowledge sharing communities
KSEM'11 Proceedings of the 5th international conference on Knowledge Science, Engineering and Management
Context-Aware Expert Finding in Tag Based Knowledge Sharing Communities
International Journal of Knowledge and Systems Science
Hi-index | 0.00 |
This paper introduces multimodal question answering, a new interface for community-based question answering services. By offering users an extra modality---photos---in addition to the text modality to formulate queries, multimodal question answering overcomes the limitations of text-only input methods when the users ask questions regarding visually distinctive objects. Such interface is especially useful when users become curious about an interesting object in the environment and want to know about it---simply by taking a photo and asking a question in a situated (from a mobile device) and intuitive (without describing the object in words) manner. We propose a system architecture for multimodal question answering, describe an algorithm for searching the database, and report on the findings of two prototype studies.