Faceted search and browsing of audio content on spoken web

Authors:
Mamadou Diao;Sougata Mukherjea;Nitendra Rajput;Kundan Srivastava
Affiliations:
Georgia Institute of Technology, Atlanta, GA, USA;IBM Research, New Delhi, India;IBM Research, New Delhi, India;IBM Research, New Delhi, India
Venue:
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Year:
2010

Abstract

Spoken Web is a web of VoiceSites that can be accessed by phone. The content of a VoiceSite is audio, so Spoken Web provides an alternative to the World Wide Web (WWW) in developing regions, where low Internet penetration and low literacy are barriers to accessing the conventional WWW. Searching audio content in Spoken Web through an audio query-result interface presents two key challenges: the indexing of audio content is not accurate, and results presented in audio are sequential and therefore cumbersome to browse. In this paper, we apply the concepts of faceted search and browsing to the Spoken Web search problem. We use facets to index the metadata associated with the audio content, and we provide a mechanism to rank the facets based on the search results. We develop an interactive query interface that enables easy browsing of search results through the top-ranked facets. To our knowledge, this is the first system to use facets in audio search, and the first solution that provides audio search for the rural population. We present quantitative results to illustrate the accuracy and effectiveness of the faceted search, and qualitative results to highlight the usability of the interactive browsing system. The experiments were conducted on more than 4000 audio documents collected from a live Spoken Web VoiceSite, and evaluations were carried out with 40 farmers who are the target users of the VoiceSite.
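The abstract does not say how facets are ranked, so the following is only an illustrative sketch of one common heuristic, not the authors' mechanism. It assumes each search result carries a small metadata record, and it ranks facets by the Shannon entropy of their values over the current result set. A facet that splits the results more evenly narrows them faster when results must be read out sequentially. The facet names (`crop`, `district`, `type`), the sample records, and the helpers `facet_entropy`, `rank_facets`, and `narrow` are all hypothetical.

```python
import math
from collections import Counter


def facet_entropy(results, facet):
    """Shannon entropy (in bits) of a facet's values over the result set."""
    counts = Counter(r[facet] for r in results if facet in r)
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def rank_facets(results, facets):
    """Order facets from most to least discriminating (assumed heuristic)."""
    return sorted(facets, key=lambda f: facet_entropy(results, f), reverse=True)


def narrow(results, facet, value):
    """Keep only the results whose metadata matches the chosen facet value."""
    return [r for r in results if r.get(facet) == value]


# Hypothetical metadata for four audio documents returned by a query.
results = [
    {"id": 1, "crop": "cotton", "district": "A", "type": "question"},
    {"id": 2, "crop": "wheat", "district": "A", "type": "question"},
    {"id": 3, "crop": "cotton", "district": "A", "type": "answer"},
    {"id": 4, "crop": "cumin", "district": "A", "type": "answer"},
]

ranked = rank_facets(results, ["district", "type", "crop"])
print(ranked)
print([r["id"] for r in narrow(results, ranked[0], "cotton")])
```

In a phone-based interface, the top-ranked facet would be offered to the caller first, and `narrow` would shorten the list of clips to play after each choice. A facet with a single value across all results (here `district`) has zero entropy and ranks last, since selecting it cannot reduce the list.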