Overview of the CLEF-2006 cross-language speech retrieval track

  • Authors:
  • Douglas W. Oard; Jianqiang Wang; Gareth J. F. Jones; Ryen W. White; Pavel Pecina; Dagobert Soergel; Xiaoli Huang; Izhak Shafran

  • Affiliations:
  • College of Information Studies and Institute for Advanced Computer Studies, University of Maryland, College Park, MD; Department of Library and Information Studies, State University of New York at Buffalo, Buffalo, NY; School of Computing, Dublin City University, Dublin 9, Ireland; Microsoft Research, Redmond, WA; Microsoft Research, Redmond, WA; MFF UK, Praha 1, Czech Republic; MFF UK, Praha 1, Czech Republic; College of Information Studies, University of Maryland, College Park, MD

  • Venue:
  • CLEF'06: Proceedings of the 7th International Conference on Cross-Language Evaluation Forum: Evaluation of Multilingual and Multi-modal Information Retrieval
  • Year:
  • 2006


Abstract

The CLEF-2006 Cross-Language Speech Retrieval (CL-SR) track included two tasks: identifying topically coherent segments of English interviews in a known-boundary condition, and identifying time stamps that mark the beginning of topically relevant passages in Czech interviews in an unknown-boundary condition. Five teams participated in the English evaluation, performing both monolingual and cross-language searches over speech recognition transcripts, automatically generated metadata, and manually generated metadata. Results indicate that the 2006 English evaluation topics were more challenging than those used in 2005, but that cross-language searching continued to pose no unusual challenges when compared with monolingual searches of the same collection. Three teams participated in the monolingual Czech evaluation, which used a new evaluation measure based on differences between system-suggested and ground-truth replay start times; the results were broadly comparable to those observed for English.