Prosody-based automatic segmentation of speech into sentences and topics
Speech Communication - Special issue on accessing information in spoken audio
Adaptiver stochastischer Sprache/Pause-Detektor
Mustererkennung 1995, 17. DAGM-Symposium
Dialogue act modeling for automatic tagging and recognition of conversational speech
Computational Linguistics
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Minimal-impact audio-based personal archives
Proceedings of the the 1st ACM workshop on Continuous archival and retrieval of personal experiences
Multimodal group action clustering in meetings
Proceedings of the ACM 2nd international workshop on Video surveillance & sensor networks
Next-Generation Personal Memory Aids
BT Technology Journal
Automatic Analysis of Multimodal Group Actions in Meetings
IEEE Transactions on Pattern Analysis and Machine Intelligence
NAACL-Short '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology: companion volume of the Proceedings of HLT-NAACL 2003--short papers - Volume 2
Detection of agreement vs. disagreement in meetings: training with unlabeled data
NAACL-Short '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology: companion volume of the Proceedings of HLT-NAACL 2003--short papers - Volume 2
Extracting information from multimedia meeting collections
Proceedings of the 7th ACM SIGMM international workshop on Multimedia information retrieval
Directions for multi-party human-computer interaction research
HLT-NAACL-DIALOGUE '03 Proceedings of the HLT-NAACL 2003 workshop on Research directions in dialogue processing - Volume 7
Time is of the essence: an evaluation of temporal compression algorithms
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Speech-to-text transcription in support of pervasive computing
CRPIT '02 Proceedings of the 2002 conference on Pervasive computing - Volume 25
Making sense of engineering design review activities
Artificial Intelligence for Engineering Design, Analysis and Manufacturing
A writer identification system for on-line whiteboard data
Pattern Recognition
Time-Compressing Speech: ASR Transcripts Are an Effective Way to Support Gist Extraction
MLMI '08 Proceedings of the 5th international workshop on Machine Learning for Multimodal Interaction
Automatic nonverbal analysis of social interaction in small groups: A review
Image and Vision Computing
Microphone array beamforming approach to blind speech separation
MLMI'07 Proceedings of the 4th international conference on Machine learning for multimodal interaction
A game-theoretic model of metaphorical bargaining
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
SIGDIAL '10 Proceedings of the 11th Annual Meeting of the Special Interest Group on Discourse and Dialogue
Towards computer understanding of human interactions
MLMI'04 Proceedings of the First international conference on Machine Learning for Multimodal Interaction
Overlap in meetings: ASR effects and analysis by dialog factors, speakers, and collection site
MLMI'06 Proceedings of the Third international conference on Machine Learning for Multimodal Interaction
Writer identification for smart meeting room systems
DAS'06 Proceedings of the 7th international conference on Document Analysis Systems
Exploring the structure of media stream interactions for multimedia browsing
AMR'05 Proceedings of the Third international conference on Adaptive Multimedia Retrieval: user, context, and feedback
Hi-index | 0.00 |
In collaboration with colleagues at UW, OGI, IBM, and SRI, we are developing technology to process spoken language from informal meetings. The work includes a substantial data collection and transcription effort, and has required a nontrivial degree of infrastructure development. We are undertaking this because the new task area provides a significant challenge to current HLT capabilities, while offering the promise of a wide range of potential applications. In this paper, we give our vision of the task, the challenges it represents, and the current state of our development, with particular attention to automatic transcription.