Dialogue act modeling for automatic tagging and recognition of conversational speech
Computational Linguistics
A maximum entropy approach to identifying sentence boundaries
ANLC '97 Proceedings of the fifth conference on Applied natural language processing
Modeling lexical tones for mandarin large vocabulary continuous speech recognition
Modeling lexical tones for mandarin large vocabulary continuous speech recognition
LIBLINEAR: A Library for Large Linear Classification
The Journal of Machine Learning Research
Restoring punctuation and capitalization in transcribed speech
ICASSP '09 Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing
Domain adaptation with structural correspondence learning
EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Cascaded model adaptation for dialog act segmentation and tagging
Computer Speech and Language
Semi-supervised speech act recognition in emails and forums
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3 - Volume 3
Domain adaptation with unlabeled data for dialog act tagging
DANLP 2010 Proceedings of the 2010 Workshop on Domain Adaptation for Natural Language Processing
Analysis of interrogatives in different domains
Proceedings of the Third COST 2102 international training school conference on Toward autonomous, adaptive, and context-aware multimodal interfaces: theoretical and practical issues
Enriching speech recognition with automatic detection of sentence boundaries and disfluencies
IEEE Transactions on Audio, Speech, and Language Processing
Hi-index | 0.00 |
We investigate the use of textual Internet conversations for detecting questions in spoken conversations. We compare the text-trained model with models trained on manually-labeled, domain-matched spoken utterances with and without prosodic features. Overall, the text-trained model achieves over 90% of the performance (measured in Area Under the Curve) of the domain-matched model including prosodic features, but does especially poorly on declarative questions. We describe efforts to utilize unlabeled spoken utterances and prosodic features via domain adaptation.