Statistical Models for Text Segmentation
Machine Learning - Special issue on natural language learning
Data mining: practical machine learning tools and techniques with Java implementations
Data mining: practical machine learning tools and techniques with Java implementations
Transcriber: Development and use of a tool for assisting speech corpora production
Speech Communication - Special issue on speech annotation and corpus tools
Programming for Corpus Linguistics
Programming for Corpus Linguistics
Xed: A New Tool for eXtracting Hidden Structures from Electronic Documents
DIAL '04 Proceedings of the First International Workshop on Document Image Analysis for Libraries (DIAL'04)
Dialogue act modeling for automatic tagging and recognition of conversational speech
Computational Linguistics
The reliability of a dialogue structure coding scheme
Computational Linguistics
Advances in domain independent linear text segmentation
NAACL 2000 Proceedings of the 1st North American chapter of the Association for Computational Linguistics conference
Discourse segmentation of multi-party conversation
ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
Word distributions for thematic segmentation in a support vector machine approach
CoNLL-X '06 Proceedings of the Tenth Conference on Computational Natural Language Learning
Towards an objective test for meeting browsers: the BET4TQB pilot experiment
MLMI'07 Proceedings of the 4th international conference on Machine learning for multimodal interaction
The CALO meeting assistant system
IEEE Transactions on Audio, Speech, and Language Processing
Hi-index | 0.00 |
This paper presents a shallow dialogue analysis model, aimed at human-human dialogues in the context of staff or business meetings. Four components of the model are defined, and several machine learning techniques are used to extract features from dialogue transcripts: maximum entropy classifiers for dialogue acts, latent semantic analysis for topic segmentation, or decision tree classifiers for discourse markers. A rule-based approach is proposed for solving cross-modal references to meeting documents. The methods are trained and evaluated thanks to a common data set and annotation format. The integration of the components into an automated shallow dialogue parser opens the way to multimodal meeting processing and retrieval applications.