Disentangling chat

Authors:
Micha Elsner;Eugene Charniak
Affiliations:
Brown Laboratory for Linguistic Information Processing (BLLIP);Brown Laboratory for Linguistic Information Processing (BLLIP)
Venue:
Computational Linguistics
Year:
2010

Abstract

When multiple conversations occur simultaneously, a listener must decide which conversation each utterance is part of in order to interpret and respond to it appropriately. We refer to this task as disentanglement. We present a corpus of Internet Relay Chat dialogue in which the various conversations have been manually disentangled, and evaluate annotator reliability. We propose a graph-based clustering model for disentanglement, using lexical, timing, and discourse-based features. The model's predicted disentanglements are highly correlated with manual annotations. We conclude by discussing two extensions to the model, specificity tuning and conversation start detection, both of which are promising but do not currently yield practical improvements.
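
As a rough illustration of what a graph-based clustering model over lexical, timing, and discourse-based features could look like, here is a minimal Python sketch. It is not the authors' implementation. The abstract does not say how pairs are scored or how the graph is partitioned, so everything specific here is an assumption made for illustration: the `Utterance` fields, the hand-set weights and 30-second window in `pair_score`, and the greedy voting pass in `disentangle`.

```python
from dataclasses import dataclass


@dataclass
class Utterance:
    time: float    # seconds since the start of the log (hypothetical field)
    speaker: str
    text: str


def tokens(text: str) -> set:
    """Lowercased word set with simple punctuation stripped."""
    return {w.strip(".,:;!?") for w in text.lower().split()} - {""}


def pair_score(a: Utterance, b: Utterance) -> float:
    """Signed edge weight; > 0 suggests the same conversation.

    All weights and thresholds are invented for this sketch.
    """
    words_a, words_b = tokens(a.text), tokens(b.text)
    union = words_a | words_b
    # Lexical feature: Jaccard overlap of the two word sets.
    lexical = len(words_a & words_b) / len(union) if union else 0.0
    # Timing feature: utterances close in time are more likely related.
    timing = 1.0 if abs(a.time - b.time) <= 30 else -0.5
    # Discourse feature: one utterance names the other's speaker.
    mentions = b.speaker.lower() in words_a or a.speaker.lower() in words_b
    if mentions:
        discourse = 1.5
    elif a.speaker == b.speaker:
        discourse = 0.5
    else:
        discourse = 0.0
    return 2.0 * lexical + timing + discourse - 1.2


def disentangle(utterances: list) -> list:
    """Greedy voting: each utterance joins the existing cluster whose
    members give it the highest positive total score, else starts a new one."""
    clusters = []  # each cluster is a list of utterance indices
    labels = []
    for i, u in enumerate(utterances):
        best, best_vote = None, 0.0
        for c_id, members in enumerate(clusters):
            vote = sum(pair_score(u, utterances[j]) for j in members)
            if vote > best_vote:
                best, best_vote = c_id, vote
        if best is None:
            clusters.append([i])
            labels.append(len(clusters) - 1)
        else:
            clusters[best].append(i)
            labels.append(best)
    return labels


demo = [
    Utterance(0, "alice", "anyone know how to mount a usb drive"),
    Utterance(5, "carol", "did you see the game last night"),
    Utterance(12, "bob", "alice: which usb drive is it"),
    Utterance(20, "dave", "carol: yes great game"),
]
print(disentangle(demo))  # prints [0, 1, 0, 1]
```

In this toy log the two interleaved threads separate because the name mentions and shared words outweigh the constant penalty, while unrelated utterances that are merely close in time score slightly below zero. A real system would presumably fit such weights to annotated data rather than set them by hand.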