Normalized Cuts and Image Segmentation
IEEE Transactions on Pattern Analysis and Machine Intelligence
Language and the Internet
Topic Detection and Tracking: Event-Based Information Organization
Topic Detection and Tracking: Event-Based Information Organization
The Journal of Machine Learning Research
TextTiling: segmenting text into multi-paragraph subtopic passages
Computational Linguistics
Advances in domain independent linear text segmentation
NAACL 2000 Proceedings of the 1st North American chapter of the Association for Computational Linguistics conference
Discourse segmentation of multi-party conversation
ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
Topic themes for multi-document summarization
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Topic modeling: beyond bag-of-words
ICML '06 Proceedings of the 23rd international conference on Machine learning
Where's the "party" in "multi-party"?: analyzing the structure of small-group sociable talk
CSCW '06 Proceedings of the 2006 20th anniversary conference on Computer supported cooperative work
Unsupervised topic modelling for multi-party spoken discourse
ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Minimum cut model for spoken lecture segmentation
ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Summarizing email conversations with clue words
Proceedings of the 16th international conference on World Wide Web
Generating summary keywords for emails using topics
Proceedings of the 13th international conference on Intelligent user interfaces
Incorporating domain knowledge into topic modeling via Dirichlet Forest priors
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Topic segmentation algorithms for text summarization and passage retrieval: an exhaustive evaluation
AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 2
Global models of document structure using latent permutations
NAACL '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Exploiting conversational features to detect high-quality blog comments
Canadian AI'11 Proceedings of the 24th Canadian conference on Advances in artificial intelligence
Unsupervised modeling of dialog acts in asynchronous conversations
IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
Using the omega index for evaluating abstractive community detection
Proceedings of Workshop on Evaluation Metrics and System Comparison for Automatic Summarization
A learning approach for email conversation thread reconstruction
Journal of Information Science
Topic segmentation and labeling in asynchronous conversations
Journal of Artificial Intelligence Research
Hi-index | 0.00 |
This work concerns automatic topic segmentation of email conversations. We present a corpus of email threads manually annotated with topics, and evaluate annotator reliability. To our knowledge, this is the first such email corpus. We show how the existing topic segmentation models (i.e., Lexical Chain Segmenter (LCSeg) and Latent Dirichlet Allocation (LDA)) which are solely based on lexical information, can be applied to emails. By pointing out where these methods fail and what any desired model should consider, we propose two novel extensions of the models that not only use lexical information but also exploit finer level conversation structure in a principled way. Empirical evaluation shows that LCSeg is a better model than LDA for segmenting an email thread into topical clusters and incorporating conversation structure into these models improves the performance significantly.