Mixed membership Markov models for unsupervised conversation modeling

Authors:
Michael J. Paul
Affiliations:
Johns Hopkins University, Baltimore, MD
Venue:
EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Year:
2012

Citing 18
Cited 0

Factorial Hidden Markov Models

Machine Learning - Special issue on learning with probabilistic representations
Latent dirichlet allocation

The Journal of Machine Learning Research
Dialogue act modeling for automatic tagging and recognition of conversational speech

Computational Linguistics
Discourse processing of dialogues with multiple threads

ACL '95 Proceedings of the 33rd annual meeting on Association for Computational Linguistics
Topic modeling: beyond bag-of-words

ICML '06 Proceedings of the 23rd international conference on Machine learning
Learning the structure of task-driven human-human dialogs

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
It pays to be picky: an evaluation of thread retrieval in online forums

Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Relationship identification for social network discovery

AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 1
Global models of document structure using latent permutations

NAACL '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Online community search using thread structure

Proceedings of the 18th ACM conference on Information and knowledge management
Unsupervised modeling of Twitter conversations

HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Tagging and linking web forum posts

CoNLL '10 Proceedings of the Fourteenth Conference on Computational Natural Language Learning
Sequential Latent Dirichlet Allocation: Discover Underlying Topic Structures within a Document

ICDM '10 Proceedings of the 2010 IEEE International Conference on Data Mining
Structural topic model for latent topical structure analysis

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Learning online discussion structures by conditional random fields

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Predicting thread discourse structure over technical web forums

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Classifying sentences as speech acts in message board posts

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Unsupervised modeling of dialog acts in asynchronous conversations

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three

Quantified Score

Hi-index	0.00

Visualization

Abstract

Recent work has explored the use of hidden Markov models for unsupervised discourse and conversation modeling, where each segment or block of text such as a message in a conversation is associated with a hidden state in a sequence. We extend this approach to allow each block of text to be a mixture of multiple classes. Under our model, the probability of a class in a text block is a log-linear function of the classes in the previous block. We show that this model performs well at predictive tasks on two conversation data sets, improving thread reconstruction accuracy by up to 15 percentage points over a standard HMM. Additionally, we show quantitatively that the induced word clusters correspond to speech acts more closely than baseline models.