The Journal of Machine Learning Research
Bayesian query-focused summarization
ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Mixtures of hierarchical topics with Pachinko allocation
Proceedings of the 24th international conference on Machine learning
Modeling online reviews with multi-grain topic models
Proceedings of the 17th international conference on World Wide Web
Generating summary keywords for emails using topics
Proceedings of the 13th international conference on Intelligent user interfaces
Modeling Topic and Role Information in Meetings Using the Hierarchical Dirichlet Process
MLMI '08 Proceedings of the 5th international workshop on Machine Learning for Multimodal Interaction
Evaluation methods for topic models
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Detecting Temporal Trends of Technical Phrases by Using Importance Indices and Linear Regression
ISMIS '09 Proceedings of the 18th International Symposium on Foundations of Intelligent Systems
Global models of document structure using latent permutations
NAACL '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Joint sentiment/topic model for sentiment analysis
Proceedings of the 18th ACM conference on Information and knowledge management
Topic tracking model for analyzing consumer purchase behavior
IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Detecting temporal patterns of technical phrases by using importance indices in a research documents
SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Content modeling using latent permutations
Journal of Artificial Intelligence Research
Text categorization based on topic model
RSKT'08 Proceedings of the 3rd international conference on Rough sets and knowledge technology
A statistical model for topic segmentation and clustering
Canadian AI'08 Proceedings of the Canadian Society for computational studies of intelligence, 21st conference on Advances in artificial intelligence
Multilingual topic models for unaligned text
UAI '09 Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence
Modeling the evolution of associated data
Data & Knowledge Engineering
Online multiscale dynamic topic models
Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Latent variable models of selectional preference
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Experts' retrieval with multiword-enhanced author topic model
SS '10 Proceedings of the NAACL HLT 2010 Workshop on Semantic Search
Exploiting conversation structure in unsupervised topic segmentation for emails
EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Finding the storyteller: automatic spoiler tagging using linguistic cues
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Video topic modelling with behavioural segmentation
Proceedings of the 1st ACM international workshop on Multimodal pervasive video analysis
Topic tracking language model for speech recognition
Computer Speech and Language
KSEM'10 Proceedings of the 4th international conference on Knowledge science, engineering and management
A new bigram-PLSA language model for speech recognition
EURASIP Journal on Advances in Signal Processing
Aspect and sentiment unification model for online review analysis
Proceedings of the fourth ACM international conference on Web search and data mining
Learning summary content units with topic modeling
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Citation author topic model in expert search
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Modeling reciprocity in social interactions with probabilistic latent space models
Natural Language Engineering
Word order matters: measuring topic coherence with lexical argument structure
Proceedings of the 11th annual international ACM/IEEE joint conference on Digital libraries
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Discovery of topically coherent sentences for extractive summarization
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
A hierarchical model of web summaries
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
Evaluating a temporal pattern detection method for finding research keys in bibliographical data
Transactions on rough sets XIV
A supervised topic transition model for detecting malicious system call sequences
Proceedings of the 2011 workshop on Knowledge discovery, modeling and simulation
Human behavior clustering for anomaly detection
Frontiers of Computer Science in China
Dynamically Modeling Semantic Dependencies in Web Forum Threads
WI-IAT '11 Proceedings of the 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Volume 01
Summarizing web forum threads based on a latent topic propagation process
Proceedings of the 20th ACM international conference on Information and knowledge management
Communications of the ACM
News thread extraction based on topical n-gram model with a background distribution
ICONIP'11 Proceedings of the 18th international conference on Neural Information Processing - Volume Part II
Sentiment-Preserving reduction for social media analysis
CIARP'11 Proceedings of the 16th Iberoamerican Congress conference on Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications
Collective context-aware topic models for entity disambiguation
Proceedings of the 21st international conference on World Wide Web
Discovering geographical topics in the twitter stream
Proceedings of the 21st international conference on World Wide Web
International Journal of Computer Vision
Generative Models for Evolutionary Clustering
ACM Transactions on Knowledge Discovery from Data (TKDD)
Video Behaviour Mining Using a Dynamic Topic Model
International Journal of Computer Vision
Mining contentions from discussions and debates
Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
A scalable distributed syntactic, semantic, and lexical language model
Computational Linguistics
Incorporating lexical priors into topic models
EACL '12 Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics
ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
Mixed membership Markov models for unsupervised conversation modeling
EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
A phrase-discovering topic model using hierarchical Pitman-Yor processes
EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Community-based classification of noun phrases in twitter
Proceedings of the 21st ACM international conference on Information and knowledge management
Automatic classification of archaeological pottery sherds
Journal on Computing and Cultural Heritage (JOCCH)
Topic model for analyzing purchase data with price information
Data Mining and Knowledge Discovery
Modeling discussion topics in interactions with a tablet reading primer
Proceedings of the 2013 international conference on Intelligent user interfaces
Intuitive Topic Discovery by Incorporating Word-Pair's Connection Into LDA
WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 01
An n-gram topic model for time-stamped documents
ECIR'13 Proceedings of the 35th European conference on Advances in Information Retrieval
On collocations and topic models
ACM Transactions on Speech and Language Processing (TSLP) - Special issue on multiword expressions: From theory to practice and use, part 2
Personalized time-aware tweets summarization
Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
An unsupervised topic segmentation model incorporating word order
Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
The bag-of-repeats representation of documents
Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
A statistical semantic language model for source code
Proceedings of the 2013 9th Joint Meeting on Foundations of Software Engineering
On handling textual errors in latent document modeling
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Activity clustering for anomaly detection
International Journal of Intelligent Information and Database Systems
Discovering different types of topics: factored topic models
IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Probabilistic topic models for sequence data
Machine Learning
Proceedings of the 7th ACM international conference on Web search and data mining
A new ROI based image retrieval system using an auxiliary Gaussian weighting scheme
Multimedia Tools and Applications
Self-help: Seeking out perplexing images for ever improving topological mapping
International Journal of Robotics Research
Topic segmentation and labeling in asynchronous conversations
Journal of Artificial Intelligence Research
A probabilistic approach to mining mobile phone data sequences
Personal and Ubiquitous Computing
Genre-Based Music Language Modeling with Latent Hierarchical Pitman-Yor Process Allocation
IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP)
Activity-based topic discovery
Web Intelligence and Agent Systems
Hi-index | 0.02 |
Some models of textual corpora employ text generation methods involving n-gram statistics, while others use latent topic variables inferred using the "bag-of-words" assumption, in which word order is ignored. Previously, these methods have not been combined. In this work, I explore a hierarchical generative probabilistic model that incorporates both n-gram statistics and latent topic variables by extending a unigram topic model to include properties of a hierarchical Dirichlet bigram language model. The model hyperparameters are inferred using a Gibbs EM algorithm. On two data sets, each of 150 documents, the new model exhibits better predictive accuracy than either a hierarchical Dirichlet bigram language model or a unigram topic model. Additionally, the inferred topics are less dominated by function words than are topics discovered using unigram statistics, potentially making them more meaningful.