Finding topic words for hierarchical summarization
Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
The Journal of Machine Learning Research
LDA-based document models for ad-hoc retrieval
SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Topic sentiment mixture: modeling facets and opinions in weblogs
Proceedings of the 16th international conference on World Wide Web
Subject metadata enrichment using statistical topic models
Proceedings of the 7th ACM/IEEE-CS joint conference on Digital libraries
Mixtures of hierarchical topics with Pachinko allocation
Proceedings of the 24th international conference on Machine learning
Automatic labeling of multinomial topic models
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Mining geographic knowledge using location aware topic model
Proceedings of the 4th ACM workshop on Geographical information retrieval
Topic modeling with network regularization
Proceedings of the 17th international conference on World Wide Web
Modeling online reviews with multi-grain topic models
Proceedings of the 17th international conference on World Wide Web
Opinion integration through semi-supervised topic modeling
Proceedings of the 17th international conference on World Wide Web
Latent dirichlet allocation based multi-document summarization
Proceedings of the second workshop on Analytics for noisy unstructured text data
Joint latent topic models for text and citations
Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Fast collapsed gibbs sampling for latent dirichlet allocation
Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Evaluating topic models for information retrieval
Proceedings of the 17th ACM conference on Information and knowledge management
Mining common topics from multiple asynchronous text streams
Proceedings of the Second ACM International Conference on Web Search and Data Mining
Statistical Language Models for Information Retrieval A Critical Review
Foundations and Trends in Information Retrieval
A density-based method for adaptive LDA model selection
Neurocomputing
Rated aspect summarization of short comments
Proceedings of the 18th international conference on World wide web
A Comparative Study of Utilizing Topic Models for Information Retrieval
ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
Incorporating domain knowledge into topic modeling via Dirichlet Forest priors
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Probabilistic dyadic data analysis with local and global consistency
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Accounting for burstiness in topic models
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Dynamic mixed membership blockmodel for evolving networks
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Evaluation methods for topic models
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Refined experts: improving classification in large taxonomies
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Studying the history of ideas using topic models
EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Identifying the Original Contribution of a Document via Language Modeling
ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part II
A Generic Approach to Topic Models
ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part I
Probabilistic community discovery using hierarchical latent Gaussian mixture model
AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 1
Topic and role discovery in social networks with experiments on enron and academic email
Journal of Artificial Intelligence Research
Joint sentiment/topic model for sentiment analysis
Proceedings of the 18th ACM conference on Information and knowledge management
Learning author-topic models from text corpora
ACM Transactions on Information Systems (TOIS)
Multi-grain hierarchical topic extraction algorithm for text mining
Expert Systems with Applications: An International Journal
Latent variable models of concept-attribute attachment
ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2 - Volume 2
Estimating Likelihoods for Topic Models
ACML '09 Proceedings of the 1st Asian Conference on Machine Learning: Advances in Machine Learning
Labeled LDA: a supervised topic model for credit attribution in multi-labeled corpora
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1 - Volume 1
Cross-cultural analysis of blogs and forums with mixed-collection topic models
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3 - Volume 3
Learning subsumption hierarchies of ontology concepts from texts
Web Intelligence and Agent Systems
A Survey of Statistical Network Models
Foundations and Trends® in Machine Learning
Data clustering: 50 years beyond K-means
Pattern Recognition Letters
Distributed Algorithms for Topic Models
The Journal of Machine Learning Research
Building taxonomy of web search intents for name entity queries
Proceedings of the 19th international conference on World wide web
Smoothing LDA model for text categorization
AIRS'08 Proceedings of the 4th Asia information retrieval conference on Information retrieval technology
A statistical model for topic segmentation and clustering
Canadian AI'08 Proceedings of the Canadian Society for computational studies of intelligence, 21st conference on Advances in artificial intelligence
Community-based ranking of the social web
Proceedings of the 21st ACM conference on Hypertext and hypermedia
Variational Bayes for generic topic models
KI'09 Proceedings of the 32nd annual German conference on Advances in artificial intelligence
Cross-lingual latent topic extraction
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Latent interest-topic model: finding the causal relationships behind dyadic data
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
OpinionIt: a text mining system for cross-lingual opinion analysis
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Semantic multi-grain mixture topic model for text analysis
Expert Systems with Applications: An International Journal
Trend analysis model: trend consists of temporal words, topics, and timestamps
Proceedings of the fourth ACM international conference on Web search and data mining
Text mining for automatic image tagging
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
PLDA+: Parallel latent dirichlet allocation with data placement and pipeline processing
ACM Transactions on Intelligent Systems and Technology (TIST)
Concept-Based Information Retrieval Using Explicit Semantic Analysis
ACM Transactions on Information Systems (TOIS)
Ontology population and enrichment: state of the art
Knowledge-driven multimedia information extraction and ontology evolution
Discovery of topically coherent sentences for extractive summarization
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Partially labeled topic models for interpretable text mining
Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Latent topic feedback for information retrieval
Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Entity disambiguation with hierarchical topic models
Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Typology of mixed-membership models: towards a design method
ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part II
A multi-collection latent topic model for federated search
Information Retrieval
Extracting multi-dimensional relations: a generative model of groups of entities in a corpus
Proceedings of the 20th ACM international conference on Information and knowledge management
Non-Parametric Estimation of Topic Hierarchies from Texts with Hierarchical Dirichlet Processes
The Journal of Machine Learning Research
Communications of the ACM
COLBERT: a scoring based graphical model for expert identification
SBP'10 Proceedings of the Third international conference on Social Computing, Behavioral Modeling, and Prediction
On finding the natural number of topics with latent dirichlet allocation: some observations
PAKDD'10 Proceedings of the 14th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part I
Integer linear programming for Constrained Multi-Aspect Committee Review Assignment
Information Processing and Management: an International Journal
HotDigg: finding recent hot topics from digg
DASFAA'12 Proceedings of the 17th international conference on Database Systems for Advanced Applications - Volume Part I
Discovering K web user groups with specific aspect interests
MLDM'12 Proceedings of the 8th international conference on Machine Learning and Data Mining in Pattern Recognition
Hierarchical classification of web documents by stratified discriminant analysis
IRFC'12 Proceedings of the 5th conference on Multidisciplinary Information Retrieval
Apples to oranges: evaluating image annotations from natural language processing systems
NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Shared components topic models
NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
SSHLDA: a semi-supervised hierarchical topic model
EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
The generalized dirichlet distribution in enhanced topic detection
Proceedings of the 21st ACM international conference on Information and knowledge management
Modeling topic hierarchies with the recursive chinese restaurant process
Proceedings of the 21st ACM international conference on Information and knowledge management
Supporting factual statements with evidence from the web
Proceedings of the 21st ACM international conference on Information and knowledge management
Hierarchical topic integration through semi-supervised hierarchical topic modeling
Proceedings of the 21st ACM international conference on Information and knowledge management
Theme chronicle model: chronicle consists of timestamp and topical words over each theme
Proceedings of the 21st ACM international conference on Information and knowledge management
PKAW'12 Proceedings of the 12th Pacific Rim conference on Knowledge Management and Acquisition for Intelligent Systems
Representations for multi-document event clustering
Data Mining and Knowledge Discovery
Unsupervised graph-based topic labelling using dbpedia
Proceedings of the sixth ACM international conference on Web search and data mining
Transforming graph data for statistical relational learning
Journal of Artificial Intelligence Research
Improving ESA with document similarity
ECIR'13 Proceedings of the 35th European conference on Advances in Information Retrieval
A study on query expansion based on topic distributions of retrieved documents
CICLing'13 Proceedings of the 14th international conference on Computational Linguistics and Intelligent Text Processing - Volume 2
Transfer learning using a nonparametric sparse topic model
Neurocomputing
An unsupervised topic segmentation model incorporating word order
Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
A phrase mining framework for recursive construction of a topical hierarchy
Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining
DIGTOBI: a recommendation system for Digg articles using probabilistic modeling
Proceedings of the 22nd international conference on World Wide Web
A graph-based topic extraction method enabling simple interactive customization
Proceedings of the 2013 ACM symposium on Document engineering
Content coverage maximization on word networks for hierarchical topic summarization
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Navigating the topical structure of academic search results via the Wikipedia category network
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Modeling latent topic interactions using quantum interference for information retrieval
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
A PAM-based ontology concept and hierarchy learning method
Journal of Information Science
Latent word context model for information retrieval
Information Retrieval
Hi-index | 0.02 |
Latent Dirichlet allocation (LDA) and other related topic models are increasingly popular tools for summarization and manifold discovery in discrete data. However, LDA does not capture correlations between topics. In this paper, we introduce the pachinko allocation model (PAM), which captures arbitrary, nested, and possibly sparse correlations between topics using a directed acyclic graph (DAG). The leaves of the DAG represent individual words in the vocabulary, while each interior node represents a correlation among its children, which may be words or other interior nodes (topics). PAM provides a flexible alternative to recent work by Blei and Lafferty (2006), which captures correlations only between pairs of topics. Using text data from newsgroups, historic NIPS proceedings and other research paper corpora, we show improved performance of PAM in document classification, likelihood of held-out data, the ability to support finer-grained topics, and topical keyword coherence.