A new approach to the maximum-flow problem
Journal of the ACM (JACM)
Web document clustering: a feasibility demonstration
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Fast and effective text mining using linear-time document clustering
KDD '99 Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
Probabilistic latent semantic indexing
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Deriving concept hierarchies from text
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Hierarchical classification of Web content
SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Hierarchically Classifying Documents Using Very Few Words
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Model-Based Hierarchical Clustering
UAI '00 Proceedings of the 16th Conference on Uncertainty in Artificial Intelligence
The VLDB Journal — The International Journal on Very Large Data Bases
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
The Journal of Machine Learning Research
Pattern Classification (2nd Edition)
Pattern Classification (2nd Edition)
The author-topic model for authors and documents
UAI '04 Proceedings of the 20th conference on Uncertainty in artificial intelligence
Monte Carlo Statistical Methods (Springer Texts in Statistics)
Monte Carlo Statistical Methods (Springer Texts in Statistics)
A Bayesian Hierarchical Model for Learning Natural Scene Categories
CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 2 - Volume 02
Bayesian hierarchical clustering
ICML '05 Proceedings of the 22nd international conference on Machine learning
ICML '06 Proceedings of the 23rd international conference on Machine learning
Statistical entity-topic models
Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Contextual dependencies in unsupervised word segmentation
ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Organizing the OCA: learning faceted subjects from a library of digital books
Proceedings of the 7th ACM/IEEE-CS joint conference on Digital libraries
Unsupervised prediction of citation influences
Proceedings of the 24th international conference on Machine learning
Mixed Membership Stochastic Blockmodels
The Journal of Machine Learning Research
Nearly-automated metadata hierarchy creation
HLT-NAACL-Short '04 Proceedings of HLT-NAACL 2004: Short Papers
Logical generative models for probabilistic reasoning about existence, roles and identity
AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 2
Learning concept hierarchies from text corpora using formal concept analysis
Journal of Artificial Intelligence Research
A machine learning approach to building domain-specific search engines
IJCAI'99 Proceedings of the 16th international joint conference on Artificial intelligence - Volume 2
The cluster-abstraction model: unsupervised learning of topic hierarchies from text data
IJCAI'99 Proceedings of the 16th international joint conference on Artificial intelligence - Volume 2
Approximate inference for first-order probabilistic languages
IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 1
LDA-based document models for ad-hoc retrieval
SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
A hybrid hierarchical model for multi-document summarization
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Exploiting semantic hierarchies for Flickr group
AMT'10 Proceedings of the 6th international conference on Active media technology
COMPUTE '11 Proceedings of the Fourth Annual ACM Bangalore Conference
Modeling the evolution of topics in source code histories
Proceedings of the 8th Working Conference on Mining Software Repositories
A hierarchical model of web summaries
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
Proximal Methods for Hierarchical Sparse Coding
The Journal of Machine Learning Research
Sampling table configurations for the hierarchical poisson-dirichlet process
ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part I
Graph evolution via social diffusion processes
ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part II
Browse by chunks: Topic mining and organizing on web-scale social media
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP) - Special section on ACM multimedia 2010 best paper candidates, and issue on social media
Distance Dependent Chinese Restaurant Processes
The Journal of Machine Learning Research
Communications of the ACM
A generative model for unsupervised discovery of relations and argument classes from clinical texts
EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
A simple word trigger method for social tag suggestion
EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Hierarchical generative biclustering for MicroRNA expression analysis
RECOMB'10 Proceedings of the 14th Annual international conference on Research in Computational Molecular Biology
A non-parametric visual-sense model of images--extending the cluster hypothesis beyond text
Multimedia Tools and Applications
Document hierarchies from text and links
Proceedings of the 21st international conference on World Wide Web
Discovering K web user groups with specific aspect interests
MLDM'12 Proceedings of the 8th international conference on Machine Learning and Data Mining in Pattern Recognition
Document-topic hierarchies from document graphs
Proceedings of the 21st ACM international conference on Information and knowledge management
An empirical study on developer interactions in StackOverflow
Proceedings of the 28th Annual ACM Symposium on Applied Computing
Modeling the dynamics of composite social networks
Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining
Hierarchical geographical modeling of user locations from social media posts
Proceedings of the 22nd international conference on World Wide Web
An exploration of discussion threads in social news sites: a case study of the Reddit community
Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
A novel neighborhood based document smoothing model for information retrieval
Information Retrieval
Navigating the topical structure of academic search results via the Wikipedia category network
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Classifying entities into an incomplete ontology
Proceedings of the 2013 workshop on Automated knowledge base construction
We know how you live: exploring the spectrum of urban lifestyles
Proceedings of the first ACM conference on Online social networks
Taxonomy discovery for personalized recommendation
Proceedings of the 7th ACM international conference on Web search and data mining
A hierarchical Dirichlet model for taxonomy expansion for search engines
Proceedings of the 23rd international conference on World wide web
A time-based collective factorization for topic discovery and monitoring in news
Proceedings of the 23rd international conference on World wide web
Hi-index | 0.02 |
We present the nested Chinese restaurant process (nCRP), a stochastic process that assigns probability distributions to ensembles of infinitely deep, infinitely branching trees. We show how this stochastic process can be used as a prior distribution in a Bayesian nonparametric model of document collections. Specifically, we present an application to information retrieval in which documents are modeled as paths down a random tree, and the preferential attachment dynamics of the nCRP leads to clustering of documents according to sharing of topics at multiple levels of abstraction. Given a corpus of documents, a posterior inference algorithm finds an approximation to a posterior distribution over trees, topics and allocations of words to levels of the tree. We demonstrate this algorithm on collections of scientific abstracts from several journals. This model exemplifies a recent trend in statistical machine learning—the use of Bayesian nonparametric methods to infer distributions on flexible data structures.