A high-quality hierarchical organization of the concepts in a dataset at different levels of granularity has many valuable applications, such as search, summarization, and content browsing. In this paper, we propose an algorithm for recursively constructing a hierarchy of topics from a collection of content-representative documents. We characterize each topic in the hierarchy by an integrated ranked list of mixed-length phrases. Our mining framework is based on a phrase-centric view for clustering, extracting, and ranking topical phrases. Experiments with datasets from three different domains illustrate our ability to generate hierarchies of high-quality topics represented by meaningful phrases.
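To make the shape of such a hierarchy concrete, the following is a minimal sketch and not the paper's algorithm. It only illustrates the two ideas named in the abstract: a topic tree built recursively, and one ranked list per topic that mixes phrases of different lengths. The names `Topic`, `rank_phrases`, `split_docs`, and `build_hierarchy` are hypothetical, and the frequency-based ranking and first-word grouping are deliberately naive stand-ins for the actual ranking and clustering steps, which the abstract does not specify.

```python
from collections import Counter
from dataclasses import dataclass, field


@dataclass
class Topic:
    """A node in the hierarchy: ranked phrases plus child subtopics."""
    phrases: list                       # (phrase, score) pairs, best first
    children: list = field(default_factory=list)


def rank_phrases(docs, top_k=3):
    """Assumed stand-in ranking: count 1-grams and 2-grams together,
    so phrases of different lengths share a single ranked list."""
    counts = Counter()
    for doc in docs:
        words = doc.lower().split()
        counts.update(words)
        counts.update(" ".join(pair) for pair in zip(words, words[1:]))
    # Sort by count (descending), then alphabetically for a stable order.
    return sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))[:top_k]


def split_docs(docs):
    """Assumed stand-in clustering: group documents by their first word."""
    groups = {}
    for doc in docs:
        words = doc.lower().split()
        if words:                       # skip empty documents
            groups.setdefault(words[0], []).append(doc)
    return list(groups.values())


def build_hierarchy(docs, depth=2):
    """Recursively build a topic tree with at most `depth` levels."""
    node = Topic(phrases=rank_phrases(docs))
    if depth > 1:
        groups = split_docs(docs)
        if len(groups) > 1:             # stop when the split is trivial
            node.children = [build_hierarchy(g, depth - 1) for g in groups]
    return node
```

Calling `build_hierarchy(["data mining", "data mining rules", "topic models"])` yields a root topic with two subtopics, each carrying its own ranked phrase list. Any real clustering or ranking method can be substituted for the two stand-in functions without changing the recursive structure.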