Term-weighting approaches in automatic text retrieval
Information Processing and Management: an International Journal
Recent trends in hierarchic document clustering: a critical review
Information Processing and Management: an International Journal
Reexamining the cluster hypothesis: scatter/gather on retrieval results
SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
Web document clustering: a feasibility demonstration
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Distributional clustering of words for text classification
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Fast and effective text mining using linear-time document clustering
KDD '99 Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
Deriving concept hierarchies from text
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Grouper: a dynamic clustering interface to Web search results
WWW '99 Proceedings of the eighth international conference on World Wide Web
Document clustering using word clusters via the information bottleneck method
SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Agglomerative clustering of a search engine query log
Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining
Finding topic words for hierarchical summarization
Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Query clustering using user logs
ACM Transactions on Information Systems (TOIS)
Inferring hierarchical descriptions
Proceedings of the eleventh international conference on Information and knowledge management
A Min-max Cut Algorithm for Graph Partitioning and Data Clustering
ICDM '01 Proceedings of the 2001 IEEE International Conference on Data Mining
Model-Based Hierarchical Clustering
UAI '00 Proceedings of the 16th Conference on Uncertainty in Artificial Intelligence
Enhanced word clustering for hierarchical text classification
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Mining topic-specific concepts and definitions on the web
WWW '03 Proceedings of the 12th international conference on World Wide Web
Towards Automatic Generation of Query Taxonomy: A Hierarchical Query Clustering Approach
ICDM '02 Proceedings of the 2002 IEEE International Conference on Data Mining
Topic hierarchy generation via linear discriminant projection
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
The TaxGen Framework: Automating the Generation of a Taxonomy for a Large Document Collection
HICSS '99 Proceedings of the Thirty-Second Annual Hawaii International Conference on System Sciences-Volume 2 - Volume 2
Word-sense disambiguation using statistical methods
ACL '91 Proceedings of the 29th annual meeting on Association for Computational Linguistics
Distributional clustering of English words
ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
Unsupervised learning of soft patterns for generating definitions from online news
Proceedings of the 13th international conference on World Wide Web
Learning to cluster web search results
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Editorial: special issue on web content mining
ACM SIGKDD Explorations Newsletter
Automatically labeling hierarchical clusters
dg.o '06 Proceedings of the 2006 international conference on Digital government research
An experimental study on automatically labeling hierarchical clusters using statistical features
SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Acclimatizing Taxonomic Semantics for Hierarchical Content Classification
Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Topic taxonomy adaptation for group profiling
ACM Transactions on Knowledge Discovery from Data (TKDD)
A survey on session detection methods in query logs and a proposal for future evaluation
Information Sciences: an International Journal
Integrating knowledge flow mining and collaborative filtering to support document recommendation
Journal of Systems and Software
Novel labeling strategies for hierarchical representation of multidimensional data analysis results
AIA '08 Proceedings of the 26th IASTED International Conference on Artificial Intelligence and Applications
MICAI'07 Proceedings of the artificial intelligence 6th Mexican international conference on Advances in artificial intelligence
Semantic based real-time clustering for PubMed literatures
DS'07 Proceedings of the 10th international conference on Discovery science
International Journal of Approximate Reasoning
Organizing query completions for web search
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Exploiting semantic hierarchies for Flickr group
AMT'10 Proceedings of the 6th international conference on Active media technology
Query session detection as a cascade
Proceedings of the 20th ACM international conference on Information and knowledge management
Behavior-driven clustering of queries into topics
Proceedings of the 20th ACM international conference on Information and knowledge management
Concept hierarchy construction by combining spectral clustering and subsumption estimation
WISE'06 Proceedings of the 7th international conference on Web Information Systems
ISCSLP'06 Proceedings of the 5th international conference on Chinese Spoken Language Processing
Discovering a term taxonomy from term similarities using principal component analysis
EWMF'05/KDO'05 Proceedings of the 2005 joint international conference on Semantics, Web and Mining
Mining and supporting task-stage knowledge: a hierarchical clustering technique
PAKM'06 Proceedings of the 6th international conference on Practical Aspects of Knowledge Management
Category hierarchy maintenance: a data-driven approach
SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Mining semantic relations between research areas
ISWC'12 Proceedings of the 11th international conference on The Semantic Web - Volume Part I
A phrase mining framework for recursive construction of a topical hierarchy
Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining
From search session detection to search mission detection
Proceedings of the 10th Conference on Open Research Areas in Information Retrieval
Content coverage maximization on word networks for hierarchical topic summarization
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Hi-index | 0.00 |
It is crucial in many information systems to organize short text segments, such as keywords in documents and queries from users, into a well-formed topic hierarchy. In this paper, we address the problem of generating topic hierarchies for diverse text segments with a general and practical approach that uses the Web as an additional knowledge source. Unlike long documents, short text segments typically do not contain enough information to extract reliable features. This work investigates the possibilities of using highly ranked search-result snippets to enrich the representation of text segments. A hierarchical clustering algorithm is then applied to create the hierarchical topic structure of text segments. Different from traditional clustering algorithms, which tend to produce cluster hierarchies with a very unnatural shape, the approach tries to produce a more natural and comprehensive hierarchy. Extensive experiments were conducted on different domains of text segments. The obtained results have shown the potential of the proposed approach, which is believed able to benefit many information systems.