Suffix arrays: a new method for on-line string searches
SIAM Journal on Computing
Reexamining the cluster hypothesis: scatter/gather on retrieval results
SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
Web document clustering: a feasibility demonstration
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
The use of MMR, diversity-based reranking for reordering documents and producing summaries
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Bringing order to the Web: automatically categorizing search results
Proceedings of the SIGCHI conference on Human Factors in Computing Systems
An information-theoretic approach to automatic query expansion
ACM Transactions on Information Systems (TOIS)
Cumulated gain-based evaluation of IR techniques
ACM Transactions on Information Systems (TOIS)
Beyond independent relevance: methods and evaluation metrics for subtopic retrieval
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Cluster-based retrieval using language models
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
A personalized search engine based on web-snippet hierarchical clustering
WWW '05 Special interest tracks and posters of the 14th international conference on World Wide Web
A divide-and-merge methodology for clustering
Proceedings of the twenty-fourth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
A Concept-Driven Algorithm for Clustering Search Results
IEEE Intelligent Systems
Minimal document set retrieval
Proceedings of the 14th ACM international conference on Information and knowledge management
Improving personalized web search using result diversification
SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Web searching on the Vivisimo search engine
Journal of the American Society for Information Science and Technology
Graph Visualization Techniques for Web Clustering Engines
IEEE Transactions on Visualization and Computer Graphics
Novelty and topicality in interactive information retrieval
Journal of the American Society for Information Science and Technology
Determining the informational, navigational, and transactional intent of Web queries
Information Processing and Management: an International Journal
Incremental cluster-based retrieval using compressed cluster-skipping inverted files
ACM Transactions on Information Systems (TOIS)
Novelty and diversity in information retrieval evaluation
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Proceedings of the Second ACM International Conference on Web Search and Data Mining
A survey of Web clustering engines
ACM Computing Surveys (CSUR)
Mobile information retrieval with search results clustering: Prototypes and evaluations
Journal of the American Society for Information Science and Technology
Portfolio theory of information retrieval
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Full-Subtopic Retrieval with Keyphrase-Based Search Results Clustering
WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
Usage based effectiveness measures: monitoring application performance in information retrieval
Proceedings of the 18th ACM conference on Information and knowledge management
Jointly optimising relevance and diversity in image retrieval
Proceedings of the ACM International Conference on Image and Video Retrieval
A Scoring Function for Retrieving Photo Sets with Broad Topic Coverage
NCM '09 Proceedings of the 2009 Fifth International Joint Conference on INC, IMS and IDC
Using Kullback-Leibler distance for text categorization
ECIR'03 Proceedings of the 25th European conference on IR research
Optimal meta search results clustering
Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Inducing word senses to improve web search result clustering
EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Explicit search result diversification through sub-queries
ECIR'2010 Proceedings of the 32nd European conference on Advances in Information Retrieval
Using the quantum probability ranking principle to rank interdependent documents
ECIR'2010 Proceedings of the 32nd European conference on Advances in Information Retrieval
Hi-index | 0.00 |
To address the inability of current ranking systems to support subtopic retrieval, two main post-processing techniques of search results have been investigated: clustering and diversification. In this paper we present a comparative study of their performance, using a set of complementary evaluation measures that can be applied to both partitions and ranked lists, and two specialized test collections focusing on broad and ambiguous queries, respectively. The main finding of our experiments is that diversification of top hits is more useful for quick coverage of distinct subtopics whereas clustering is better for full retrieval of single subtopics, with a better balance in performance achieved through generating multiple subsets of diverse search results. We also found that there is little scope for improvement over the search engine baseline unless we are interested in strict full-subtopic retrieval, and that search results clustering methods do not perform well on queries with low divergence subtopics, mainly due to the difficulty of generating discriminative cluster labels.