A Concept-Driven Algorithm for Clustering Search Results

Authors:
Stanislaw Osinski;Dawid Weiss
Affiliations:
Poznan University of Technology;Poznan University of Technology
Venue:
IEEE Intelligent Systems
Year:
2005

Citing 6
Cited 47

Reexamining the cluster hypothesis: scatter/gather on retrieval results

SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
Grouper: a dynamic clustering interface to Web search results

WWW '99 Proceedings of the eighth international conference on World Wide Web
Introduction to Modern Information Retrieval

Introduction to Modern Information Retrieval
Document clustering based on non-negative matrix factorization

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Using Linear Algebra for Intelligent Information Retrieval

Using Linear Algebra for Intelligent Information Retrieval
Carrot2 and language properties in web search results clustering

AWIC'03 Proceedings of the 1st international Atlantic web intelligence conference on Advances in web intelligence

A new algorithm for clustering search results

Data & Knowledge Engineering
Concept Level Web Search Via Semantic Clustering

ICCS '07 Proceedings of the 7th international conference on Computational Science, Part III: ICCS 2007
Web Search Results Clustering Based on a Novel Suffix Tree Structure

ATC '08 Proceedings of the 5th international conference on Autonomic and Trusted Computing
Exploiting Gene Ontology to Conceptualize Biomedical Document Collections

ASWC '08 Proceedings of the 3rd Asian Semantic Web Conference on The Semantic Web
A Concept-Driven Automatic Ontology Generation Approach for Conceptualization of Document Corpora

WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
Clustering the tagged web

Proceedings of the Second ACM International Conference on Web Search and Data Mining
Exploiting noun phrases and semantic relationships for text document clustering

Information Sciences: an International Journal
A survey of Web clustering engines

ACM Computing Surveys (CSUR)
Enhancing cluster labeling using wikipedia

Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Full-Subtopic Retrieval with Keyphrase-Based Search Results Clustering

WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
Exploiting corpus-related ontologies for conceptualizing document corpora

Journal of the American Society for Information Science and Technology
Grouping Results of Queries to Ontological Knowledge Bases by Conceptual Clustering

ICCCI '09 Proceedings of the 1st International Conference on Computational Collective Intelligence. Semantic Web, Social Networks and Multiagent Systems
Query Results Clustering by Extending SPARQL with CLUSTER BY

OTM '09 Proceedings of the Confederated International Workshops and Posters on On the Move to Meaningful Internet Systems: ADI, CAMS, EI2N, ISDE, IWSSA, MONET, OnToContent, ODIS, ORM, OTM Academy, SWWS, SEMELS, Beyond SAWSDL, and COMBEK 2009
Semantic-Linguistic Feature Vectors for Search: Unsupervised Construction and Experimental Validation

ASWC '09 Proceedings of the 4th Asian Conference on The Semantic Web
GOClonto: An ontological clustering approach for conceptualizing PubMed abstracts

Journal of Biomedical Informatics
Noodles: a clustering engine for the web

ICWE'07 Proceedings of the 7th international conference on Web engineering
Web snippets clustering based on an improved suffix tree algorithm

FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 1
A knowledge-driven approach to biomedical document conceptualization

Artificial Intelligence in Medicine
Optimal meta search results clustering

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Analysis of structural relationships for hierarchical cluster labeling

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Inducing word senses to improve web search result clustering

EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Clustering Web video search results based on integration of multiple features

World Wide Web
Comprehensible and accurate cluster labels in text clustering

Large Scale Semantic Access to Content (Text, Image, Video, and Sound)
QuestionHolic: Hot topic discovery and trend analysis in community question answering systems

Expert Systems with Applications: An International Journal
Using a new relational concept to improve the clustering performance of search engines

Information Processing and Management: an International Journal
Clustering web search results with maximum spanning trees

AI*IA'11 Proceedings of the 12th international conference on Artificial intelligence around man and beyond
A framework for personalized and collaborative clustering of search results

Proceedings of the 20th ACM international conference on Information and knowledge management
Text mining for efficient search and assisted creation of clinical trials

Proceedings of the ACM fifth international workshop on Data and text mining in biomedical informatics
A Survey of Automatic Query Expansion in Information Retrieval

ACM Computing Surveys (CSUR)
Presenting search results of meeting documents

Proceedings of the 23rd Australian Computer-Human Interaction Conference
ASCOT: assisting search and creation of clinical trials

Proceedings of the 2nd ACM SIGHIT International Health Informatics Symposium
Topical clustering of search results

Proceedings of the fifth ACM international conference on Web search and data mining
Evaluating subtopic retrieval methods: Clustering versus diversification of search results

Information Processing and Management: an International Journal
Discovering and analyzing multi-granular web search results

FQAS'11 Proceedings of the 9th international conference on Flexible Query Answering Systems
Building a term suggestion and ranking system based on a probabilistic analysis model and a semantic analysis graph

Decision Support Systems
Disambiguated query suggestions and personalized content-similarity and novelty ranking of clustered results to optimize web searches

Information Processing and Management: an International Journal
Improving quality of search results clustering with approximate matrix factorisations

ECIR'06 Proceedings of the 28th European conference on Advances in Information Retrieval
Improving suffix tree clustering with new ranking and similarity measures

ADMA'11 Proceedings of the 7th international conference on Advanced Data Mining and Applications - Volume Part II
Association rule centric clustering of web search results

MIWAI'11 Proceedings of the 5th international conference on Multi-Disciplinary Trends in Artificial Intelligence
A transduction-based approach to fuzzy clustering, relevance ranking and cluster label generation on web search results

Journal of Intelligent Information Systems
Cluster labeling for multilingual scatter/gather using comparable corpora

ECIR'12 Proceedings of the 34th European conference on Advances in Information Retrieval
Selecting labels for news document clusters

NLDB'07 Proceedings of the 12th international conference on Applications of Natural Language to Information Systems
Disambiguating Implicit Temporal Queries by Clustering Top Relevant Dates in Web Snippets

WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 01
Improving feature location practice with multi-faceted interactive exploration

Proceedings of the 2013 International Conference on Software Engineering
Mining subtopics from text fragments for a web query

Information Retrieval
Exploiting DBpedia for web search results clustering

Proceedings of the 2013 workshop on Automated knowledge base construction
Online image search result grouping with MapReduce-based image clustering and graph construction for large-scale photos

Journal of Visual Communication and Image Representation

Quantified Score

Hi-index	0.00

Visualization

Abstract

Most search engines return search results in a single-dimensional ranking of relevance to a user's query. Although this method works well for specific information needs, it often fails when users submit broad, ambiguous queries, seeking a general cross-section of topics related to the query. Search result clustering has successfully served this purpose in both commercial and scientific systems. The proposed method separates search results (document references) into meaningful groups. Unlike previous clustering techniques that use some proximity measure between documents, this method tries to discover meaningful phrases that can become cluster descriptions and only then assign documents to those phrases to form clusters. This idea is the core of the Lingo algorithm, which combines common phrase discovery and latent semantic indexing techniques. Clusters created by Lingo are compared to those created by the classic suffix-tree clustering algorithm.