A New Web Search Result Clustering based on True Common Phrase Label Discovery

Authors:
Jongkol Janruang;Worapoj Kreesuradej
Affiliations:
King Mongkut's Institute of Technology Ladkrabang Bankok, 15320 Thailand;King Mongkut's Institute of Technology Ladkrabang Bankok, 15320 Thailand
Venue:
CIMCA '06 Proceedings of the International Conference on Computational Inteligence for Modelling Control and Automation and International Conference on Intelligent Agents Web Technologies and International Commerce
Year:
2006

Citing 0
Cited 5

Web Search Results Clustering Based on a Novel Suffix Tree Structure

ATC '08 Proceedings of the 5th international conference on Autonomic and Trusted Computing
STC+ and NM-STC: Two Novel Online Results Clustering Methods for Web Searching

WISE '09 Proceedings of the 10th International Conference on Web Information Systems Engineering
Exploratory web searching with dynamic taxonomies and results clustering

ECDL'09 Proceedings of the 13th European conference on Research and advanced technology for digital libraries
A transduction-based approach to fuzzy clustering, relevance ranking and cluster label generation on web search results

Journal of Intelligent Information Systems
Result disambiguation in web people search

ECIR'12 Proceedings of the 34th European conference on Advances in Information Retrieval

Quantified Score

Hi-index	0.00

Visualization

Abstract

Web search results clustering are navigator for users to search results. Therefore the correct cluster label is important which has been index the set of web document. Suffix Tree Clustering (STC) is fast automatically clustering and labeling. However, STC is inadequate since they generate interrupted cluster label due to using n-gram technique. In this paper, we propose an approach for web search results clustering and labeling based on a new suffix tree data structure, a new base cluster combining algorithm with a new partial phase join operation. The algorithm for constructing the data structure is an incremental and a linear time algorithm. Thus, the proposed approach is suitable for on-the-fly the web search results clustering and labeling cluster. The proposed approach provides more readable and true common phrase of web document cluster than conventional web search result clustering. Experimental results also show that the proposed approach has better performance than that of conventional web search result clustering.