Similarity-based clustering of Web transactions

Authors:
Giuseppe Manco;Riccardo Ortale;Domenico Saccà
Affiliations:
ICAR-CNR, Rende (CS), Italy;University of Calabria, Rende (CS), Italy;University of Calabria, Rende (CS), Italy
Venue:
Proceedings of the 2003 ACM symposium on Applied computing
Year:
2003

Citing 10
Cited 3

Social information filtering: algorithms for automating “word of mouth”

CHI '95 Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
GroupLens: applying collaborative filtering to Usenet news

Communications of the ACM
A Fast and High Quality Multilevel Scheme for Partitioning Irregular Graphs

SIAM Journal on Scientific Computing
An algorithmic framework for performing collaborative filtering

Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
ROCK: a robust clustering algorithm for categorical attributes

Information Systems
Introduction to Modern Information Retrieval

Introduction to Modern Information Retrieval
Discovery and Evaluation of Aggregate Usage Profiles for Web Personalization

Data Mining and Knowledge Discovery
Clustering Transactional Data

PKDD '02 Proceedings of the 6th European Conference on Principles of Data Mining and Knowledge Discovery
Creating Adaptive Web Sites Through Usage-Based Clustering of URLs

KDEX '99 Proceedings of the 1999 Workshop on Knowledge and Data Engineering Exchange
Letizia: an agent that assists web browsing

IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 1

Clustering techniques utilized in web usage mining

AIKED'06 Proceedings of the 5th WSEAS International Conference on Artificial Intelligence, Knowledge Engineering and Data Bases
Spatial clustering and outlier analysis for the regionalization of maize cultivation in China

WSEAS Transactions on Information Science and Applications
Spatial clustering for the regionalization of maize cultivation in China and its outlier analysis

WSEAS Transactions on Information Science and Applications

Quantified Score

Hi-index	0.03

Visualization

Abstract

We introduce a measure to compute similarity between two sequences containing accesses to Web pages, to be exploited in a clustering approach for grouping sessions of accesses to a Web site. The notion of sequence similarity is parametric to the sequence topology, and the similarity among Web pages within the sequences. In our formalization, two Web pages are similar if they can be considered synonymies not only from a content point of view, but also from a usage point of view, i.e., if users exhibit the same behavior on both pages. The refined notion of page similarity, as well as the related notion of sequence siilarity, are envisaged to be effective in the application of a centroid-based clustering technique to the personalization of Web experience.