SAHN with SEP/COP and SPADE, to build a general web navigation adaptation system using server log information

Authors:
Olatz Arbelaitz;Ibai Gurrutxaga;Aizea Lojo;Javier Muguerza;Iñigo Perona
Affiliations:
Dept. of Computer Architecture and Technology, University of the Basque Country, Donostia, Spain;Dept. of Computer Architecture and Technology, University of the Basque Country, Donostia, Spain;Dept. of Computer Architecture and Technology, University of the Basque Country, Donostia, Spain;Dept. of Computer Architecture and Technology, University of the Basque Country, Donostia, Spain;Dept. of Computer Architecture and Technology, University of the Basque Country, Donostia, Spain
Venue:
CAEPIA'11 Proceedings of the 14th international conference on Advances in artificial intelligence: Spanish association for artificial intelligence
Year:
2011

Abstract

Over the last decades, the amount of information on the web has increased drastically, but larger quantities of data do not by themselves add value for web visitors; there is a need for easier access to the required information and for adaptation to visitors' preferences or needs. Using machine learning techniques to build user models makes it possible to take their real preferences into account. In this work we present the design of a complete system, based on the collaborative filtering approach, that identifies interesting links for users while they are navigating and makes access to those links easier. Starting from web navigation logs and adding a generalization procedure to the preprocessing step, we use agglomerative hierarchical clustering (SAHN) combined with SEP/COP, a novel methodology to obtain the best partition from a hierarchy, to group users with similar navigation behavior or interests. We then use SPADE as a sequential pattern discovery technique to obtain the most probable transactions for the users belonging to each group, so that the navigation of future users can be adapted according to those profiles. The experiments show that the designed system performs efficiently in a web-accessible database and is even able to tackle the cold start or 0-day problem.
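
The two-stage idea in the abstract (cluster sessions, then mine sequential patterns per cluster) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the toy sessions, the Jaccard distance, the average linkage, and the fixed `max_distance` cut are all assumptions made for illustration. In particular, the fixed cut stands in for SEP/COP, and the naive ordered-pair counter stands in for SPADE.

```python
from itertools import combinations
from collections import Counter

# Hypothetical toy sessions: each is the ordered list of pages one user visited.
SESSIONS = [
    ["home", "news", "sports"],
    ["home", "news", "sports", "scores"],
    ["news", "sports"],
    ["home", "shop", "cart"],
    ["shop", "cart", "checkout"],
]

def jaccard_distance(a, b):
    """1 - |A & B| / |A | B| over the sets of pages in two sessions (assumed metric)."""
    sa, sb = set(a), set(b)
    return 1.0 - len(sa & sb) / len(sa | sb)

def average_linkage(c1, c2, sessions):
    """Mean pairwise distance between the members of two clusters."""
    total = sum(jaccard_distance(sessions[i], sessions[j]) for i in c1 for j in c2)
    return total / (len(c1) * len(c2))

def agglomerate(sessions, max_distance):
    """Merge the two closest clusters until none are closer than max_distance.

    The fixed threshold is a stand-in for SEP/COP, which instead selects the
    partition from the hierarchy using a cluster validity index.
    """
    clusters = [[i] for i in range(len(sessions))]
    while len(clusters) > 1:
        (x, y), dist = min(
            (((x, y), average_linkage(clusters[x], clusters[y], sessions))
             for x, y in combinations(range(len(clusters)), 2)),
            key=lambda item: item[1],
        )
        if dist > max_distance:
            break
        clusters[x] = clusters[x] + clusters[y]  # x < y, so deleting y is safe
        del clusters[y]
    return [sorted(c) for c in clusters]

def frequent_pairs(sessions, min_support):
    """Ordered page pairs (a before b, gaps allowed) seen in >= min_support sessions.

    A naive stand-in for SPADE, limited to patterns of length two.
    """
    counts = Counter()
    for s in sessions:
        # A set per session, so each pair counts at most once per session.
        counts.update({(s[i], s[j]) for i in range(len(s)) for j in range(i + 1, len(s))})
    return {p: n for p, n in counts.items() if n >= min_support}

clusters = agglomerate(SESSIONS, max_distance=0.7)
profiles = [frequent_pairs([SESSIONS[i] for i in c], min_support=2) for c in clusters]
```

Under these assumptions, each entry of `profiles` maps an ordered page pair to its support within one cluster; a link suggestion for a new visitor could then be read off the pairs whose first page matches the page currently being viewed.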