Web user profiling on proxy logs and its evaluation in personalization

  • Authors:
  • Hiroshi Fujimoto;Minoru Etoh;Akira Kinno;Yoshikazu Akinaga

  • Affiliations:
  • NTT DOCOMO R&D Center, Yokosuka-shi, Kanagawa, Japan;Osaka University Cybermedia Center, Toyonaka, Osaka, Japan;NTT DOCOMO R&D Center, Yokosuka-shi, Kanagawa, Japan;NTT DOCOMO R&D Center, Yokosuka-shi, Kanagawa, Japan

  • Venue:
  • APWeb'11 Proceedings of the 13th Asia-Pacific web conference on Web technologies and applications
  • Year:
  • 2011

Abstract

We propose a web user profiling and clustering framework based on latent Dirichlet allocation (LDA) topic modeling, by analogy with document analysis: users are treated as documents and their actions as words. The main technical challenge addressed here is how to represent web access actions, as monitored through a web proxy, by words. We develop a hierarchical URL dictionary generated from Yahoo! Directory and a cross-hierarchical matching method that provides automatic abstraction. We apply the proposed framework to 7500 students at Osaka University, analyzing 40 GB of their click streams collected over a four-month period. We evaluate the effectiveness of clustering-based recommendation to confirm the optimality of the framework. The results show high hit precision compared with existing methods.
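
To make the users-as-documents analogy concrete, the following is a minimal Python sketch of how proxy log entries might be turned into per-user bags of category "words". It is not the paper's cross-hierarchical matching method: the dictionary entries, the function names `url_to_word` and `build_user_documents`, and the simple longest-prefix fallback are all assumptions made for illustration.

```python
from collections import Counter
from urllib.parse import urlsplit

# Hypothetical dictionary: URL prefix (host plus leading path) -> category path.
# These entries are invented for illustration; they are not Yahoo! Directory data.
URL_DICTIONARY = {
    "example.com": "News",
    "example.com/sports": "News/Sports",
    "example.com/sports/soccer": "News/Sports/Soccer",
    "shop.example.org": "Shopping",
}


def url_to_word(url, dictionary=URL_DICTIONARY):
    """Map a URL to a category word by longest-prefix match (an assumption).

    Path segments are dropped from the right until a dictionary entry
    matches, so an unlisted deep page falls back to a more general category.
    Returns None when even the bare host is unknown.
    """
    parts = urlsplit(url)
    host = parts.netloc.lower()
    segments = [s for s in parts.path.split("/") if s]
    for depth in range(len(segments), -1, -1):
        key = "/".join([host] + segments[:depth])
        if key in dictionary:
            return dictionary[key]
    return None


def build_user_documents(log_records, dictionary=URL_DICTIONARY):
    """Turn (user_id, url) records into one bag of words per user."""
    documents = {}
    for user_id, url in log_records:
        word = url_to_word(url, dictionary)
        if word is not None:  # skip URLs the dictionary cannot abstract
            documents.setdefault(user_id, Counter())[word] += 1
    return documents


if __name__ == "__main__":
    records = [
        ("u1", "http://example.com/sports/soccer/match1.html"),
        ("u1", "http://example.com/weather"),
        ("u2", "http://shop.example.org/cart"),
    ]
    for user, bag in build_user_documents(records).items():
        print(user, dict(bag))
```

Each resulting `Counter` plays the role of one document's word counts, so the collection could in principle be passed to any LDA implementation to obtain per-user topic mixtures for clustering.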