Web mining and its SQL based parallel execution

  • Authors:
  • Masaru Kitsuregawa;Takahiko Shintani;Iko Pramudiono

  • Affiliations:
  • The University of Tokyo, 7-22-1 Roppongi, Minato-ku, Tokyo 106, Japan;The University of Tokyo, 7-22-1 Roppongi, Minato-ku, Tokyo 106, Japan;The University of Tokyo, 7-22-1 Roppongi, Minato-ku, Tokyo 106, Japan

  • Venue:
  • ITVE '01 Proceedings of the workshop on Information technology for virtual enterprises
  • Year:
  • 2001

Abstract

Web mining can be classified into two categories: Web access log mining and Web structure mining. We performed association rule mining and sequence pattern mining on the access log accumulated at the NTT Software Mobile Info Search portal site. This paper reports the detailed Web log mining process and the rules we derived. We also explore parallel association rule mining on a large-scale PC cluster system, where parallelism is key to improving performance. We achieved substantial speedup through parallel SQL execution.
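As a rough illustration of what SQL-based association rule mining on an access log can look like, the sketch below counts the support of page pairs with a self-join. It is not the authors' query or schema: the table `access_log`, its columns, the function `frequent_pairs`, and the sample page names are all hypothetical, and it runs on a single in-memory SQLite database rather than a parallel system.

```python
import sqlite3


def frequent_pairs(rows, min_support):
    """Return {(page_a, page_b): support} for page pairs that co-occur
    in at least `min_support` distinct sessions."""
    conn = sqlite3.connect(":memory:")
    # Hypothetical schema: one row per (session, page) access.
    conn.execute("CREATE TABLE access_log (session_id INTEGER, page TEXT)")
    conn.executemany("INSERT INTO access_log VALUES (?, ?)", rows)

    # Self-join on session_id generates candidate 2-itemsets;
    # a.page < b.page keeps each unordered pair exactly once.
    query = """
        SELECT a.page, b.page, COUNT(DISTINCT a.session_id) AS support
        FROM access_log AS a
        JOIN access_log AS b
          ON a.session_id = b.session_id AND a.page < b.page
        GROUP BY a.page, b.page
        HAVING COUNT(DISTINCT a.session_id) >= ?
    """
    result = {(p, q): s for p, q, s in conn.execute(query, (min_support,))}
    conn.close()
    return result


# Invented sample data: (session_id, page) pairs.
rows = [
    (1, "map"), (1, "weather"), (1, "restaurant"),
    (2, "map"), (2, "restaurant"),
    (3, "map"), (3, "weather"),
    (4, "restaurant"),
]
pairs = frequent_pairs(rows, min_support=2)
print(pairs)
```

In a parallel setting, one common approach is to partition the log by session so that each node can evaluate the join locally before the counts are merged; whether the paper does this is not stated in the abstract.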