Bayesian Analysis of Online Newspaper Log Data

  • Authors:
  • Hannes Wettig;Jussi Lahtinen;Tuomas Lepola;Petri Myllymäki;Henry Tirri

  • Affiliations:
  • -;-;-;-;-

  • Venue:
  • SAINT-W '03 Proceedings of the 2003 Symposium on Applications and the Internet Workshops (SAINT'03 Workshops)
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we address the problem of analyzing weblog data collected at a typical online newspaper site. Wepropose a two-way clustering technique based on probability theory. On one hand the suggested method clusters thereaders of the online newspaper into user groups of similar browsing behaviour, where the clusters are determinedsolely based on the click streams collected. On the otherhand, the articles of the newspaper are clustered based onthe reading behaviour of the users. The two-way clustering produces statistical user and page profiles that can beanalyzed by domain experts for content personalization. Inaddition, the produced model can also be used for on-lineprediction so that given the user cluster of a person enteringthe site, and the page cluster of an article of a newspaper,one can infer whether or not the user will have a look at thepage in question.