Automatic keyword extraction by server log analysis

  • Authors:
  • Chen Ding;Jin Zhou;Chi-Hung Chi

  • Affiliations:
  • Department of Computer Science, Ryerson University, Toronto, ON, Canada;Department of Computer Science, Ryerson University, Toronto, ON, Canada;School of Software, Tsinghua University, Beijing, China

  • Venue:
  • WISE'05 Proceedings of the 6th international conference on Web Information Systems Engineering
  • Year:
  • 2005

Abstract

Traditionally, keywords are extracted from the full text of a document. In the web environment, however, there are more sources we can use to provide a more complete view of a web page’s contents. In this paper, we propose analyzing web server logs to extract keywords for entry pages from anchor texts and query terms, and then propagating these terms along user access paths to other linked pages. The major benefits of this method are that the extracted terms can reflect temporal changes, and that they capture the users’ view of a page’s contents rather than the author’s.
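
The idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' algorithm: the abstract gives no formulas, so the `q` query parameter, the `DECAY` factor, the session format, and the function names `query_terms` and `extract_keywords` are all assumptions made for illustration. Anchor texts are left out for brevity; only query terms found in the referrer of the entry page are used.

```python
from collections import Counter, defaultdict
from urllib.parse import urlparse, parse_qs

# Hypothetical weight reduction applied for each further click along a path.
DECAY = 0.5


def query_terms(referrer):
    """Return lowercase search terms from the referrer's `q` parameter.

    Assumes the search engine passes the query as `q=...`; returns an
    empty list when no such parameter is present.
    """
    params = parse_qs(urlparse(referrer).query)
    terms = []
    for value in params.get("q", []):
        terms.extend(value.lower().split())
    return terms


def extract_keywords(sessions, decay=DECAY):
    """Assign query terms to entry pages and propagate them along paths.

    sessions: list of (referrer, [entry_page, next_page, ...]) tuples,
    assumed to have been reconstructed from the server log beforehand.
    Returns a mapping {page: Counter(term -> weight)}.
    """
    weights = defaultdict(Counter)
    for referrer, path in sessions:
        terms = query_terms(referrer)
        if not terms:
            continue
        factor = 1.0  # the entry page receives the full weight
        for page in path:
            for term in terms:
                weights[page][term] += factor
            factor *= decay  # pages further along the path receive less
    return weights


# Made-up sessions for illustration only.
sessions = [
    ("https://search.example/?q=keyword+extraction",
     ["/index.html", "/papers.html"]),
    ("https://search.example/?q=log+analysis", ["/index.html"]),
]
weights = extract_keywords(sessions)
```

Because the weights are recomputed from whatever log window is supplied, running the sketch on a more recent window would yield terms that follow current user queries, which is the temporal benefit the abstract points to.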