Using Temporal Language Models for Document Dating

  • Authors: Nattiya Kanhabua; Kjetil Nørvåg
  • Affiliations: Dept. of Computer Science, Norwegian University of Science and Technology, Trondheim, Norway (both authors)
  • Venue: ECML PKDD '09: Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases, Part II
  • Year: 2009

Abstract

Taking the temporal dimension into account is attracting growing interest as a way to increase precision when searching for web pages and other web documents. A particular problem for documents found on the Internet is that, in general, no trustworthy timestamp is available, owing to the Internet's decentralized nature and the lack of standards for time and date. In previous work we presented techniques for solving this problem. In this paper, we present a tool that uses temporal language models to determine the timestamp of a non-timestamped document, given a file, URL, or text as input. We also outline how this tool will be demonstrated.
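
The abstract does not spell out how a temporal language model assigns a timestamp, so the following is only a minimal sketch of the general idea, not the authors' tool. It assumes a reference corpus already split into time partitions, builds one unigram model per partition, and assigns the partition whose model gives the input text the highest add-one-smoothed log-likelihood. The names `TOY_CORPUS`, `build_models`, `score`, and `date_document`, the whitespace tokenizer, and the choice of scoring function are all illustrative assumptions.

```python
import math
from collections import Counter

# Hypothetical toy corpus: time-partition label -> documents known to be from that period.
TOY_CORPUS = {
    "1999": ["millennium bug y2k fears", "y2k compliance deadline"],
    "2008": ["financial crisis subprime mortgage", "bank bailout crisis"],
}

def tokenize(text):
    """Lowercase whitespace tokenizer (an assumption; a real system would do more)."""
    return text.lower().split()

def build_models(corpus):
    """Build one unigram count model per time partition."""
    return {
        label: Counter(tok for doc in docs for tok in tokenize(doc))
        for label, docs in corpus.items()
    }

def score(tokens, counts, vocab_size):
    """Log-likelihood of the tokens under a partition model with add-one smoothing."""
    total = sum(counts.values())
    return sum(math.log((counts[t] + 1) / (total + vocab_size)) for t in tokens)

def date_document(text, models):
    """Return the label of the partition whose model best explains the text."""
    vocab = set().union(*models.values())
    vocab_size = len(vocab) + 1  # one extra slot for unseen words
    tokens = tokenize(text)
    return max(models, key=lambda label: score(tokens, models[label], vocab_size))

models = build_models(TOY_CORPUS)
estimated = date_document("subprime crisis hits bank", models)
```

In this sketch the partition granularity (here, years) bounds the precision of the estimate; finer partitions need correspondingly more reference text per partition.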