Ord i dag: mining norwegian daily newswire

  • Authors:
  • Unni Cathrine Eiken;Anja Therese Liseth;Hans Friedrich Witschel;Matthias Richter;Chris Biemann

  • Affiliations:
  • AKSIS, University of Bergen, Bergen, Norway;AKSIS, University of Bergen, Bergen, Norway;NLP Department, University of Leipzig, Leipzig, Germany;NLP Department, University of Leipzig, Leipzig, Germany;NLP Department, University of Leipzig, Leipzig, Germany

  • Venue:
  • FinTAL'06 Proceedings of the 5th international conference on Advances in Natural Language Processing
  • Year:
  • 2006

Quantified Score

Hi-index 0.01

Visualization

Abstract

We present Ord i Dag, a new service that displays today’s most important keywords. These are extracted fully automatically from Norwegian online newspapers. Describing the complete process, we provide an entirely disclosed method for media monitoring and news summarization. For keyword extraction, a reference corpus serves as background about average language use, which is contrasted with the current day’s word frequencies. Having detected the most prominent keywords of a day, we introduce several ways of grouping and displaying them in intuitive ways. A discussion about possible applications concludes. Up to now, the service is available for Norwegian and German. As only some shallow language-specific processing is needed, it can easily be set up for other languages.