Clipping and Analyzing News Using Machine Learning Techniques

  • Authors:
  • Hans Gründel;Tino Naphtali;Christian Wiech;Jan-Marian Gluba;Maiken Rohdenburg;Tobias Scheffer

  • Affiliations:
  • -;-;-;-;-;-

  • Venue:
  • DS '01 Proceedings of the 4th International Conference on Discovery Science
  • Year:
  • 2001

Quantified Score

Hi-index 0.00

Visualization

Abstract

Generating press clippings for companies manually requires a considerable amount of resources. We describe a system that monitors online newspapers and discussion boards automatically. The system extracts, classifies and analyzes messages and generates press clippings automatically, taking the specific needs of client companies into account. Key components of the system are a spider, an information extraction engine, a text classifier based on the Support Vector Machine that categorizes messages by subject, and a second classifier that analyzes which emotional state the author of a newsgroup posting was likely to be in. By analyzing large amount of messages, the system can summarize the main issues that are being reported on for given business sectors, and can summarize the emotional attitude of customers and shareholders towards companies.