Terminology Extraction from Log Files

  • Authors:
  • Hassan Saneifar;Stéphane Bonniol;Anne Laurent;Pascal Poncelet;Mathieu Roche

  • Affiliations:
  • LIRMM - Université Montpellier 2 --- CNRS, Montpellier Cedex 5, France 34392 and Satin IP Technologies, Cap Omega, RP Benjamin Franklin, Montpellier Cedex 2, France 34960;Satin IP Technologies, Cap Omega, RP Benjamin Franklin, Montpellier Cedex 2, France 34960;LIRMM - Université Montpellier 2 --- CNRS, Montpellier Cedex 5, France 34392;LIRMM - Université Montpellier 2 --- CNRS, Montpellier Cedex 5, France 34392;LIRMM - Université Montpellier 2 --- CNRS, Montpellier Cedex 5, France 34392

  • Venue:
  • DEXA '09 Proceedings of the 20th International Conference on Database and Expert Systems Applications
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

The log files generated by digital systems can be used in management information systems as the source of important information on the condition of systems. However, log files are not exhaustively exploited in order to extract information. The classical methods of information extraction such as terminology extraction methods are irrelevant to this context because of the specific characteristics of log files like their heterogeneous structure, the special vocabulary and the fact that they do not respect a natural language grammar. In this paper, we introduce our approach Exterlog to extract the terminology from log files. We detail how it deals with the particularity of such textual data.