Terminology Extraction from Log Files

Authors:
Hassan Saneifar;Stéphane Bonniol;Anne Laurent;Pascal Poncelet;Mathieu Roche
Affiliations:
LIRMM - Université Montpellier 2 --- CNRS, Montpellier Cedex 5, France 34392 and Satin IP Technologies, Cap Omega, RP Benjamin Franklin, Montpellier Cedex 2, France 34960;Satin IP Technologies, Cap Omega, RP Benjamin Franklin, Montpellier Cedex 2, France 34960;LIRMM - Université Montpellier 2 --- CNRS, Montpellier Cedex 5, France 34392;LIRMM - Université Montpellier 2 --- CNRS, Montpellier Cedex 5, France 34392;LIRMM - Université Montpellier 2 --- CNRS, Montpellier Cedex 5, France 34392
Venue:
DEXA '09 Proceedings of the 20th International Conference on Database and Expert Systems Applications
Year:
2009

Citing 8
Cited 2

The use of bigrams to enhance text categorization

Information Processing and Management: an International Journal
Retrieving collocations from text: Xtract

Computational Linguistics - Special issue on using large corpora: I
A simple rule-based part of speech tagger

ANLC '92 Proceedings of the third conference on Applied natural language processing
Noun-phrase analysis in unrestricted text for information retrieval

ACL '96 Proceedings of the 34th annual meeting on Association for Computational Linguistics
Mining interesting knowledge from weblogs: a survey

Data & Knowledge Engineering
Dynamic syslog mining for network failure monitoring

Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Conceptual structuring through term variations

MWE '03 Proceedings of the ACL 2003 workshop on Multiword expressions: analysis, acquisition and treatment - Volume 18
Ontology aided query expansion for retrieving relevant texts

AWIC'05 Proceedings of the Third international conference on Advances in Web Intelligence

Passage retrieval in log files: an approach based on query enrichment

IceTAL'10 Proceedings of the 7th international conference on Advances in natural language processing
Event log messages as a human interface, or, "do you pine for the days when men were men and wrote their own device drivers?"

Proceedings of the 22nd Conference of the Computer-Human Interaction Special Interest Group of Australia on Computer-Human Interaction

Quantified Score

Hi-index	0.00

Visualization

Abstract

The log files generated by digital systems can be used in management information systems as the source of important information on the condition of systems. However, log files are not exhaustively exploited in order to extract information. The classical methods of information extraction such as terminology extraction methods are irrelevant to this context because of the specific characteristics of log files like their heterogeneous structure, the special vocabulary and the fact that they do not respect a natural language grammar. In this paper, we introduce our approach Exterlog to extract the terminology from log files. We detail how it deals with the particularity of such textual data.