The use of bigrams to enhance text categorization
Information Processing and Management: an International Journal
Retrieving collocations from text: Xtract
Computational Linguistics - Special issue on using large corpora: I
A simple rule-based part of speech tagger
ANLC '92 Proceedings of the third conference on Applied natural language processing
Noun-phrase analysis in unrestricted text for information retrieval
ACL '96 Proceedings of the 34th annual meeting on Association for Computational Linguistics
Mining interesting knowledge from weblogs: a survey
Data & Knowledge Engineering
Dynamic syslog mining for network failure monitoring
Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Conceptual structuring through term variations
MWE '03 Proceedings of the ACL 2003 workshop on Multiword expressions: analysis, acquisition and treatment - Volume 18
Ontology aided query expansion for retrieving relevant texts
AWIC'05 Proceedings of the Third international conference on Advances in Web Intelligence
Passage retrieval in log files: an approach based on query enrichment
IceTAL'10 Proceedings of the 7th international conference on Advances in natural language processing
Proceedings of the 22nd Conference of the Computer-Human Interaction Special Interest Group of Australia on Computer-Human Interaction
Hi-index | 0.00 |
The log files generated by digital systems can be used in management information systems as the source of important information on the condition of systems. However, log files are not exhaustively exploited in order to extract information. The classical methods of information extraction such as terminology extraction methods are irrelevant to this context because of the specific characteristics of log files like their heterogeneous structure, the special vocabulary and the fact that they do not respect a natural language grammar. In this paper, we introduce our approach Exterlog to extract the terminology from log files. We detail how it deals with the particularity of such textual data.