Focused Crawling Using Context Graphs
VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Slot Grammar: A System for Simpler Construction of Practical Natural Language Grammars
Proceedings of the International Symposium on Natural Language and Logic
Geographic Data Mining and Knowledge Discovery
Geographic Data Mining and Knowledge Discovery
Computational Linguistics
Message Understanding Conference-6: a brief history
COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 1
MumbleSearch Extraction of High Quality Web information for SME
WI '04 Proceedings of the 2004 IEEE/WIC/ACM International Conference on Web Intelligence
ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Semi-supervised time series classification
Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Stalker, A Multilingual Text Mining Search Engine for Open Source Intelligence
IV '08 Proceedings of the 2008 12th International Conference Information Visualisation
Introduction to the CoNLL-2005 shared task: semantic role labeling
CONLL '05 Proceedings of the Ninth Conference on Computational Natural Language Learning
Hi-index | 0.00 |
The revolution in information technology is making open sources more accessible, ubiquitous and valuable. The international intelligence communities have seen open sources become increasingly easier and cheaper to acquire in recent years. But up to 80% of electronic data is textual and most valuable information is often hidden and encoded in pages which are neither structured nor classified. The process of accessing all these raw data, heterogeneous in terms of source and language, and transforming them into information is therefore strongly linked to automatic textual analysis and synthesis, which are greatly related to the ability to master the problems of multilinguality. This paper describes a content-enabling system that provides deep semantic search and information access to large quantities of distributed multimedia data for both experts and the general public. Stalker provides a language-independent search and dynamic classification features for a broad range of data collected from several sources in a number of culturally diverse languages.