Indexing text data under space constraints
Proceedings of the thirteenth ACM international conference on Information and knowledge management
A search engine for natural language applications
WWW '05 Proceedings of the 14th international conference on World Wide Web
Foundations and Trends in Databases
Benchmarking Fulltext Search Performance of RDF Stores
ESWC 2009 Heraklion Proceedings of the 6th European Semantic Web Conference on The Semantic Web: Research and Applications
A Video Retrieval System for Computer Assisted Language Learning
Proceedings of the 2005 conference on Artificial Intelligence in Education: Supporting Learning through Intelligent and Socially Informed Technology
Entity annotation based on inverse index operations
EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Efficiently evaluating complex boolean expressions
Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
The architecture and implementation of an extensible web crawler
NSDI'10 Proceedings of the 7th USENIX conference on Networked systems design and implementation
Manimal: relational optimization for data-intensive programs
Procceedings of the 13th International Workshop on the Web and Databases
Processing SPARQL queries with regular expressions in RDF databases
DTMBIO '10 Proceedings of the ACM fourth international workshop on Data and text mining in biomedical informatics
A robust index for regular expression queries
Proceedings of the 20th ACM international conference on Information and knowledge management
On supporting efficient updates of regular expression indexes in RDF databases
Proceedings of the ACM fifth international workshop on Data and text mining in biomedical informatics
Probabilistic management of OCR data using an RDBMS
Proceedings of the VLDB Endowment
A prefiltering approach to regular expression matching for network security systems
ACNS'12 Proceedings of the 10th international conference on Applied Cryptography and Network Security
Efficient subsequence search in databases
WAIM'13 Proceedings of the 14th international conference on Web-Age Information Management
Hi-index | 0.00 |
In this paper, we describe the design, architecture, and the lessons learned from the implementation of a fast regular expression indexing engine FREE. FREE uses a pre-built index to identify the text data units which may contain a matching string and only examines these further. In this way, FREE shows orders of magnitude performance improvement in certain cases over standard regular expression matching systems, such as lex, awk and grep.