A Fast Regular Expression Indexing Engine

Authors:
Affiliations:
Venue:
ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Year:
2002

Citing 0
Cited 15

Indexing text data under space constraints

Proceedings of the thirteenth ACM international conference on Information and knowledge management
A search engine for natural language applications

WWW '05 Proceedings of the 14th international conference on World Wide Web
Information Extraction

Foundations and Trends in Databases
Benchmarking Fulltext Search Performance of RDF Stores

ESWC 2009 Heraklion Proceedings of the 6th European Semantic Web Conference on The Semantic Web: Research and Applications
A Video Retrieval System for Computer Assisted Language Learning

Proceedings of the 2005 conference on Artificial Intelligence in Education: Supporting Learning through Intelligent and Socially Informed Technology
Entity annotation based on inverse index operations

EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Efficiently evaluating complex boolean expressions

Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
The architecture and implementation of an extensible web crawler

NSDI'10 Proceedings of the 7th USENIX conference on Networked systems design and implementation
Manimal: relational optimization for data-intensive programs

Procceedings of the 13th International Workshop on the Web and Databases
Processing SPARQL queries with regular expressions in RDF databases

DTMBIO '10 Proceedings of the ACM fourth international workshop on Data and text mining in biomedical informatics
A robust index for regular expression queries

Proceedings of the 20th ACM international conference on Information and knowledge management
On supporting efficient updates of regular expression indexes in RDF databases

Proceedings of the ACM fifth international workshop on Data and text mining in biomedical informatics
Probabilistic management of OCR data using an RDBMS

Proceedings of the VLDB Endowment
A prefiltering approach to regular expression matching for network security systems

ACNS'12 Proceedings of the 10th international conference on Applied Cryptography and Network Security
Efficient subsequence search in databases

WAIM'13 Proceedings of the 14th international conference on Web-Age Information Management

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we describe the design, architecture, and the lessons learned from the implementation of a fast regular expression indexing engine FREE. FREE uses a pre-built index to identify the text data units which may contain a matching string and only examines these further. In this way, FREE shows orders of magnitude performance improvement in certain cases over standard regular expression matching systems, such as lex, awk and grep.