We present randomized algorithms to solve the following string-matching problem and some of its generalizations: Given a string X of length n (the pattern) and a string Y (the text), find the first occurrence of X as a consecutive block within Y. The algorithms represent strings of length n by much shorter strings called fingerprints, and achieve their efficiency by manipulating fingerprints instead of longer strings. The algorithms require a constant number of storage locations, and essentially run in real time. They are conceptually simple and easy to implement. The method readily generalizes to higher-dimensional pattern-matching problems.
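The fingerprint idea can be illustrated with a minimal Python sketch. It is not taken from the paper: the polynomial rolling fingerprint, the fixed modulus `MODULUS`, the random base, and the function name `find_first` are all assumptions chosen for illustration, and the paper's algorithms may select their fingerprint functions differently. The sketch also re-checks the characters whenever two fingerprints agree, which rules out false matches but means the slice comparison uses more than a constant number of storage locations.

```python
import random

# Hypothetical parameter: a fixed Mersenne prime used as the fingerprint modulus.
MODULUS = (1 << 61) - 1


def find_first(pattern: str, text: str, seed=None) -> int:
    """Return the index of the first occurrence of pattern in text, or -1."""
    n, m = len(pattern), len(text)
    if n == 0:
        return 0
    if n > m:
        return -1

    # Assumption: randomness comes from a randomly chosen base.
    rng = random.Random(seed)
    base = rng.randrange(256, MODULUS)
    # Weight of the leading character in a window of length n.
    high = pow(base, n - 1, MODULUS)

    def fingerprint(s: str) -> int:
        # Polynomial fingerprint of s, evaluated with Horner's rule.
        value = 0
        for ch in s:
            value = (value * base + ord(ch)) % MODULUS
        return value

    target = fingerprint(pattern)
    window = fingerprint(text[:n])

    for start in range(m - n + 1):
        # Compare fingerprints first; confirm with a direct comparison
        # so that a fingerprint collision never yields a wrong answer.
        if window == target and text[start:start + n] == pattern:
            return start
        if start + n < m:
            # Slide the window: drop text[start], append text[start + n].
            window = (window - ord(text[start]) * high) % MODULUS
            window = (window * base + ord(text[start + n])) % MODULUS
    return -1
```

Each window update costs a constant number of arithmetic operations, so the scan compares short fingerprints rather than whole blocks of length n, and only a fingerprint match triggers a character-by-character check.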