Formal languages
Combinatorial optimization: algorithms and complexity
Combinatorial optimization: algorithms and complexity
Algorithms (2nd ed.)
Elements of machine learning
Parallel Suffix--Prefix-Matching Algorithm and Applications
SIAM Journal on Computing
Journal of the ACM (JACM)
Machine Learning
Constraint Grammar: A Language-Independent System for Parsing Unrestricted Text
Constraint Grammar: A Language-Independent System for Parsing Unrestricted Text
Rapid identification of repeated patterns in strings, trees and arrays
STOC '72 Proceedings of the fourth annual ACM symposium on Theory of computing
Finding submasses in weighted strings with Fast Fourier Transform
Discrete Applied Mathematics
Efficient one-dimensional real scaled matching
Journal of Discrete Algorithms
Journal of Discrete Algorithms
New algorithms for text fingerprinting
Journal of Discrete Algorithms
SPIRE '08 Proceedings of the 15th International Symposium on String Processing and Information Retrieval
SPIRE '09 Proceedings of the 16th International Symposium on String Processing and Information Retrieval
A faster query algorithm for the text fingerprinting problem
ESA'07 Proceedings of the 15th annual European conference on Algorithms
Indexing a dictionary for subset matching queries
SPIRE'07 Proceedings of the 14th international conference on String processing and information retrieval
On table arrangements, scrabble freaks, and jumbled pattern matching
FUN'10 Proceedings of the 5th international conference on Fun with algorithms
A PTAS for the square tiling problem
SPIRE'10 Proceedings of the 17th international conference on String processing and information retrieval
Faster query algorithms for the text fingerprinting problem
Information and Computation
Counting the orderings for multisets in consecutive ones property and PQ-trees
DLT'11 Proceedings of the 15th international conference on Developments in language theory
New algorithms for text fingerprinting
CPM'06 Proceedings of the 17th Annual conference on Combinatorial Pattern Matching
Journal of Discrete Algorithms
Indexing a dictionary for subset matching queries
Algorithms and Applications
CPM'12 Proceedings of the 23rd Annual conference on Combinatorial Pattern Matching
Consecutive ones property and PQ-trees for multisets: Hardness of counting their orderings
Information and Computation
Parikh matching in the streaming model
SPIRE'12 Proceedings of the 19th international conference on String Processing and Information Retrieval
Various improvements to text fingerprinting
Journal of Discrete Algorithms
Hi-index | 0.00 |
We consider the problem of fingerprinting text by sets of symbols. Specifically, if S is a string, of length n, over a finite, ordered alphabet Σ, and S' is a substring of S, then the fingerprint of S' is the subset φ of Σ of precisely the symbols appearing in S'. In this paper we show efficient methods of answering various queries on fingerprint statistics. Our preprocessing is done in time O(n|Σ|log n log |Σ|) and enables answering the following queries: (1) Given an integer k, compute the number of distinct fingerprints of size k in time O(1). (2) Given a set φ ⊆ Σ, compute the total number of distinct occurrences in S of substrings with fingerprint φ in time O(|Σ|logn).