Fast algorithms for finding nearest common ancestors
SIAM Journal on Computing
Algorithms for approximate string matching
Information and Control
The accuracy of approximate string matching algorithms
Journal of Computer Based Instruction
Theoretical Computer Science
A new distance metric on strings computable in linear time
Discrete Applied Mathematics
Fast approximate string matching
Software—Practice & Experience
Data structures and algorithms for approximate string matching
Journal of Complexity
Fast string matching with k-differences
Journal of Computer and System Sciences - 26th IEEE Conference on Foundations of Computer Science, October 21-23, 1985
A greedy approximation algorithm for constructing shortest common superstrings
Theoretical Computer Science - International Symposium on Mathematical Foundations of Computer Science, Bratisl
On finding lowest common ancestors: simplification and parallelization
SIAM Journal on Computing
Fast parallel and serial approximate string matching
Journal of Algorithms
Efficient text searching
A review of segmentation and contextual analysis techniques for text recognition
Pattern Recognition
A very fast substring search algorithm
Communications of the ACM
Introduction to algorithms
Fast algorithms for two-dimensional and multiple pattern matching (preliminary version)
SWAT '90 Proceedings of the second Scandinavian workshop on Algorithm theory
An improved algorithm for approximate string matching
SIAM Journal on Computing
Handbook of algorithms and data structures: in Pascal and C (2nd ed.)
Handbook of algorithms and data structures: in Pascal and C (2nd ed.)
A new approach to text searching
Communications of the ACM
Fast text searching: allowing errors
Communications of the ACM
Approximate string-matching with q-grams and maximal matches
Theoretical Computer Science - Selected papers of the Combinatorial Pattern Matching School
Techniques for automatically correcting words in text
ACM Computing Surveys (CSUR)
Approximate Boyer-Moore string matching
SIAM Journal on Computing
Fast algorithms for approximately counting mismatches
Information Processing Letters
Approximate string matching using within-word parallelism
Software—Practice & Experience
Text algorithms
A subquadratic algorithm for approximate regular expression matching
Journal of Algorithms
Fast and practical approximate string matching
Information Processing Letters
Phonetic string matching: lessons from information retrieval
SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
A comparison of approximate string matching algorithms
Software—Practice & Experience
Block edit models for approximate string matching
Theoretical Computer Science - Special issue: Latin American theoretical informatics
Algorithms on strings, trees, and sequences: computer science and computational biology
Algorithms on strings, trees, and sequences: computer science and computational biology
Applications of approximate word matching in information retrieval
CIKM '97 Proceedings of the sixth international conference on Information and knowledge management
Pattern matching algorithms
SIAM Journal on Computing
The art of computer programming, volume 3: (2nd ed.) sorting and searching
The art of computer programming, volume 3: (2nd ed.) sorting and searching
Approximate string matching: a simpler faster algorithm
Proceedings of the ninth annual ACM-SIAM symposium on Discrete algorithms
A fast bit-vector algorithm for approximate string matching based on dynamic programming
Journal of the ACM (JACM)
PATRICIA—Practical Algorithm To Retrieve Information Coded in Alphanumeric
Journal of the ACM (JACM)
The String-to-String Correction Problem
Journal of the ACM (JACM)
An Extension of the String-to-String Correction Problem
Journal of the ACM (JACM)
A Space-Economical Suffix Tree Construction Algorithm
Journal of the ACM (JACM)
Block addressing indices for approximate text retrieval
Journal of the American Society for Information Science - Special topic issue: When museum informatics meets the World Wide Web
Very fast and simple approximate string matching
Information Processing Letters
ACM Computing Surveys (CSUR)
The string-to-string correction problem with block moves
ACM Transactions on Computer Systems (TOCS)
A technique for isolating differences between files
Communications of the ACM
A fast string searching algorithm
Communications of the ACM
Efficient string matching: an aid to bibliographic search
Communications of the ACM
A technique for computer detection and correction of spelling errors
Communications of the ACM
Improved approximate pattern matching on hypertext
Theoretical Computer Science
Fast and flexible string matching by combining bit-parallelism and suffix automata
Journal of Experimental Algorithmics (JEA)
High Performance Computational Methods for Biological Sequence Analysis
High Performance Computational Methods for Biological Sequence Analysis
Modern Information Retrieval
Introduction To Automata Theory, Languages, And Computation
Introduction To Automata Theory, Languages, And Computation
Combinatorial Algorithms on Words
Combinatorial Algorithms on Words
Automatic Speech and Speaker Recognition
Automatic Speech and Speaker Recognition
The Design and Analysis of Computer Algorithms
The Design and Analysis of Computer Algorithms
Adding Compression to Block Addressing Inverted Indexes
Information Retrieval
Text-Retrieval: Theory and Practice
Proceedings of the IFIP 12th World Computer Congress on Algorithms, Software, Architecture - Information Processing '92, Volume 1 - Volume I
Multiple Approximate String Matching
WADS '97 Proceedings of the 5th International Workshop on Algorithms and Data Structures
WADS '97 Proceedings of the 5th International Workshop on Algorithms and Data Structures
A String Matching Algorithm Fast on the Average
Proceedings of the 6th Colloquium, on Automata, Languages and Programming
Approximate Pattern Matching with Samples
ISAAC '94 Proceedings of the 5th International Symposium on Algorithms and Computation
A Unified View to String Matching Algorithms
SOFSEM '96 Proceedings of the 23rd Seminar on Current Trends in Theory and Practice of Informatics: Theory and Practice of Informatics
Theoretical and Empirical Comparisons of Approximate String Matching Algorithms
CPM '92 Proceedings of the Third Annual Symposium on Combinatorial Pattern Matching
Approximate String-Matching over Suffix Trees
CPM '93 Proceedings of the 4th Annual Symposium on Combinatorial Pattern Matching
Approximate String Matching and Local Similarity
CPM '94 Proceedings of the 5th Annual Symposium on Combinatorial Pattern Matching
Filtration with q-Samples in Approximate String Matching
CPM '96 Proceedings of the 7th Annual Symposium on Combinatorial Pattern Matching
A Faster Algorithm for Approximate String Matching
CPM '96 Proceedings of the 7th Annual Symposium on Combinatorial Pattern Matching
Approximate Multiple Strings Search
CPM '96 Proceedings of the 7th Annual Symposium on Combinatorial Pattern Matching
CPM '97 Proceedings of the 8th Annual Symposium on Combinatorial Pattern Matching
Estimating the Probability of Approximate Matches
CPM '97 Proceedings of the 8th Annual Symposium on Combinatorial Pattern Matching
Efficient Algorithms for Approximate String Matching with Swaps (Extended Abstract)
CPM '97 Proceedings of the 8th Annual Symposium on Combinatorial Pattern Matching
On Using q-Gram Locations in Approximate String Matching
ESA '95 Proceedings of the Third Annual European Symposium on Algorithms
FOCS '97 Proceedings of the 38th Annual Symposium on Foundations of Computer Science
Proceedings of the ACM SIGPLAN SIGOA symposium on Text manipulation
On the Approximate Pattern Occurrences in a Text
SEQUENCES '97 Proceedings of the Compression and Complexity of Sequences 1997
String Matching with Differences by Finite Automata
ICPR '96 Proceedings of the 13th International Conference on Pattern Recognition - Volume 2
A suboptimal lossy data compression based on approximate pattern matching
IEEE Transactions on Information Theory
Fast and flexible string matching by combining bit-parallelism and suffix automata
Journal of Experimental Algorithmics (JEA)
Proceedings of the 2002 ACM SIGMOD international conference on Management of data
A syntactic approach for searching similarities within sentences
Proceedings of the eleventh international conference on Information and knowledge management
New and faster filters for multiple approximate string matching
Random Structures & Algorithms
Matchsimile: a flexible approximate matching tool for searching proper names
Journal of the American Society for Information Science and Technology
Database indexing for large DNA and protein sequence collections
The VLDB Journal — The International Journal on Very Large Data Bases
Approximate String Matching in LDAP Based on Edit Distance
IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
A Database Index to Large Biological Sequences
Proceedings of the 27th International Conference on Very Large Data Bases
Approximate String Joins in a Database (Almost) for Free
Proceedings of the 27th International Conference on Very Large Data Bases
Computing the Threshold for q-Gram Filters
SWAT '02 Proceedings of the 8th Scandinavian Workshop on Algorithm Theory
String Matching with Metric Trees Using an Approximate Distance
SPIRE 2002 Proceedings of the 9th International Symposium on String Processing and Information Retrieval
Approximate String Matching over Ziv-Lempel Compressed Text
COM '00 Proceedings of the 11th Annual Symposium on Combinatorial Pattern Matching
Indexing Text with Approximate q-Grams
COM '00 Proceedings of the 11th Annual Symposium on Combinatorial Pattern Matching
Identifying Occurrences of Maximal Pairs in Multiple Strings
CPM '02 Proceedings of the 13th Annual Symposium on Combinatorial Pattern Matching
One-Gapped q-Gram Filtersfor Levenshtein Distance
CPM '02 Proceedings of the 13th Annual Symposium on Combinatorial Pattern Matching
Faster Bit-Parallel Approximate String Matching
CPM '02 Proceedings of the 13th Annual Symposium on Combinatorial Pattern Matching
Proceedings of the 9th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
A Metric Index for Approximate String Matching
LATIN '02 Proceedings of the 5th Latin American Symposium on Theoretical Informatics
Regular Expression Searching over Ziv-Lempel Compressed Text
CPM '01 Proceedings of the 12th Annual Symposium on Combinatorial Pattern Matching
Discovering instances of poetic allusion from anthologies of classical Japanese poems
Theoretical Computer Science
Searching in metric spaces by spatial approximation
The VLDB Journal — The International Journal on Very Large Data Bases
Interactive deduplication using active learning
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Text joins in an RDBMS for web data integration
WWW '03 Proceedings of the 12th international conference on World Wide Web
Probabilistic term variant generator for biomedical terms
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Faster Approximate String Matching over Compressed Text
DCC '01 Proceedings of the Data Compression Conference
Similarity among melodies for music information retrieval
CIKM '03 Proceedings of the twelfth international conference on Information and knowledge management
Future Generation Computer Systems - Selected papers on theoretical and computational aspects of structural dynamical systems in linear algebra and control
Discovering regularities in biosequences: challenges and applications
ICCMSE '03 Proceedings of the international conference on Computational methods in sciences and engineering
Effective text extraction and recognition for WWW images
Proceedings of the 2003 ACM symposium on Document engineering
Fast multipattern search algorithms for intrusion detection
Fundamenta Informaticae - Special issue on computing patterns in strings
Better filtering with gapped q-grams
Fundamenta Informaticae - Special issue on computing patterns in strings
Regular expression searching on compressed text
Journal of Discrete Algorithms
Approximate string matching on Ziv-Lempel compressed text
Journal of Discrete Algorithms
Exact pattern matching for RNA secondary structures
APBC '04 Proceedings of the second conference on Asia-Pacific bioinformatics - Volume 29
Efficient similarity-based operations for data integration
Data & Knowledge Engineering
Iterative record linkage for cleaning and integration
Proceedings of the 9th ACM SIGMOD workshop on Research issues in data mining and knowledge discovery
Detecting duplicate objects in XML documents
Proceedings of the 2004 international workshop on Information quality in information systems
Methods for evaluating and creating data quality
Information Systems - Special issue: Data quality in cooperative information systems
Content-based music structure analysis with applications to music semantics understanding
Proceedings of the 12th annual ACM international conference on Multimedia
Privacy-preserving data linkage protocols
Proceedings of the 2004 ACM workshop on Privacy in the electronic society
Indexing text data under space constraints
Proceedings of the thirteenth ACM international conference on Information and knowledge management
A recursive MISD architecture for pattern matching
IEEE Transactions on Very Large Scale Integration (VLSI) Systems
Constructing Suffix Tree for Gigabyte Sequences with Megabyte Memory
IEEE Transactions on Knowledge and Data Engineering
Average-optimal single and multiple approximate string matching
Journal of Experimental Algorithmics (JEA)
Improving the performance of dictionary-based approaches in protein name recognition
Journal of Biomedical Informatics - Special issue: Named entity recognition in biomedicine
Approximate string matching with ordered q-grams
Nordic Journal of Computing
Approximate regular expression searching with arbitrary integer weights
Nordic Journal of Computing
Substructure similarity search in graph databases
Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Automatic music video summarization based on audio-visual-text analysis and alignment
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Approximate matching of hierarchical data using pq-grams
VLDB '05 Proceedings of the 31st international conference on Very large data bases
n-gram/2L: a space and time efficient two-level n-gram inverted index structure
VLDB '05 Proceedings of the 31st international conference on Very large data bases
The intractability of computing the Hamming distance
Theoretical Computer Science
Exact matching of RNA secondary structure patterns
Theoretical Computer Science - Pattern discovery in the post genome
Relational clustering for multi-type entity resolution
MRDM '05 Proceedings of the 4th international workshop on Multi-relational mining
Automated cleansing for spend analytics
Proceedings of the 14th ACM international conference on Information and knowledge management
Fast Approximate Search in Large Dictionaries
Computational Linguistics
Named entity learning and verification: expectation maximization in large corpora
COLING-02 proceedings of the 6th conference on Natural language learning - Volume 20
Boosting precision and recall of dictionary-based protein name recognition
BioMed '03 Proceedings of the ACL 2003 workshop on Natural language processing in biomedicine - Volume 13
q-Gram Matching Using Tree Models
IEEE Transactions on Knowledge and Data Engineering
Integrating XML data sources using approximate joins
ACM Transactions on Database Systems (TODS)
Proceedings of the 2006 international workshop on Mining software repositories
Multilingual modeling of cross-lingual spelling variants
Information Retrieval
Approximate string matching using compressed suffix arrays
Theoretical Computer Science
A metric index for approximate string matching
Theoretical Computer Science
Speeding up transposition-invariant string matching
Information Processing Letters
Increased bit-parallelism for approximate and multiple string matching
Journal of Experimental Algorithmics (JEA)
An approximate multi-word matching algorithm for robust document retrieval
CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
A dictionary for approximate string search and longest prefix search
CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Feature-based similarity search in graph structures
ACM Transactions on Database Systems (TODS)
Duplicate Record Detection: A Survey
IEEE Transactions on Knowledge and Data Engineering
Content-adaptive digital music watermarking based on music structure analysis
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Collective entity resolution in relational data
ACM Transactions on Knowledge Discovery from Data (TKDD)
A programmable array processor architecture for flexible approximate string matching algorithms
Journal of Parallel and Distributed Computing
Marking musical dictations using the edit distance algorithm
Software—Practice & Experience
s-grams: Defining generalized n-grams for information retrieval
Information Processing and Management: an International Journal
Simulating regional medical record systems (student poster)
ACM-SE 45 Proceedings of the 45th annual southeast regional conference
Estimating the selectivity of approximate string queries
ACM Transactions on Database Systems (TODS)
Matching large schemas: Approaches and evaluation
Information Systems
Rotation and lighting invariant template matching
Information and Computation
Efficient generation of super condensed neighborhoods
Journal of Discrete Algorithms
A Normalized Levenshtein Distance Metric
IEEE Transactions on Pattern Analysis and Machine Intelligence
Data & Knowledge Engineering
Length-weighted string kernels for sequence data classification
Pattern Recognition Letters
Document image analysis for active reading
SADPI '07 Proceedings of the 2007 international workshop on Semantically aware document processing and indexing
Proceedings of the 2007 ACM symposium on Document engineering
Efficient query evaluation on probabilistic databases
The VLDB Journal — The International Journal on Very Large Data Bases
Journal of Systems Architecture: the EUROMICRO Journal
EXTRA: a system for example-based translation assistance
Machine Translation
Deterministic high-speed root-hashing automaton matching coprocessor for embedded network processor
ACM SIGARCH Computer Architecture News - Special issue on the 2006 reconfigurable and adaptive architecture workshop
Vector representations for efficient comparison and search for similar strings
Cybernetics and Systems Analysis
Structuring wiki revision history
Proceedings of the 2007 international symposium on Wikis
Journal of Discrete Algorithms
Efficient query evaluation on probabilistic databases
VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Probabilistic correlation-based similarity measure of unstructured records
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Extending q-grams to estimate selectivity of string matching with low edit distance
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Bit-parallel string matching under Hamming distance in O(n⌈m/w⌉) worst case time
Information Processing Letters
KDD Cup 2007 task 1 winner report
ACM SIGKDD Explorations Newsletter - Special issue on visual analytics
A parallel strategy for biological sequence alignment in restricted memory space
Journal of Parallel and Distributed Computing
Ontology-enhanced automatic chief complaint classification for syndromic surveillance
Journal of Biomedical Informatics
Grid's confidential outsourcing of string matching
SEPADS'07 Proceedings of the 6th WSEAS International Conference on Software Engineering, Parallel and Distributed Systems
A software system for gene sequence database construction based on fast approximate string matching
International Journal of Bioinformatics Research and Applications
Matchmaking and ranking of semantic web services using integrated service profile
International Journal of Metadata, Semantics and Ontologies
Detecting worm variants using machine learning
CoNEXT '07 Proceedings of the 2007 ACM CoNEXT conference
Processor array architectures for flexible approximate string matching
Journal of Systems Architecture: the EUROMICRO Journal
Social aspects of a continuous inspection platform for software source code
Proceedings of the 2008 international workshop on Cooperative and human aspects of software engineering
Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Approximate retrieval of XML data with ApproXPath
ADC '08 Proceedings of the nineteenth conference on Australasian database - Volume 75
Computational Biology and Chemistry
SEPIA: estimating selectivities of approximate string predicates in large Databases
The VLDB Journal — The International Journal on Very Large Data Bases
S2S: structural-to-syntactic matching similar documents
Knowledge and Information Systems
Summarization system evaluation revisited: N-gram graphs
ACM Transactions on Speech and Language Processing (TSLP)
Discovering regularities in biosequences: Challenges and applications
Journal of Computational Methods in Sciences and Engineering
Improving the bit-parallel NFA of Baeza-Yates and Navarro for approximate string matching
Information Processing Letters
On-Line Approximate String Matching with Bounded Errors
CPM '08 Proceedings of the 19th annual symposium on Combinatorial Pattern Matching
Efficient Similarity Search for Tree-Structured Data
SSDBM '08 Proceedings of the 20th international conference on Scientific and Statistical Database Management
Evaluating Performance and Quality of XML-Based Similarity Joins
ADBIS '08 Proceedings of the 12th East European conference on Advances in Databases and Information Systems
Learning Languages from Bounded Resources: The Case of the DFA and the Balls of Strings
ICGI '08 Proceedings of the 9th international colloquium on Grammatical Inference: Algorithms and Applications
Pattern Matching Techniques to Identify Syntactic Variations of Tags in Folksonomies
WSKS '08 Proceedings of the 1st world summit on The Knowledge Society: Emerging Technologies and Information Systems for the Knowledge Society
Learning Balls of Strings from Edit Corrections
The Journal of Machine Learning Research
Ed-Join: an efficient algorithm for similarity joins with edit distance constraints
Proceedings of the VLDB Endowment
Generating efficient safe query plans for probabilistic databases
Data & Knowledge Engineering
Semi-local longest common subsequences in subquadratic time
Journal of Discrete Algorithms
Incremental discovery of the irredundant motif bases for all suffixes of a string in O(n2logn) time
Theoretical Computer Science
A framework for recommending OLAP queries
Proceedings of the ACM 11th international workshop on Data warehousing and OLAP
A string matching approach for visual retrieval and classification
MIR '08 Proceedings of the 1st ACM international conference on Multimedia information retrieval
Fast and compact regular expression matching
Theoretical Computer Science
Alignment of biological sequences with quality scores
International Journal of Bioinformatics Research and Applications
Improving the space cost of k-NN search in metric spaces by using distance estimators
Multimedia Tools and Applications
Note: k-difference matching in amortized linear time for all the words in a text
Theoretical Computer Science
Artificial Intelligence in Medicine
A fast scalable automaton-matching accelerator for embedded content processors
ACM Transactions on Embedded Computing Systems (TECS)
Performance evaluation of similarity join for real time information integration
Proceedings of the 2nd Bangalore Annual Compute Conference
Sourcerer: mining and searching internet-scale software repositories
Data Mining and Knowledge Discovery
Processing of Korean Natural Language Queries Using Local Grammars
ICCPOL '09 Proceedings of the 22nd International Conference on Computer Processing of Oriental Languages. Language Technology for the Knowledge-based Economy
Nested Counters in Bit-Parallel String Matching
LATA '09 Proceedings of the 3rd International Conference on Language and Automata Theory and Applications
The design of a similarity based deduplication system
SYSTOR '09 Proceedings of SYSTOR 2009: The Israeli Experimental Systems Conference
Approximating edit distance in near-linear time
Proceedings of the forty-first annual ACM symposium on Theory of computing
Events discovery for personal video recorders
Proceedings of the seventh european conference on European interactive television conference
Efficient top-k algorithms for fuzzy search in string collections
Proceedings of the First International Workshop on Keyword Search on Structured Data
Proceedings of the twenty-eighth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Extending autocompletion to tolerate errors
Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
Efficient approximate entity extraction with edit distance constraints
Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
Intelligent hybrid approach to false identity detection
Proceedings of the 12th International Conference on Artificial Intelligence and Law
Parallel identification of the spelling variants in corpora
Proceedings of The Third Workshop on Analytics for Noisy Unstructured Text Data
A Tag Clustering Method to Deal with Syntactic Variations on Collaborative Social Networks
ICWE '9 Proceedings of the 9th International Conference on Web Engineering
Establishing Correspondences between Models with the Epsilon Comparison Language
ECMDA-FA '09 Proceedings of the 5th European Conference on Model Driven Architecture - Foundations and Applications
IEA/AIE '09 Proceedings of the 22nd International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems: Next-Generation Applied Intelligence
Robust similarity measures for named entities matching
COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
The Normalized Compression Distance as a Distance Measure in Entity Identification
ICDM '09 Proceedings of the 9th Industrial Conference on Advances in Data Mining. Applications and Theoretical Aspects
A discriminative candidate generator for string transformations
EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Recommending Multidimensional Queries
DaWaK '09 Proceedings of the 11th International Conference on Data Warehousing and Knowledge Discovery
From Nerode's congruence to suffix automata with mismatches
Theoretical Computer Science
Journal of Artificial Intelligence Research
Proceedings of the 21st International Conference on Association Francophone d'Interaction Homme-Machine
Average-optimal string matching
Journal of Discrete Algorithms
Improved approximate string matching and regular expression matching on Ziv-Lempel compressed texts
ACM Transactions on Algorithms (TALG)
Efficient algorithms for approximate member extraction using signature-based inverted lists
Proceedings of the 18th ACM conference on Information and knowledge management
Approximate string matching by combining automaton approach and binary neural networks
ASC '07 Proceedings of The Eleventh IASTED International Conference on Artificial Intelligence and Soft Computing
The pq-gram distance between ordered labeled trees
ACM Transactions on Database Systems (TODS)
Information extraction for search engines using fast heuristic techniques
Data & Knowledge Engineering
Assessment of approximate string matching in a biomedical text retrieval problem
Computers in Biology and Medicine
Efficient approximate search on string collections
Proceedings of the VLDB Endowment
An adaptive multi-policy grid service for biological sequence comparison
Journal of Parallel and Distributed Computing
EC-TEL '09 Proceedings of the 4th European Conference on Technology Enhanced Learning: Learning in the Synergy of Multiple Disciplines
An Automaton for Motifs Recognition in DNA Sequences
MICAI '09 Proceedings of the 8th Mexican International Conference on Artificial Intelligence
Interactive learning of the acoustic properties of household objects
ICRA'09 Proceedings of the 2009 IEEE international conference on Robotics and Automation
Fuzzy automata with ε-moves compute fuzzy measures between strings
Fuzzy Sets and Systems
Automatic generation of bid phrases for online advertising
Proceedings of the third ACM international conference on Web search and data mining
Filtering methods for content-based retrieval on indexed symbolic music databases
Information Retrieval
Maispion: a tool for analysing and visualising open source software developer communities
IWST '09 Proceedings of the International Workshop on Smalltalk Technologies
Algorithms for memory hierarchies: advanced lectures
Algorithms for memory hierarchies: advanced lectures
Average-optimal multiple approximate string matching
CPM'03 Proceedings of the 14th annual conference on Combinatorial pattern matching
Indexing structures for approximate string matching
CIAC'03 Proceedings of the 5th Italian conference on Algorithms and complexity
Similarity join in metric spaces
ECIR'03 Proceedings of the 25th European conference on IR research
Using similarity-based operations for resolving data-level conflicts
BNCOD'03 Proceedings of the 20th British national conference on Databases
An efficient algorithm for finding gene-specific probes for DNA microarrays
ISBRA'07 Proceedings of the 3rd international conference on Bioinformatics research and applications
Efficient and scalable indexing techniques for biological sequence data
BIRD'07 Proceedings of the 1st international conference on Bioinformatics research and development
Structural similarity between XML documents and DTDs
ICCS'03 Proceedings of the 2003 international conference on Computational science: PartIII
Shape recognition and retrieval: A structural approach using velocity function
CAIP'07 Proceedings of the 12th international conference on Computer analysis of images and patterns
Graph-based concept identification and disambiguation for enterprise search
Proceedings of the 19th international conference on World wide web
On the suffix automaton with mismatches
CIAA'07 Proceedings of the 12th international conference on Implementation and application of automata
Shape representation and classification using boundary radius function
ACCV'07 Proceedings of the 8th Asian conference on Computer vision - Volume Part II
Tuning approximate Boyer-Moore for gene sequences
SPIRE'07 Proceedings of the 14th international conference on String processing and information retrieval
Approximate string matching with Lempel-Ziv compressed indexes
SPIRE'07 Proceedings of the 14th international conference on String processing and information retrieval
Similarity joins of text with incomplete information formats
DASFAA'07 Proceedings of the 12th international conference on Database systems for advanced applications
A novel implementation of the FITE-TRT translation method
ECIR'08 Proceedings of the IR research, 30th European conference on Advances in information retrieval
Parallelisation of sequence comparison algorithms using hybridised parallel techniques
HONET'09 Proceedings of the 6th international conference on High capacity optical networks and enabling technologies
Improving OCR accuracy for classical critical editions
ECDL'09 Proceedings of the 13th European conference on Research and advanced technology for digital libraries
Linear-time protein 3-D structure searching with insertions and deletions
WABI'09 Proceedings of the 9th international conference on Algorithms in bioinformatics
String distances and uniformities
ICANNGA'09 Proceedings of the 9th international conference on Adaptive and natural computing algorithms
Allomorfessor: towards unsupervised morpheme analysis
CLEF'08 Proceedings of the 9th Cross-language evaluation forum conference on Evaluating systems for multilingual and multimodal information access
A stack decoder approach to approximate string matching
Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
A hash trie filter method for approximate string matching in genomic databases
Applied Intelligence
REAL: an efficient REad ALigner for next generation sequencing reads
Proceedings of the First ACM International Conference on Bioinformatics and Computational Biology
A fixed-parameter algorithm for string-to-string correction
CATS '10 Proceedings of the Sixteenth Symposium on Computing: the Australasian Theory - Volume 109
Estimating peer similarity using distance of shared files
IPTPS'10 Proceedings of the 9th international conference on Peer-to-peer systems
A filtering algorithm for k-mismatch with don't cares
Information Processing Letters
Element similarity measures in XML schema matching
Information Sciences: an International Journal
Supporting location-based approximate-keyword queries
Proceedings of the 18th SIGSPATIAL International Conference on Advances in Geographic Information Systems
Comparing canonicalizations of historical German text
SIGMORPHON '10 Proceedings of the 11th Meeting of the ACL Special Interest Group on Computational Morphology and Phonology
Travel route recommendation using geotags in photo sharing sites
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Approximate all-pairs suffix/prefix overlaps
CPM'10 Proceedings of the 21st annual conference on Combinatorial pattern matching
Prefix tree indexing for similarity search and similarity joins on genomic data
SSDBM'10 Proceedings of the 22nd international conference on Scientific and statistical database management
Code analyzer for an online course management system
Journal of Systems and Software
The longest common extension problem revisited and applications to approximate string searching
Journal of Discrete Algorithms
A concept hierarchy based ontology mapping approach
KSEM'10 Proceedings of the 4th international conference on Knowledge science, engineering and management
Exact and efficient proximity graph computation
ADBIS'10 Proceedings of the 14th east European conference on Advances in databases and information systems
AIMSA'10 Proceedings of the 14th international conference on Artificial intelligence: methodology, systems, and applications
Automated country name disambiguation for code set alignment
ECDL'10 Proceedings of the 14th European conference on Research and advanced technology for digital libraries
Website fingerprinting and identification using ordered feature sequences
ESORICS'10 Proceedings of the 15th European conference on Research in computer security
Unsupervised measures for parameter selection of binarization algorithms
Pattern Recognition
Disclosing false identity through hybrid link analysis
Artificial Intelligence and Law
Trie-join: efficient trie-based string similarity joins with edit-distance constraints
Proceedings of the VLDB Endowment
Nearest-neighbor guided evaluation of data reliability and its applications
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
An algorithm to solve the motif alignment problem for approximate nested tandem repeats
RECOMB-CG'10 Proceedings of the 2010 international conference on Comparative genomics
The gapped suffix array: a new index structure for fast approximate matching
SPIRE'10 Proceedings of the 17th international conference on String processing and information retrieval
Estimation of quality of service in spelling correction using Kullback-Leibler divergence
Expert Systems with Applications: An International Journal
TIDES--a new descriptor for time series oscillation behavior
Geoinformatica
Entity Resolution and Information Quality
Entity Resolution and Information Quality
Linguistically annotated reordering: Evaluation and analysis
Computational Linguistics
Analysis of techniques for building intrusion tolerant server systems
MILCOM'03 Proceedings of the 2003 IEEE conference on Military communications - Volume II
Indexing methods for approximate dictionary searching: Comparative analysis
Journal of Experimental Algorithmics (JEA)
String matching with inversions and translocations in linear average time (most of the time)
Information Processing Letters
Foundations and Trends in Databases
A novel fingerprint algorithm based on line-segment chain
MUSP'06 Proceedings of the 6th WSEAS international conference on Multimedia systems & signal processing
Species identification based on approximate matching
COMPUTE '11 Proceedings of the Fourth Annual ACM Bangalore Conference
Entering the circle of trust: developer initiation as committers in open-source projects
Proceedings of the 8th Working Conference on Mining Software Repositories
Efficient exact edit similarity query processing with the asymmetric signature scheme
Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
A semantic approach to ETL technologies
Data & Knowledge Engineering
Efficient similarity joins for near-duplicate detection
ACM Transactions on Database Systems (TODS)
Differential dependencies: Reasoning and discovery
ACM Transactions on Database Systems (TODS)
Ontology and instance matching
Knowledge-driven multimedia information extraction and ontology evolution
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
ACART: an API Compliance and Analysis Report Tool for discovering reference design traceability
Proceedings of the 49th Annual Southeast Regional Conference
Efficient matching of biological sequences allowing for non-overlapping inversions
CPM'11 Proceedings of the 22nd annual conference on Combinatorial pattern matching
Interactive object recognition using proprioceptive and auditory feedback
International Journal of Robotics Research
Scalable pattern search analysis
MCPR'11 Proceedings of the Third Mexican conference on Pattern recognition
Aircraft engine fleet monitoring using self-organizing maps and edit distance
WSOM'11 Proceedings of the 8th international conference on Advances in self-organizing maps
Comparators for compound object identification
RSFDGrC'11 Proceedings of the 13th international conference on Rough sets, fuzzy sets, data mining and granular computing
PG-join: proximity graph based string similarity joins
SSDBM'11 Proceedings of the 23rd international conference on Scientific and statistical database management
An android based medication reminder system: a concept analysis approach
ICCS'11 Proceedings of the 19th international conference on Conceptual structures for discovering knowledge
Compressed directed acyclic word graph with application in local alignment
COCOON'11 Proceedings of the 17th annual international conference on Computing and combinatorics
Heliza: talking dirty to the attackers
Journal in Computer Virology
User-assisted alignment of Arabic historical manuscripts
Proceedings of the 2011 Workshop on Historical Document Imaging and Processing
On-line approximate string matching with bounded errors
Theoretical Computer Science
Next best step and expert recommendation for collaborative processes in it service management
BPM'11 Proceedings of the 9th international conference on Business process management
Improved stable retrieval in noisy collections
ICTIR'11 Proceedings of the Third international conference on Advances in information retrieval theory
Two for the price of one: a model for parallel and incremental computation
Proceedings of the 2011 ACM international conference on Object oriented programming systems languages and applications
Continuously monitoring the correlations of massive discrete streams
Proceedings of the 20th ACM international conference on Information and knowledge management
Efficient similarity search: arbitrary similarity measures, arbitrary composition
Proceedings of the 20th ACM international conference on Information and knowledge management
PDFMeat: managing publications on the semantic desktop
Proceedings of the 20th ACM international conference on Information and knowledge management
Automatically estimating the incidence of symptoms recorded in GP free text notes
Proceedings of the first international workshop on Managing interoperability and complexity in health systems
Pass-join: a partition-based method for similarity joins
Proceedings of the VLDB Endowment
Enhancing trie-based syntactic pattern recognition using AI heuristic search strategies
ICAPR'05 Proceedings of the Third international conference on Advances in Pattern Recognition - Volume Part I
Extracting statistics indicators from tables of basic structure
Pattern Recognition and Image Analysis
A methodological contribution to music sequences analysis
ISMIS'06 Proceedings of the 16th international conference on Foundations of Intelligent Systems
All semi-local longest common subsequences in subquadratic time
CSR'06 Proceedings of the First international computer science conference on Theory and Applications
A dictionary-based approach to fast and accurate name matching in large law enforcement databases
ISI'06 Proceedings of the 4th IEEE international conference on Intelligence and Security Informatics
Structured data clouding across multiple webs
Information Systems
Dotted suffix trees a structure for approximate text indexing
SPIRE'06 Proceedings of the 13th international conference on String Processing and Information Retrieval
ICMLC'05 Proceedings of the 4th international conference on Advances in Machine Learning and Cybernetics
Multiple polyline to polygon matching
ISAAC'05 Proceedings of the 16th international conference on Algorithms and Computation
Efficient longest common subsequence computation using bulk-synchronous parallelism
ICCSA'06 Proceedings of the 2006 international conference on Computational Science and Its Applications - Volume Part V
An add-on to rule-based sifters for multi-recipient spam emails
NLDB'05 Proceedings of the 10th international conference on Natural Language Processing and Information Systems
Estimating recall and precision for vague queries in databases
CAiSE'05 Proceedings of the 17th international conference on Advanced Information Systems Engineering
Random access to grammar-compressed strings
Proceedings of the twenty-second annual ACM-SIAM symposium on Discrete Algorithms
CPM'05 Proceedings of the 16th annual conference on Combinatorial Pattern Matching
An efficient algorithm for generating super condensed neighborhoods
CPM'05 Proceedings of the 16th annual conference on Combinatorial Pattern Matching
Scalable distributed indexing and query processing over Linked Data
Web Semantics: Science, Services and Agents on the World Wide Web
Street address correction based on spelling techniques
BNCOD'05 Proceedings of the 22nd British National conference on Databases: enterprise, Skills and Innovation
DynMap: mapping short reads to multiple related genomes
Proceedings of the 2nd ACM Conference on Bioinformatics, Computational Biology and Biomedicine
On bit-parallel processing of multi-byte text
AIRS'04 Proceedings of the 2004 international conference on Asian Information Retrieval Technology
New bit-parallel indel-distance algorithm
WEA'05 Proceedings of the 4th international conference on Experimental and Efficient Algorithms
Approximate all-pairs suffix/prefix overlaps
Information and Computation
Video clip matching using MPEG-7 descriptors and edit distance
CIVR'06 Proceedings of the 5th international conference on Image and Video Retrieval
Data cleaning and transformation using the AJAX framework
GTTSE'05 Proceedings of the 2005 international conference on Generative and Transformational Techniques in Software Engineering
Unified view of backward backtracking in short read mapping
Algorithms and Applications
MDSM: Microarray database schema matching using the Hungarian method
Information Sciences: an International Journal
Multiple valued logic approach for matching patient records in multiple databases
Journal of Biomedical Informatics
Towards a process model for identifying knowledge-related structures in product data
PAKM'06 Proceedings of the 6th international conference on Practical Aspects of Knowledge Management
Faster generation of super condensed neighbourhoods using finite automata
SPIRE'05 Proceedings of the 12th international conference on String Processing and Information Retrieval
Restricted transposition invariant approximate string matching under edit distance
SPIRE'05 Proceedings of the 12th international conference on String Processing and Information Retrieval
Practical and optimal string matching
SPIRE'05 Proceedings of the 12th international conference on String Processing and Information Retrieval
Information retrieval of sequential data in heterogeneous XML databases
AMR'05 Proceedings of the Third international conference on Adaptive Multimedia Retrieval: user, context, and feedback
Mining repositories to reveal the community structures of open source software projects
Proceedings of the 50th Annual Southeast Regional Conference
Cross-language information retrieval with latent topic models trained on a comparable corpus
AIRS'11 Proceedings of the 7th Asia conference on Information Retrieval Technology
Automated web application testing using search based software engineering
ASE '11 Proceedings of the 2011 26th IEEE/ACM International Conference on Automated Software Engineering
Can we beat the prefix filtering?: an adaptive framework for similarity join and search
SIGMOD '12 Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data
Querying event sequences by exact match or similarity search: Design and empirical evaluation
Interacting with Computers
Fast and cache-oblivious dynamic programming with local dependencies
LATA'12 Proceedings of the 6th international conference on Language and Automata Theory and Applications
Approximate regular expressions and their derivatives
LATA'12 Proceedings of the 6th international conference on Language and Automata Theory and Applications
Complete-Thread extraction from web forums
APWeb'12 Proceedings of the 14th Asia-Pacific international conference on Web Technologies and Applications
Charge and reduce: A fixed-parameter algorithm for String-to-String Correction
Discrete Optimization
Uniting formal and informal descriptive power: Reconciling ontologies with folksonomies
International Journal of Information Management: The Journal for Information Professionals
WSM: a novel algorithm for subgraph matching in large weighted graphs
Journal of Intelligent Information Systems
ASTERIX: scalable warehouse-style web data integration
Proceedings of the Ninth International Workshop on Information Integration on the Web
Directing gaze in narrative art
Proceedings of the ACM Symposium on Applied Perception
A framework for robust discovery of entity synonyms
Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
Recognising sentence similarity using similitude and dissimilarity features
International Journal of Advanced Intelligence Paradigms
Incremental set recommendation based on class differences
PAKDD'12 Proceedings of the 16th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part I
The smoothed complexity of edit distance
ACM Transactions on Algorithms (TALG)
Hybrid Matching Algorithm for Personal Names
Journal of Data and Information Quality (JDIQ)
Can I clone this piece of code here?
Proceedings of the 27th IEEE/ACM International Conference on Automated Software Engineering
Efficient similarity search in very large string sets
SSDBM'12 Proceedings of the 24th international conference on Scientific and Statistical Database Management
Assessing the effort of repairing the accessibility of web sites
ICCHP'12 Proceedings of the 13th international conference on Computers Helping People with Special Needs - Volume Part I
The MADlib analytics library: or MAD skills, the SQL
Proceedings of the VLDB Endowment
Super-Linear indices for approximate dictionary searching
SISAP'12 Proceedings of the 5th international conference on Similarity Search and Applications
Journal of Biomedical Informatics
Fast Multipattern Search Algorithms for Intrusion Detection
Fundamenta Informaticae - Computing Patterns in Strings
Better Filtering with Gapped q-Grams
Fundamenta Informaticae - Computing Patterns in Strings
Touching from a distance: website fingerprinting attacks and defenses
Proceedings of the 2012 ACM conference on Computer and communications security
An unsupervised and data-driven approach for spell checking in Vietnamese OCR-scanned texts
HYBRID '12 Proceedings of the Workshop on Innovative Hybrid Approaches to the Processing of Textual Data
WHAM: A High-Throughput Sequence Alignment Method
ACM Transactions on Database Systems (TODS)
AKBC-WEKEX '12 Proceedings of the Joint Workshop on Automatic Knowledge Base Construction and Web-scale Knowledge Extraction
Incremental discovery of irredundant motif bases in time O(|Σ|n2 log n)
WABI'07 Proceedings of the 7th international conference on Algorithms in Bioinformatics
Improved approximate string matching and regular expression matching on Ziv-Lempel compressed texts
CPM'07 Proceedings of the 18th annual conference on Combinatorial Pattern Matching
Optimal offline extraction of irredundant motif bases
COCOON'07 Proceedings of the 13th annual international conference on Computing and Combinatorics
Improving XML instances comparison with preprocessing algorithms
DEXA'07 Proceedings of the 18th international conference on Database and Expert Systems Applications
Efficient indexing algorithms for approximate pattern matching in text
Proceedings of the Seventeenth Australasian Document Computing Symposium
Approximate regional sequence matching for genomic databases
The VLDB Journal — The International Journal on Very Large Data Bases
ER'12 Proceedings of the 31st international conference on Conceptual Modeling
Sequential pattern mining -- approaches and algorithms
ACM Computing Surveys (CSUR)
Super-resolution of single text image by sparse representation
Proceeding of the workshop on Document Analysis and Recognition
Linked data classification: a feature-based approach
Proceedings of the Joint EDBT/ICDT 2013 Workshops
Approximate string matching by position restricted alignment
Proceedings of the Joint EDBT/ICDT 2013 Workshops
FPI: a novel indexing method using frequent patterns for approximate string searches
Proceedings of the Joint EDBT/ICDT 2013 Workshops
Efficient fuzzy search in large text collections
ACM Transactions on Information Systems (TOIS)
Comparable dependencies over heterogeneous data
The VLDB Journal — The International Journal on Very Large Data Bases
PartSS: an efficient partition-based filtering for edit distance constraints
ADC '11 Proceedings of the Twenty-Second Australasian Database Conference - Volume 115
Efficient top-k algorithms for approximate substring matching
Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
Efficient string-matching allowing for non-overlapping inversions
Theoretical Computer Science
A taxonomy of privacy-preserving record linkage techniques
Information Systems
Normalised LCS-based method for indexing multidimensional data cube
International Journal of Intelligent Information and Database Systems
PEDIVHANDI: multimodal indexation and retrieval system for lecture videos
ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part II
Clustering genome data based on approximate matching
International Journal of Data Analysis Techniques and Strategies
A partition-based method for string similarity joins with edit-distance constraints
ACM Transactions on Database Systems (TODS)
Optimal hashing schemes for entity matching
Proceedings of the 22nd international conference on World Wide Web
A comparison of identity merge algorithms for software repositories
Science of Computer Programming
A distributed framework for scaling Up LSH-based computations in privacy preserving record linkage
Proceedings of the 6th Balkan Conference in Informatics
A new non-exact aho-corasick framework for ECG classification
ACM SIGARCH Computer Architecture News
Reusability of open-source program code: a conceptual model and empirical investigation
ACM SIGSOFT Software Engineering Notes
Mining entity attribute synonyms via compact clustering
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Asymmetric signature schemes for efficient exact edit similarity query processing
ACM Transactions on Database Systems (TODS)
Discovering attribute and entity synonyms for knowledge integration and semantic web search
Proceedings of the 3rd International Workshop on Semantic Search Over the Web
Evaluating the acceleration of typical scientific problems on the GPU
Proceedings of the South African Institute for Computer Scientists and Information Technologists Conference
Semantic content-based recommendation of software services using context
ACM Transactions on the Web (TWEB)
Detecting intrusions in encrypted control traffic
Proceedings of the first ACM workshop on Smart energy grid security
Effectiveness of an implementation method for retrieving similar strings by trie structures
International Journal of Computer Applications in Technology
An error tolerant memory aid for reduced cognitive load in number copying tasks
UAHCI'13 Proceedings of the 7th international conference on Universal Access in Human-Computer Interaction: user and context diversity - Volume 2
Editorial: Efficient discovery of similarity constraints for matching dependencies
Data & Knowledge Engineering
A Comparison of String Similarity Measures for Toponym Matching
Proceedings of The First ACM SIGSPATIAL International Workshop on Computational Models of Place
RCSI: scalable similarity search in thousand(s) of genomes
Proceedings of the VLDB Endowment
On repairing structural problems in semi-structured data
Proceedings of the VLDB Endowment
Text searching allowing for inversions and translocations of factors
Discrete Applied Mathematics
Proceedings of the 12th Brazilian Symposium on Human Factors in Computing Systems
Clustering with Proximity Graphs: Exact and Efficient Algorithms
International Journal of Knowledge-Based Organizations
Efficient indexing techniques for record matching and deduplication
International Journal of Computational Vision and Robotics
Journal of Information Science
Deduplication of metadata harvested from Open Archives Initiative repositories
Information Services and Use - Mining the Digital Information Networks
Transform invariant text extraction
The Visual Computer: International Journal of Computer Graphics
On a compact encoding of the swap automaton
Information Processing Letters
Hi-index | 0.01 |
We survey the current techniques to cope with the problem of string matching that allows errors. This is becoming a more and more relevant issue for many fast growing areas such as information retrieval and computational biology. We focus on online searching and mostly on edit distance, explaining the problem and its relevance, its statistical behavior, its history and current developments, and the central ideas of the algorithms and their complexities. We present a number of experiments to compare the performance of the different algorithms and show which are the best choices. We conclude with some directions for future work and open problems.