Phonetic string matching: lessons from information retrieval
SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
Recursive hashing functions for n-grams
ACM Transactions on Information Systems (TOIS)
q-gram based database searching using a suffix array (QUASAR)
RECOMB '99 Proceedings of the third annual international conference on Computational molecular biology
Two and higher dimensional pattern matching in optimal expected time
SODA '94 Proceedings of the fifth annual ACM-SIAM symposium on Discrete algorithms
A fast bit-vector algorithm for approximate string matching based on dynamic programming
Journal of the ACM (JACM)
Melodic matching techniques for large music databases
MULTIMEDIA '99 Proceedings of the seventh ACM international conference on Multimedia (Part 1)
The interlace polynomial: a new graph polynomial
SODA '00 Proceedings of the eleventh annual ACM-SIAM symposium on Discrete algorithms
A guided tour to approximate string matching
ACM Computing Surveys (CSUR)
New and faster filters for multiple approximate string matching
Random Structures & Algorithms
Database indexing for large DNA and protein sequence collections
The VLDB Journal — The International Journal on Very Large Data Bases
A Database Index to Large Biological Sequences
Proceedings of the 27th International Conference on Very Large Data Bases
Approximate String Joins in a Database (Almost) for Free
Proceedings of the 27th International Conference on Very Large Data Bases
Searching Large Lexicons for Partially Specified Terms using Compressed Inverted Files
VLDB '93 Proceedings of the 19th International Conference on Very Large Data Bases
Computing the Threshold for q-Gram Filters
SWAT '02 Proceedings of the 8th Scandinavian Workshop on Algorithm Theory
Near Neighbor Search in Large Metric Spaces
VLDB '95 Proceedings of the 21th International Conference on Very Large Data Bases
Exact and Efficient Computation of the Expected Number of Missing and Common Words in Random Texts
COM '00 Proceedings of the 11th Annual Symposium on Combinatorial Pattern Matching
Better Filtering with Gapped q-Grams
CPM '01 Proceedings of the 12th Annual Symposium on Combinatorial Pattern Matching
One-Gapped q-Gram Filtersfor Levenshtein Distance
CPM '02 Proceedings of the 13th Annual Symposium on Combinatorial Pattern Matching
Accelerating Approximate Subsequence Search on Large Protein Sequence Databases
CSB '02 Proceedings of the IEEE Computer Society Conference on Bioinformatics
Better filtering with gapped q-grams
Fundamenta Informaticae - Special issue on computing patterns in strings
Combinatorics of periods in strings
Journal of Combinatorial Theory Series A
A System for Multiattribute Drug Product Comparison
Journal of Medical Systems
XML stream processing using tree-edit distance embeddings
ACM Transactions on Database Systems (TODS) - Special Issue: SIGMOD/PODS 2003
Comparing inverted files and signature files for searching a large lexicon
Information Processing and Management: an International Journal - Special issue: Cross-language information retrieval
Similarity evaluation on tree-structured data
Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Substructure similarity search in graph databases
Proceedings of the 2005 ACM SIGMOD international conference on Management of data
The interlace polynomial of a graph
Journal of Combinatorial Theory Series B - Special issue dedicated to professor W. T. Tutte
Approximate matching of hierarchical data using pq-grams
VLDB '05 Proceedings of the 31st international conference on Very large data bases
Fast Approximate Search in Large Dictionaries
Computational Linguistics
q-Gram Matching Using Tree Models
IEEE Transactions on Knowledge and Data Engineering
siRNA off-target search: a hybrid q-gram based filtering approach
Proceedings of the 5th international workshop on Bioinformatics
An incrementally maintainable index for approximate lookups in hierarchical data
VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Multi-column substring matching for database schema translation
VLDB '06 Proceedings of the 32nd international conference on Very large data bases
An approximate multi-word matching algorithm for robust document retrieval
CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Feature-based similarity search in graph structures
ACM Transactions on Database Systems (TODS)
Duplicate Record Detection: A Survey
IEEE Transactions on Knowledge and Data Engineering
Identification of confusable drug names: a new approach and evaluation methodology
COLING '04 Proceedings of the 20th international conference on Computational Linguistics
HLT-NAACL '06 Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics
s-grams: Defining generalized n-grams for information retrieval
Information Processing and Management: an International Journal
Indexing schemes for similarity search in datasets of short protein fragments
Information Systems
Vector representations for efficient comparison and search for similar strings
Cybernetics and Systems Analysis
Stemming Indonesian: A confix-stripping approach
ACM Transactions on Asian Language Information Processing (TALIP)
Probabilistic correlation-based similarity measure of unstructured records
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Compacting music signatures for efficient music retrieval
EDBT '08 Proceedings of the 11th international conference on Extending database technology: Advances in database technology
Matchmaking and ranking of semantic web services using integrated service profile
International Journal of Metadata, Semantics and Ontologies
Substructure similarity measurement in chinese recipes
Proceedings of the 17th international conference on World Wide Web
Finite automata for testing composition-based reconstructibility of sequences
Journal of Computer and System Sciences
Hardness of optimal spaced seed design
Journal of Computer and System Sciences
Summarization system evaluation revisited: N-gram graphs
ACM Transactions on Speech and Language Processing (TSLP)
On-Line Approximate String Matching with Bounded Errors
CPM '08 Proceedings of the 19th annual symposium on Combinatorial Pattern Matching
Efficient Similarity Search for Tree-Structured Data
SSDBM '08 Proceedings of the 20th international conference on Scientific and Statistical Database Management
A Comparative Evaluation of XML Difference Algorithms with Genomic Data
SSDBM '08 Proceedings of the 20th international conference on Scientific and Statistical Database Management
Application of q-Gram Distance in Digital Forensic Search
IWCF '08 Proceedings of the 2nd international workshop on Computational Forensics
Evaluating Performance and Quality of XML-Based Similarity Joins
ADBIS '08 Proceedings of the 12th East European conference on Advances in Databases and Information Systems
Similarity of Names Across Scripts: Edit Distance Using Learned Costs of N-Grams
GoTAL '08 Proceedings of the 6th international conference on Advances in Natural Language Processing
Comparison of s-gram Proximity Measures in Out-of-Vocabulary Word Translation
SPIRE '08 Proceedings of the 15th International Symposium on String Processing and Information Retrieval
Sibling Distance for Rooted Labeled Trees
New Frontiers in Applied Data Mining
Sourcerer: mining and searching internet-scale software repositories
Data Mining and Knowledge Discovery
Lemmatization of Polish person names
ACL '07 Proceedings of the Workshop on Balto-Slavonic Natural Language Processing: Information Extraction and Enabling Technologies
Finding variants of out-of-vocabulary words in Arabic
Semitic '07 Proceedings of the 2007 Workshop on Computational Approaches to Semitic Languages: Common Issues and Resources
The pq-gram distance between ordered labeled trees
ACM Transactions on Database Systems (TODS)
MaSiMe: A Customized Similarity Measure and Its Application for Tag Cloud Refactoring
OTM '09 Proceedings of the Confederated International Workshops and Posters on On the Move to Meaningful Internet Systems: ADI, CAMS, EI2N, ISDE, IWSSA, MONET, OnToContent, ODIS, ORM, OTM Academy, SWWS, SEMELS, Beyond SAWSDL, and COMBEK 2009
An incremental clustering scheme for data de-duplication
Data Mining and Knowledge Discovery
Subsequent patient visit detection in a high volume OPD using record linkage techniques
Proceedings of the Third Annual ACM Bangalore Conference
String distance metrics for reference matching and search query correction
BIS'07 Proceedings of the 10th international conference on Business information systems
N-gram analysis based on zero-suppressed BDDs
JSAI'06 Proceedings of the 20th annual conference on New frontiers in artificial intelligence
Fast search algorithms for position specific scoring matrices
BIRD'07 Proceedings of the 1st international conference on Bioinformatics research and development
A hash trie filter method for approximate string matching in genomic databases
Applied Intelligence
An efficient duplicate record detection using q-grams array inverted index
DaWaK'10 Proceedings of the 12th international conference on Data warehousing and knowledge discovery
Multimodal sn,k-grams: a skipping-based similarity model in information retrieval
ACIIDS'10 Proceedings of the Second international conference on Intelligent information and database systems: Part I
Finding Significant Matches of Position Weight Matrices in Linear Time
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Generalised Sequence Signatures through symbolic clustering
International Journal of Data Mining and Bioinformatics
A more specific events classification to improve crawling techniques
OTM'10 Proceedings of the 2010 international conference on On the move to meaningful internet systems
Foundations and Trends in Databases
PG-join: proximity graph based string similarity joins
SSDBM'11 Proceedings of the 23rd international conference on Scientific and statistical database management
On-line approximate string matching with bounded errors
Theoretical Computer Science
An improved sequential clustering algorithm
AICI'11 Proceedings of the Third international conference on Artificial intelligence and computational intelligence - Volume Part I
A publication process model to enable privacy-aware data sharing
IBM Journal of Research and Development
Selecting Oligonucleotide Probes for Whole-Genome Tiling Arrays with a Cross-Hybridization Potential
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Pattern occurrences in multicomponent models
STACS'05 Proceedings of the 22nd annual conference on Theoretical Aspects of Computer Science
Efficient q-gram filters for finding all ε-matches over a given length
RECOMB'05 Proceedings of the 9th Annual international conference on Research in Computational Molecular Biology
The q-gram distance for ordered unlabeled trees
DS'05 Proceedings of the 8th international conference on Discovery Science
Maximal words in sequence comparisons based on subword composition
Algorithms and Applications
Approximate string matching with reduced alphabet
Algorithms and Applications
N-gram similarity and distance
SPIRE'05 Proceedings of the 12th international conference on String Processing and Information Retrieval
Efficient searching top-k semantic similar words
IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
Towards efficient similar sentences extraction
IDEAL'12 Proceedings of the 13th international conference on Intelligent Data Engineering and Automated Learning
Better Filtering with Gapped q-Grams
Fundamenta Informaticae - Computing Patterns in Strings
A comparison of index-based lempel-Ziv LZ77 factorization algorithms
ACM Computing Surveys (CSUR)
Journal of Combinatorial Theory Series B
A survey of query-by-humming similarity methods
Proceedings of the 5th International Conference on PErvasive Technologies Related to Assistive Environments
On multiset of factors of a word
Information Processing Letters
Indexing dataspaces with partitions
World Wide Web
A two-phase algorithm for mining sequential patterns with differential privacy
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Genomic Sequence Fragment Identification using Quasi-Alignment
Proceedings of the International Conference on Bioinformatics, Computational Biology and Biomedical Informatics
Similarity evaluation in XML schema and XLink
Proceedings of the 19th Brazilian symposium on Multimedia and the web
Clustering with Proximity Graphs: Exact and Efficient Algorithms
International Journal of Knowledge-Based Organizations
Hi-index | 0.00 |