The connection machine
Parallel free-text search on the connection machine system
Communications of the ACM - Special issue on parallelism
The multimedia object presentation manager of MINOS: a symmetric approach
SIGMOD '86 Proceedings of the 1986 ACM SIGMOD international conference on Management of data
Signature files: an access method for documents and its analytical performance evaluation
ACM Transactions on Information Systems (TOIS)
ACM Transactions on Information Systems (TOIS)
Laser optical disk: the coming revolution in on-line storage
Communications of the ACM
Partial-match retrieval using indexed descriptor files
Communications of the ACM
A fast string searching algorithm
Communications of the ACM
Efficient string matching: an aid to bibliographic search
Communications of the ACM
Elements of the randomized combinatorial file structure
SIGIR '71 Proceedings of the 1971 international ACM SIGIR conference on Information storage and retrieval
Information Retrieval
Introduction to Modern Information Retrieval
Introduction to Modern Information Retrieval
A Multimedia Office Filing System
VLDB '83 Proceedings of the 9th International Conference on Very Large Data Bases
A Method for Speeding Up Text Retrieval
Databases for Business and Office Applications, Database Week
Associative/parallel processors for searching very large textual data bases
CAW '77 Proceedings of the 3rd workshop on Computer architecture : Non-numeric processing
Implementing ranking strategies using text signatures
ACM Transactions on Information Systems (TOIS)
Partitioned signature files: design issues and performance evaluation
ACM Transactions on Information Systems (TOIS)
File organizations and access methods for CLV disks
SIGIR '89 Proceedings of the 12th annual international ACM SIGIR conference on Research and development in information retrieval
Multikey access methods based on term discrimination and signature clustering
SIGIR '89 Proceedings of the 12th annual international ACM SIGIR conference on Research and development in information retrieval
Office documents on a database kernel—filing, retrieval, and archiving
COCS '90 Proceedings of the ACM SIGOIS and IEEE CS TC-OA conference on Office information systems
A dynamic signature technique for multimedia databases
SIGIR '90 Proceedings of the 13th annual international ACM SIGIR conference on Research and development in information retrieval
Dynamic partitioning of signature files
ACM Transactions on Information Systems (TOIS)
Optimal weight assignment for signature generation
ACM Transactions on Database Systems (TODS)
Frame-sliced partitioned parallel signature files
SIGIR '92 Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval
Estimating accesses in partitioned signature file organizations
ACM Transactions on Information Systems (TOIS)
Evaluation of signature files as set access facilities in OODBs
SIGMOD '93 Proceedings of the 1993 ACM SIGMOD international conference on Management of data
A new character-based indexing method using frequency data for Japanese documents
SIGIR '95 Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval
ACM Transactions on Information Systems (TOIS)
A new method for similarity indexing of market basket data
SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
Accessing data cubes along complex dimensions
Proceedings of the 2nd ACM international workshop on Data warehousing and OLAP
Intensive Data Management in Parallel Systems: A Survey
Distributed and Parallel Databases
Superimposing codes representing hierarchical information in web directories
Proceedings of the 3rd international workshop on Web information and data management
IEEE Transactions on Knowledge and Data Engineering
Efficient Signature File Methods for Text Retrieval
IEEE Transactions on Knowledge and Data Engineering
VLDB '98 Proceedings of the 24rd International Conference on Very Large Data Bases
VLDB '88 Proceedings of the 14th International Conference on Very Large Data Bases
Browsing Electronic Mail: Experiences Interfacing a Mail System to a DBMS
VLDB '88 Proceedings of the 14th International Conference on Very Large Data Bases
Fast Text Access Methods for Optical and Large Magnetic Disks: Designs and Performance Comparison
VLDB '88 Proceedings of the 14th International Conference on Very Large Data Bases
Hamming Filters: A Dynamic Signature File Organization for Parallel Stores
VLDB '93 Proceedings of the 19th International Conference on Very Large Data Bases
Signature File Methods for Semantic Query Caching
ECDL '98 Proceedings of the Second European Conference on Research and Advanced Technology for Digital Libraries
Semantic caching of Web queries
The VLDB Journal — The International Journal on Very Large Data Bases
New Access Index for Fast Execution of Conjunctive Queries over Text Data
IDEAS '99 Proceedings of the 1999 International Symposium on Database Engineering & Applications
Comparing inverted files and signature files for searching a large lexicon
Information Processing and Management: an International Journal - Special issue: Cross-language information retrieval
Efficient in-memory extensible inverted file
Information Systems
A text retrieval package for the unix operating system
USTC'94 Proceedings of the USENIX Summer 1994 Technical Conference on USENIX Summer 1994 Technical Conference - Volume 1
Indexing time series using signatures
Intelligent Data Analysis
On the SD-tree construction for optimal signature operations
COMPUTE '08 Proceedings of the 1st Bangalore Annual Compute Conference
Optimization of restricted searches in web directories using hybrid data structures
ECIR'03 Proceedings of the 25th European conference on IR research
A constraint-based tool for data integrity management on the web
Proceedings of the 4th International Conference on Uniquitous Information Management and Communication
XML-based e-barter system for circular supply exchange
DEXA'05 Proceedings of the 16th international conference on Database and Expert Systems Applications
The optimal size of a signature
Mathematical and Computer Modelling: An International Journal
Hi-index | 0.00 |
Signature files have attracted a lot of interest as an access method for text and specifically for messages in the office environment. Messages are stored sequentially in the message file, whereas their hash-coded abstractions (signatures) are stored sequentially in the signature file. To answer a query, the signature file is examined first, and many nonqualifying messages are immediately rejected. In this paper we examine the problem of designing signature extraction methods and studying their performance. We describe two old methods, generalize another one, and propose a new method and its variation. We provide exact and approximate formulas for the dependency between the false drop probability and the signature size for all the methods, and we show that the proposed method (VBC) achieves approximately ten times smaller false drop probability than the old methods, whereas it is well suited for collections of documents with variable document sizes.