Spatial tessellations: concepts and applications of Voronoi diagrams
Spatial tessellations: concepts and applications of Voronoi diagrams
LH*—a scalable, distributed data structure
ACM Transactions on Database Systems (TODS)
Multidimensional access methods
ACM Computing Surveys (CSUR)
The small-world phenomenon: an algorithmic perspective
STOC '00 Proceedings of the thirty-second annual ACM symposium on Theory of computing
Reconciling schemas of disparate data sources: a machine-learning approach
SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Chord: A scalable peer-to-peer lookup service for internet applications
Proceedings of the 2001 conference on Applications, technologies, architectures, and protocols for computer communications
A scalable content-addressable network
Proceedings of the 2001 conference on Applications, technologies, architectures, and protocols for computer communications
ACM Computing Surveys (CSUR)
Near Neighbor Search in Large Metric Spaces
VLDB '95 Proceedings of the 21th International Conference on Very Large Data Bases
Searching in metric spaces by spatial approximation
The VLDB Journal — The International Journal on Very Large Data Bases
SETS: search enhanced by topic segmentation
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Brief announcement: efficient flooding in power-law networks
Proceedings of the twenty-second annual symposium on Principles of distributed computing
P-Grid: a self-organizing structured P2P system
ACM SIGMOD Record
Querying the internet with PIER
VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
An efficient nearest neighbor algorithm for P2P settings
dg.o '05 Proceedings of the 2005 national conference on Digital government research
Building and Querying a P2P Virtual World
Geoinformatica
M-Chord: a scalable distributed similarity search structure
InfoScale '06 Proceedings of the 1st international conference on Scalable information systems
On scalability of the similarity search in the world of peers
InfoScale '06 Proceedings of the 1st international conference on Scalable information systems
Nearest neighbor search in metric spaces through Content-Addressable Networks
Information Processing and Management: an International Journal
Using a distributed quadtree index in peer-to-peer networks
The VLDB Journal — The International Journal on Very Large Data Bases
SCAN: a small-world structured p2p overlay for multi-dimensional queries
Proceedings of the 16th international conference on World Wide Web
Proceedings of the international workshop on Workshop on multimedia information retrieval
Nearest neighbor search in metric spaces through Content-Addressable Networks
Information Processing and Management: an International Journal
Peer-to-peer similarity search in metric spaces
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Hierarchical multidimensional search in peer-to-peer networks
Computer Communications
Range queries on structured overlay networks
Computer Communications
Modelling P2P data networks under complex system theory
International Journal of Computational Science and Engineering
Scalability comparison of Peer-to-Peer similarity search structures
Future Generation Computer Systems
Scalable community-driven data sharing in e-science grids
Future Generation Computer Systems
Distributed similarity search in high dimensions using locality sensitive hashing
Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
Approximate voronoi cell computation on spatial data streams
The VLDB Journal — The International Journal on Very Large Data Bases
SiMPSON: Efficient Similarity Search in Metric Spaces over P2P Structured Overlay Networks
Euro-Par '09 Proceedings of the 15th International Euro-Par Conference on Parallel Processing
Efficient range query processing in metric spaces over highly distributed data
Distributed and Parallel Databases
Challenges in Personalizing and Decentralizing the Web: An Overview of GOSSPLE
SSS '09 Proceedings of the 11th International Symposium on Stabilization, Safety, and Security of Distributed Systems
Querying similarity in metric social networks
NBiS'07 Proceedings of the 1st international conference on Network-based information systems
A content-addressable network for similarity search in metric spaces
DBISP2P'05/06 Proceedings of the 2005/2006 international conference on Databases, information systems, and peer-to-peer computing
Enabling portability in advanced information-centric services over structured peer-to-peer systems
Journal of Network and Computer Applications
The GOSSPLE anonymous social network
Proceedings of the ACM/IFIP/USENIX 11th International Conference on Middleware
P2P-based multidimensional indexing methods: A survey
Journal of Systems and Software
Clustering peers based on contents for efficient similarity search
DASFAA'06 Proceedings of the 11th international conference on Database Systems for Advanced Applications
Modelling peer-to-peer data networks under complex system theory
DNIS'05 Proceedings of the 4th international conference on Databases in Networked Information Systems
Metric-Based similarity search in unstructured peer-to-peer systems
Transactions on Large-Scale Data- and Knowledge-Centered Systems V
HyperDex: a distributed, searchable key-value store
Proceedings of the ACM SIGCOMM 2012 conference on Applications, technologies, architectures, and protocols for computer communication
HyperDex: a distributed, searchable key-value store
ACM SIGCOMM Computer Communication Review - Special october issue SIGCOMM '12
A structured overlay for multi-dimensional range queries
Euro-Par'07 Proceedings of the 13th international Euro-Par conference on Parallel Processing
Hi-index | 0.00 |
Peer-to-peer Data Networks (PDNs) are large-scale, self-organizing, distributed query processing systems. Familiar examples of PDN are peer-to-peer file-sharing networks, which support exact-match search queries to locate user-requested files. In this paper, we formalize the more general problem of similarity-search in PDNs, and propose a family of distributed access methods, termed Small-World Access Methods (SWAM), for efficient execution of various similarity-search queries, namely exact-match, range, and k-nearest-neighbor queries. Unlike its predecessors, i.e., LH* and DHTs, SWAM does not control the assignment of data objects to PDN nodes; each node autonomously stores its own data. Besides, SWAM supports all similarity-search queries on multiple attributes. SWAM guarantees that the query object will be found (if it exists in the network) in average time logarithmically proportional to the network size. Moreover, once the query object is found, all the similar objects would be in its proximate network neighborhood and hence enabling efficient range and k-nearest-neighbor queries. As a specific instance of SWAM, we propose SWAM-V, a Voronoi-based SWAM that indexes PDNs with multi-attribute data objects. For a PDN with N nodes SWAM-V has query time, communication cost, and computation cost of O(log N) for exact-match queries, and O(log N + sN) and O(log N + k) for range queries (with selectivity s) and kNN queries, respectively. Our experiments show that SWAM-V consistently outperforms a similarity-search enabled version of CAN in query time and communication cost by a factor of 2 to 3.