With the commoditization of digital devices, personal information and media sharing is becoming a key application on the pervasive Web. In this context, data annotation rather than data production is the main bottleneck. Metadata scarcity is a major obstacle to efficient information processing in large and heterogeneous communities. However, social communities also open the door to new ways of addressing local metadata scarcity by taking advantage of global collections of resources. We propose to tackle the lack of metadata in large-scale distributed systems through a collaborative process that leverages both content and metadata. We develop a community-based, self-organizing system called PicShark in which information entropy, in the sense of missing metadata, is gradually reduced through decentralized instance and schema matching. Our approach focuses on semi-structured metadata and confines computationally expensive operations to the edge of the network, while keeping distributed operations as simple as possible to ensure scalability. PicShark builds on structured peer-to-peer networks for distributed look-up operations, but extends the application of self-organization principles to the propagation of metadata and the creation of schema mappings. We demonstrate the practical applicability of our method in an image sharing scenario and provide experimental evidence illustrating the validity of our approach.