The TSIMMIS Approach to Mediation: Data Models and Languages
Journal of Intelligent Information Systems - Special issue: next generation information technologies and systems
EDUTELLA: a P2P networking infrastructure based on RDF
Proceedings of the 11th international conference on World Wide Web
Scaling Access to Heterogeneous Data Sources with DISCO
IEEE Transactions on Knowledge and Data Engineering
Querying Heterogeneous Information Sources Using Source Descriptions
VLDB '96 Proceedings of the 22th International Conference on Very Large Data Bases
MiniCon: A scalable algorithm for answering queries using views
The VLDB Journal — The International Journal on Very Large Data Bases
Efficiently Ordering Query Plans for Data Integration
ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Mediated query processing over autonomous data sources
Mediated query processing over autonomous data sources
The Piazza peer data management project
ACM SIGMOD Record
Completeness of integrated information sources
Information Systems - Special issue: Data quality in cooperative information systems
The Anatomy of the Grid: Enabling Scalable Virtual Organizations
International Journal of High Performance Computing Applications
Introduction to Operations Research and Revised CD-ROM 8
Introduction to Operations Research and Revised CD-ROM 8
Querying the internet with PIER
VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
Query processing over incomplete autonomous databases
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
SQLB: a query allocation framework for autonomous consumers and providers
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Journal on data semantics VIII
Source selection in large scale data contexts: an optimization approach
DEXA'10 Proceedings of the 21st international conference on Database and expert systems applications: Part I
Query planning in the presence of overlapping sources
EDBT'06 Proceedings of the 10th international conference on Advances in Database Technology
Hi-index | 0.00 |
This paper concerns querying in large scale virtual organizations. Such organizations are characterized by a challenging data context involving a large number of distributed data sources with strong heterogeneity and uncontrolled data overlapping. In that context, data source selection during query evaluation is particularly important and complex. To cope with this task, we propose OptiSource, an original strategy for source selection using combinatorial optimization techniques combined to organizational knowledge of the virtual organization. Experiment numerical results show that OptiSource is a robust strategy that improves the precision and the recall of the source selection process. This paper presents the data and knowledge models, the definition of OptiSource, the related mathematical model, the prototype and an extensive experimental study.