Life sciences research routinely and extensively uses external online databases, tools, and applications to implement computational pipelines. These applications are among the truly distributed and highly collaborative global systems in existence. Because the resources these applications use are designed to serve individual users, they adopt an all-or-nothing model in which users must accept the entire response even when only a fraction of it is relevant. In computational pipelines that involve several databases and complex repeated operations, the cost of unnecessary data transmission and computation can be significant enough to reduce productivity and make the applications sluggish. Because these resources are autonomous and do not accept user instructions or queries, users cannot customize their behavior to reduce network latency, wasteful computation, or needless data transmission. Such a resource utilization and sharing model is wasteful and expensive. In this paper, we propose a new collaborative data integration and computational pipeline execution model for systems biology research. We show that, in the envisioned model, arbitrary sites are able to accept user constraints and limited processing instructions to avoid wasteful computation, resulting in improved overall efficiency. We also demonstrate that the proposed collaborative model neither breaches site security nor infringes upon site autonomy.