Parallel database systems: the future of high performance database systems
Communications of the ACM
Data integration: a theoretical perspective
Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Dryad: distributed data-parallel programs from sequential building blocks
Proceedings of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems 2007
EntityRank: searching entities directly and holistically
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
MapReduce: simplified data processing on large clusters
Communications of the ACM - 50th anniversary issue: 1958 - 2008
Bigtable: A Distributed Storage System for Structured Data
ACM Transactions on Computer Systems (TOCS)
Pig latin: a not-so-foreign language for data processing
Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Damia: data mashups for intranet applications
Proceedings of the 2008 ACM SIGMOD international conference on Management of data
A Domain-Specific Language for Web APIs and Services Mashups
ICSOC '07 Proceedings of the 5th international conference on Service-Oriented Computing
SCOPE: easy and efficient parallel processing of massive data sets
Proceedings of the VLDB Endowment
Rapid prototyping of semantic mash-ups through semantic web pipes
Proceedings of the 18th international conference on World wide web
ACM SIGMOD Record
Frameworks for entity matching: A comparison
Data & Knowledge Engineering
Nephele/PACTs: a programming model and execution framework for web-scale analytical processing
Proceedings of the 1st ACM symposium on Cloud computing
Enhancing Scalability and Performance of Mashups Through Merging and Operator Reordering
ICWS '10 Proceedings of the 2010 IEEE International Conference on Web Services
OSDI'08 Proceedings of the 8th USENIX conference on Operating systems design and implementation
Enhancing MapReduce via Asynchronous Data Processing
ICPADS '10 Proceedings of the 2010 IEEE 16th International Conference on Parallel and Distributed Systems
CoMaP: a cooperative overlay-based mashup platform
OTM'10 Proceedings of the 2010 international conference on On the move to meaningful internet systems - Volume Part I
WETSUIT: an efficient mashup tool for searching and fusing web entities
Proceedings of the VLDB Endowment
Active XML-based Web data integration
Information Systems Frontiers
Hi-index | 0.00 |
The advent of cloud computing technologies shows great promise for web engineering and facilitates the development of flexible, distributed, and scalable web applications. Data integration can notably benefit from cloud computing because integrating web data is usually an expensive task. This paper introduces CloudFuice, a data integration system that follows a mashup-like specification of advanced dataflows for data integration. CloudFuice's task-based execution approach allows for an efficient, asynchronous, and parallel execution of dataflows in the cloud and utilizes recent cloud-based web engineering instruments. We demonstrate and evaluate CloudFuice's applicability for mashup-based data integration in the cloud with the help of a first prototype implementation.