MapReduce: a flexible data processing tool
Communications of the ACM - Amir Pnueli: Ahead of His Time
pygrametl: a powerful programming framework for extract-transform-load programmers
Proceedings of the ACM twelfth international workshop on Data warehousing and OLAP
Hi-index | 0.00 |
This paper presents ETLMR, a parallel Extract-Transform-Load (ETL) programming framework based on MapReduce. It has builtin support for high-level ETL-specific constructs including star schemas, snowflake schemas, and slowly changing dimensions (SCDs). ETLMR gives both high programming productivity and high ETL scalability.