MapReduce: simplified data processing on large clusters
OSDI'04 Proceedings of the 6th conference on Symposium on Opearting Systems Design & Implementation - Volume 6
Dryad: distributed data-parallel programs from sequential building blocks
Proceedings of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems 2007
MapReduce optimization using regulated dynamic prioritization
Proceedings of the eleventh international joint conference on Measurement and modeling of computer systems
Hive: a warehousing solution over a map-reduce framework
Proceedings of the VLDB Endowment
HadoopDB: an architectural hybrid of MapReduce and DBMS technologies for analytical workloads
Proceedings of the VLDB Endowment
Optimizing joins in a map-reduce environment
Proceedings of the 13th International Conference on Extending Database Technology
HadoopDB in action: building real world applications
Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
Twister: a runtime for iterative MapReduce
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
Join Optimization in the MapReduce Environment for Column-wise Data Store
SKG '10 Proceedings of the 2010 Sixth International Conference on Semantics, Knowledge and Grids
HaLoop: efficient iterative data processing on large clusters
Proceedings of the VLDB Endowment
Hadoop++: making a yellow elephant run like a cheetah (without it even noticing)
Proceedings of the VLDB Endowment
Cheetah: a high performance, custom data warehouse on top of MapReduce
Proceedings of the VLDB Endowment
Hi-index | 0.00 |
In this paper, we design a kind of big data processing framework SemanMR (Semantic MapReduce). SemanMR is a programming framework based on the Hadoop MapReduce programming model. SemanMR provide a kind of bid data processing mechanism based on the metadata cluster of distributed file systems or cloud databases. In addition, we add some semantic index on the big data, and so it will improve our processing efficiency in SemanMR. SemanMR is a kind of big data processing internetware in the cloud environment.