CloudVista: interactive and economical visual cluster analysis for big data in the cloud

Authors:
Huiqi Xu;Zhen Li;Shumin Guo;Keke Chen
Affiliations:
Wright State University, Dayton, Ohio;Wright State University, Dayton, Ohio;Wright State University, Dayton, Ohio;Wright State University, Dayton, Ohio
Venue:
Proceedings of the VLDB Endowment
Year:
2012

Citing 5
Cited 0

BIRCH: an efficient data clustering method for very large databases

SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
VISTA: validating and refining clusters via visualization

Information Visualization
MapReduce: simplified data processing on large clusters

OSDI'04 Proceedings of the 6th conference on Symposium on Opearting Systems Design & Implementation - Volume 6
CloudVista: visual cluster exploration for extreme scale data in the cloud

SSDBM'11 Proceedings of the 23rd international conference on Scientific and statistical database management
Towards Optimal Resource Provisioning for Running MapReduce Programs in Public Clouds

CLOUD '11 Proceedings of the 2011 IEEE 4th International Conference on Cloud Computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Analysis of big data has become an important problem for many business and scientific applications, among which clustering and visualizing clusters in big data raise some unique challenges. This demonstration presents the CloudVista prototype system to address the problems with big data caused by using existing data reduction approaches. It promotes a whole-big-data visualization approach that preserves the details of clustering structure. The prototype system has several merits. (1) Its visualization model is naturally parallel, which guarantees the scalability. (2) The visual frame structure minimizes the data transferred between the cloud and the client. (3) The RandGen algorithm is used to achieve a good balance between interactivity and batch processing. (4) This approach is also designed to minimize the financial cost of interactive exploration in the cloud. The demonstration will highlight the problems with existing approaches and show the advantages of the CloudVista approach. The viewers will have the chance to play with the CloudVista prototype system and compare the visualization results generated with different approaches.