Optimizing queries with expensive video predicates in cloud environment

Authors:
Yansong Zhang; Lisheng Yu; Xiao Zhang; Shan Wang; Hui Li
Affiliations:
Yansong Zhang: DEKE Lab, Renmin University of China, Beijing, China and National Survey Research Centre at Renmin University of China, Beijing, China
Lisheng Yu: DEKE Lab, Renmin University of China, Beijing, China and School of Information, Renmin University of China, Beijing, China
Xiao Zhang: DEKE Lab, Renmin University of China, Beijing, China and School of Information, Renmin University of China, Beijing, China
Shan Wang: DEKE Lab, Renmin University of China, Beijing, China and School of Information, Renmin University of China, Beijing, China
Hui Li: DEKE Lab, Renmin University of China, Beijing, China and School of Information, Renmin University of China, Beijing, China
Venue:
Concurrency and Computation: Practice & Experience
Year:
2012


Special Issue on Multimedia Computing and Management in Cloud Environment



Abstract

With rapid advances in video processing technologies, the volume of video data has grown quickly, and video has become commonplace in both professional and consumer applications such as surveillance, education and entertainment. Because of the growing processing workload, more and more queries with expensive video predicates are executed in parallel environments for better performance. The data management system must therefore not only store and access video content but also optimize queries with expensive video predicates effectively and efficiently in a cloud environment. In the research literature, parallel and distributed policies and query optimization in relational database management systems are usually based on the disk input/output (I/O) cost of the operations involved and on the network transmission cost. However, for a query that contains expensive video predicates in a cloud environment, this traditional cost estimation model does not work well. Although researchers have proposed approaches that solve the problem in certain situations, some issues remain unresolved, and these approaches need further optimization. This paper is motivated by a real-world scenario: managing the business data and video surveillance data of a large supermarket in a parallel environment. Considering the characteristics of video data and the high cost of processing it, we present two methods, operating results buffer and operating results buffer-C, for implementing expensive video predicates on a single node, mapping video data and executing expensive video predicates in a cloud environment; these methods reduce both the cost of video data transmission and the number of invocations of expensive video predicates. We propose a novel query optimization approach that reconstructs the join-order-based estimation of attribute cardinality and computes the total cost from I/O, network and expensive processing costs. This approach further reduces the number of invocations of expensive video predicates and provides a better solution for optimizing mixed queries, which combine traditional data types with large object operations, in a cloud environment. Query performance improves by 30% to 80% compared with existing query optimization methods for expensive predicates. Copyright © 2011 John Wiley & Sons, Ltd.
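
As a rough illustration of two ideas named in the abstract, buffering the results of an expensive predicate so that it runs fewer times, and totalling I/O, network and expensive-processing costs, the following Python sketch may help. It is not the paper's implementation: the abstract does not specify how operating results buffer or operating results buffer-C work, so `ResultBuffer`, `total_cost`, `contains_person` and all cost figures are hypothetical, and the purely additive cost formula is an assumption.

```python
from typing import Callable, Dict, Hashable


class ResultBuffer:
    """Hypothetical helper: caches an expensive predicate's result per input key."""

    def __init__(self, predicate: Callable[[Hashable], bool]) -> None:
        self._predicate = predicate
        self._results: Dict[Hashable, bool] = {}
        self.invocations = 0  # how many times the real predicate actually ran

    def __call__(self, key: Hashable) -> bool:
        # Run the expensive predicate only on the first request for this key.
        if key not in self._results:
            self.invocations += 1
            self._results[key] = self._predicate(key)
        return self._results[key]


def total_cost(io_cost: float, network_cost: float,
               predicate_calls: int, cost_per_call: float) -> float:
    """Assumed additive model: I/O + network + expensive predicate processing."""
    return io_cost + network_cost + predicate_calls * cost_per_call


def contains_person(video_id: str) -> bool:
    # Stand-in for a costly video analysis; the rule here is arbitrary.
    return video_id.endswith("a")


buffered = ResultBuffer(contains_person)

# After a join, the same video may appear in many rows.
joined_rows = ["cam1-a", "cam2-b", "cam1-a", "cam1-a", "cam2-b"]
matches = [video for video in joined_rows if buffered(video)]

# Made-up cost units, chosen only to show the effect of fewer invocations.
cost_with_buffer = total_cost(10.0, 5.0, buffered.invocations, 100.0)
cost_without_buffer = total_cost(10.0, 5.0, len(joined_rows), 100.0)
```

In this toy run the predicate executes once per distinct video rather than once per joined row, so the processing term dominates the total cost far less. A real optimizer would also have to estimate how many distinct inputs each candidate join order produces, which is where cardinality estimation enters.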