XQueC: A query-conscious compressed XML database
ACM Transactions on Internet Technology (TOIT)
EDBT '08 Proceedings of the 11th international conference on Extending database technology: Advances in database technology
Effective asymmetric XML compression
Software—Practice & Experience
XML compression techniques: A survey and comparison
Journal of Computer and System Sciences
Efficient XML usage within wireless sensor networks
Proceedings of the 4th Annual International Conference on Wireless Internet
Compressing and indexing labeled trees, with applications
Journal of the ACM (JACM)
Knowledge and Information Systems
XQPoint: a queriable homomorphic XML compressor
IIT'09 Proceedings of the 6th international conference on Innovations in information technology
CSC: supporting queries on compressed cached XML
ADC '09 Proceedings of the Twentieth Australasian Conference on Australasian Database - Volume 92
Searchable compression of office documents by XML schema subtraction
XSym'10 Proceedings of the 7th international XML database conference on Database and XML technologies
A query-friendly compression for GML documents
DASFAA'11 Proceedings of the 16th international conference on Database systems for advanced applications
Updates on grammar-compressed XML data
BNCOD'11 Proceedings of the 28th British national conference on Advances in databases
Data management for mobile Ajax web 2.0 applications
DEXA'07 Proceedings of the 18th international conference on Database and Expert Systems Applications
Fast multi-update operations on compressed XML data
BNCOD'13 Proceedings of the 29th British National conference on Big Data
Hi-index | 0.00 |
XML has already become the de facto standard for specifying and exchanging data on the Web. However, XML is by nature verbose and thus XML documents are usually large in size, a factor that hinders its practical usage, since it substantially increases the costs of storing, processing, and exchanging data. In order to tackle this problem, many XML-specific compression systems, such as XMill, XGrind, XMLPPM, and Millau, have recently been proposed. However, these systems usually suffer from the following two inadequacies: They either sacrifice performance in terms of compression ratio and execution time in order to support a limited range of queries, or perform full decompression prior to processing queries over compressed documents.In this paper, we address the above problems by exploiting the information provided by a Document Type Definition (DTD) associated with an XML document. We show that a DTD is able to facilitate better compression as well as generate more usable compressed data to support querying. We present the architecture of the XCQ, which is a compression and querying tool for handling XML data. XCQ is based on a novel technique we have developed called DTD Tree and SAX Event Stream Parsing (DSP). The documents compressed by XCQ are stored in Partitioned Path-Based Grouping (PPG) data streams, which are equipped with a Block Statistics Signature (BSS) indexing scheme. The indexed PPG data streams support the processing of XML queries that involve selection and aggregation, without the need for full decompression. In order to study the compression performance of XCQ, we carry out comprehensive experiments over a set of XML benchmark datasets.