Schema Independent XML Compressor

Authors:
Zhongyu Joan Lu;Baydaa Al-Hamadani;Raad F. Alwan
Affiliations:
University of Huddersfield, UK;University of Huddersfield, UK;Philadelphia University, Jordan
Venue:
International Journal of Information Retrieval Research
Year:
2011

Citing 20
Cited 0

Arithmetic coding revisited

ACM Transactions on Information Systems (TOIS)
XMill: an efficient compressor for XML data

SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Millau: an encoding format for efficient representation and exchange of XML over the Web

Proceedings of the 9th international World Wide Web conference on Computer networks : the international journal of computer and telecommunications netowrking
Accelerating XPath location steps

Proceedings of the 2002 ACM SIGMOD international conference on Management of data
XPRESS: a queriable compression for XML data

Proceedings of the 2003 ACM SIGMOD international conference on Management of data
TIMBER: a native system for querying XML

Proceedings of the 2003 ACM SIGMOD international conference on Management of data
XGRIND: A Query-Friendly XML Compressor

ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Structural Joins: A Primitive for Efficient XML Query Pattern Matching

ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Compressing and searching XML data via two zips

Proceedings of the 15th international conference on World Wide Web
Data Compression: The Complete Reference

Data Compression: The Complete Reference
XQueC: A query-conscious compressed XML database

ACM Transactions on Internet Technology (TOIT)
Type-Based Compression of XML Data

DCC '07 Proceedings of the 2007 Data Compression Conference
An analysis of XML compression efficiency

Proceedings of the 2007 workshop on Experimental computer science
Mixed mode XML query processing

VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
Effective asymmetric XML compression

Software—Practice & Experience
XML Tree Structure Compression

DEXA '08 Proceedings of the 2008 19th International Conference on Database and Expert Systems Application
XML compression techniques: A survey and comparison

Journal of Computer and System Sciences
Services and Business Computing Solutions With Xml: Applications for Quality Management and Best Processes

Services and Business Computing Solutions With Xml: Applications for Quality Management and Best Processes
XML Lossy Text Compression: A Preliminary Study

XSym '09 Proceedings of the 6th International XML Database Symposium on Database and XML Technologies
XPathMark: an XPath benchmark for the XMark generated data

XSym'05 Proceedings of the Third international conference on Database and XML Technologies

Quantified Score

Hi-index	0.00

Visualization

Abstract

XML has become the standard way for representing and transforming data over the World Wide Web. The problem with XML documents is that they have a very high ratio of redundancy, which makes these documents demanding a large storage capacity and large network band-width for transmission. This study designs a system for compressing and querying XML documents XMLCQ which compresses the XML document without the need to its schema or DTD to minimize the amount of technologies associated with these documents. XMLCQ first compressed the XML document by separating its data into containers according to the path of these data from the root to the leaf, then it compressed these containers using a back-end compression technique. The compressed file then could be retrieved with any kind of queries applied. Only the required information is decompressed and submitted to the user. Depending on several experiments, the query processor part of the system showed the ability to answer different kinds of queries ranging from simple exact match queries to complex ones. Furthermore, this paper introduced the idea of retrieving information from more than one compressed XML documents.