ACM Transactions on Information Systems (TOIS)
XMill: an efficient compressor for XML data
SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Millau: an encoding format for efficient representation and exchange of XML over the Web
Proceedings of the 9th international World Wide Web conference on Computer networks : the international journal of computer and telecommunications netowrking
Algorithms and programming models for efficient representation of XML for Internet applications
Proceedings of the 10th international conference on World Wide Web
Introduction to Information Theory and Data Compression
Introduction to Information Theory and Data Compression
A Corpus for the Evaluation of Lossless Compression Algorithms
DCC '97 Proceedings of the Conference on Data Compression
XPRESS: a queriable compression for XML data
Proceedings of the 2003 ACM SIGMOD international conference on Management of data
DCC '02 Proceedings of the Data Compression Conference
Compressing XML with Multiplexed Hierarchical PPM Models
DCC '01 Proceedings of the Data Compression Conference
XGRIND: A Query-Friendly XML Compressor
ICDE '02 Proceedings of the 18th International Conference on Data Engineering
AXECHOP: A Grammar-based Compressor for XML
DCC '05 Proceedings of the Data Compression Conference
Service-oriented architecture for mobile applications
Proceedings of the 1st international workshop on Software architectures and mobility
Hi-index | 0.00 |
XML simplifies data exchange amongst disparate computers, but is notoriously verbose and has spawned development of a variety of XML compressors and binary formats. Some formats allow streaming access to the data without complete decompression. We present an XML test file corpus, akin to corpora such as the Canterbury corpus and a combined efficiency metric integrating compression ratio and speed. We then use the test corpus to assess 14 general-purpose and XML-specific compressors against the efficiency and other metrics. After constructing linear regression models, we identify the factors influencing compressor selection and then rank the best-performing compressors.