An analysis of XML compression efficiency

  • Authors: Christopher J. Augeri; Dursun A. Bulutoglu; Barry E. Mullins; Rusty O. Baldwin; Leemon C. Baird, III

  • Affiliations: Wright Patterson Air Force Base, Dayton, OH; Wright Patterson Air Force Base, Dayton, OH; Wright Patterson Air Force Base, Dayton, OH; Wright Patterson Air Force Base, Dayton, OH; United States Air Force Academy (USAFA), Colorado Springs, CO

  • Venue: Proceedings of the 2007 workshop on Experimental computer science
  • Year: 2007

Abstract

XML simplifies data exchange among heterogeneous computers, but it is notoriously verbose and has spawned the development of many XML-specific compressors and binary formats. We present an XML test corpus and a combined efficiency metric that integrates compression ratio and execution speed. We use this corpus and linear regression to assess 14 general-purpose and XML-specific compressors relative to the proposed metric. We also identify key factors to consider when selecting a compressor. Our results show that XMill or WBXML may be useful in some instances, but a general-purpose compressor is often the best choice.
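
As a rough illustration of the two quantities the abstract combines, the sketch below measures compression ratio and wall-clock time for three general-purpose compressors from the Python standard library. It is not the paper's metric, corpus, or compressor set: the sample document, the `measure` and `run_benchmark` helpers, and the choice of zlib, bz2, and lzma are all assumptions made for illustration, and the two measurements are reported separately because the abstract does not state how the paper combines them.

```python
import bz2
import lzma
import time
import zlib


def measure(name, compress, data):
    """Return (name, compression ratio, elapsed seconds) for one compressor."""
    start = time.perf_counter()
    packed = compress(data)
    elapsed = time.perf_counter() - start
    ratio = len(packed) / len(data)  # compressed size / original size; smaller is better
    return name, ratio, elapsed


# Hypothetical sample input: a small, repetitive XML document.
# It is NOT taken from the paper's test corpus.
SAMPLE_XML = (
    b"<?xml version='1.0'?><records>"
    + b"".join(
        b"<record id='%d'><name>item</name><value>%d</value></record>" % (i, i * 7)
        for i in range(2000)
    )
    + b"</records>"
)

# General-purpose compressors available in the standard library (an assumption;
# the paper's 14 compressors are not listed in the abstract).
COMPRESSORS = {
    "zlib": zlib.compress,
    "bz2": bz2.compress,
    "lzma": lzma.compress,
}


def run_benchmark(data=SAMPLE_XML):
    """Measure every compressor in COMPRESSORS on the same input."""
    return [measure(name, fn, data) for name, fn in COMPRESSORS.items()]


if __name__ == "__main__":
    for name, ratio, elapsed in run_benchmark():
        print(f"{name:5s} ratio={ratio:.3f} time={elapsed * 1000:.2f} ms")
```

Timings from a single run on one small input are noisy; a fair comparison would repeat each measurement and use documents of varying size and structure.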