The art of computer programming, volume 3: (2nd ed.) sorting and searching
The art of computer programming, volume 3: (2nd ed.) sorting and searching
XMill: an efficient compressor for XML data
SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Vinci: a service-oriented architecture for rapid development of web applications
Proceedings of the 10th international conference on World Wide Web
Algorithms and programming models for efficient representation of XML for Internet applications
Proceedings of the 10th international conference on World Wide Web
Compressing XML with Multiplexed Hierarchical PPM Models
DCC '01 Proceedings of the Data Compression Conference
How to build a WebFountain: An architecture for very large-scale text analytics
IBM Systems Journal
Significant Characteristics to Abstract Content: Long Term Preservation of Information
ECDL '08 Proceedings of the 12th European conference on Research and Advanced Technology for Digital Libraries
Hi-index | 0.00 |
XML provides a universal and portable format for document and data exchange. While the syntax and specification of XML makes documents both human readable and machine parsable, it is often at the expense of efficiency when representing simple data structures.We investigate the ``costs'' associated with XML serialization from several resource perspectives: storage, transport, processing and human readability. These experiments are done within the context of a large text-centric service oriented architecture -- IBM's WebFountain project.We find that for several applications, human readable formats outperform binary equivalents, especially in the area of data size, and that the costs of processing encoded binary data often exceeds that of processing terse human readable formats.