XML parsing: a threat to database performance
CIKM '03 Proceedings of the twelfth international conference on Information and knowledge management
Geography Mark-Up Language: Foundation for the Geo-Web
Geography Mark-Up Language: Foundation for the Geo-Web
XML screamer: an integrated approach to high performance XML parsing, validation and deserialization
Proceedings of the 15th international conference on World Wide Web
On the Problem of Coupling Java Algorithms and XML Parsers (Invited Paper)
DEXA '06 Proceedings of the 17th International Conference on Database and Expert Systems Applications
A case study in SIMD text processing with parallel bit streams: UTF-8 to UTF-16 transcoding
Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming
Proceedings of the 14th international conference on Architectural support for programming languages and operating systems
A Data Parallel Algorithm for XML DOM Parsing
XSym '09 Proceedings of the 6th International XML Database Symposium on Database and XML Technologies
A 1 cycle-per-byte XML parsing accelerator
Proceedings of the 18th annual ACM/SIGDA international symposium on Field programmable gate arrays
How to improve XML web services performance?
Proceedings of the International Conference and Workshop on Emerging Trends in Technology
Designing efficient XML web services
Proceedings of the International Conference & Workshop on Emerging Trends in Technology
Parallel scanning with bitstream addition: an XML case study
Euro-Par'11 Proceedings of the 17th international conference on Parallel processing - Volume Part II
Memory-side acceleration for XML parsing
NPC'11 Proceedings of the 8th IFIP international conference on Network and parallel computing
XLynx—An FPGA-based XML filter for hybrid XQuery processing
ACM Transactions on Database Systems (TODS) - Invited papers issue
Hi-index | 0.00 |
Parabix (parallel bit streams for XML) is an open-source XML parser that employs the SIMD (single-instruction multiple-data) capabilities of modern-day commodity processors to deliver dramatic performance improvements over traditional byte-at-a-time parsing technology. Byte-oriented character data is first transformed to a set of 8 parallel bit streams, each stream comprising one bit per character code unit. Character validation, transcoding and lexical item stream formation are all then carried out in parallel using bitwise logic and shifting operations. Byte-at-a-time scanning loops in the parser are replaced by bit scan loops that can advance by as many as 64 positions with a single instruction. A performance study comparing Parabix with the open-source Expat and Xerces parsers is carried out using the PAPI toolkit. Total CPU cycle counts, level 2 data cache misses and branch mispredictions are measured and compared for each parser. The performance of Parabix is further studied with a breakdown of the cycle counts across the core components of the parser. Prospects for further performance improvements are also outlined, with a particular emphasis on leveraging the intraregister parallelism of SIMD processing to enable intrachip parallelism on multicore architectures.