Parallel scanning with bitstream addition: an XML case study

  • Authors:
  • Robert D. Cameron;Ehsan Amiri;Kenneth S. Herdy;Dan Lin;Thomas C. Shermer;Fred P. Popowich

  • Affiliations:
  • Simon Fraser University, Surrey, BC, Canada;Simon Fraser University, Surrey, BC, Canada;Simon Fraser University, Surrey, BC, Canada;Simon Fraser University, Surrey, BC, Canada;Simon Fraser University, Surrey, BC, Canada;Simon Fraser University, Surrey, BC, Canada

  • Venue:
  • Euro-Par'11 Proceedings of the 17th international conference on Parallel processing - Volume Part II
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

A parallel scanning method using the concept of bitstream addition is introduced and studied in application to the problem of XML parsing and well-formedness checking. On processors supporting W-bit addition operations, the method can perform up to W finite state transitions per instruction. The method is based on the concept of parallel bitstream technology, in which parallel streams of bits are formed such that each stream comprises bits in one-to-one correspondence with the character code units of a source data stream. Parsing routines are initially prototyped in Python using its native support for unbounded integers to represent arbitrary-length bitstreams. A compiler then translates the Python code into low-level C-based implementations. These low-level implementations take advantage of the SIMD (single-instruction multipledata) capabilities of commodity processors to yield a dramatic speed-up over traditional alternatives employing byte-at-a-time parsing.