The print scraping system

  • Authors:
  • Brian Ray;JingJiang He;Chia-Chu Chiang;Jim Melescue

  • Affiliations:
  • University of Arkansas at Little Rock, Little Rock, Arkansas;University of Arkansas at Little Rock, Little Rock, Arkansas;University of Arkansas at Little Rock, Little Rock, Arkansas;Software Development Syntel™ LLC., Jonesboro, Arkansas

  • Venue:
  • Proceedings of the 43rd annual Southeast regional conference - Volume 2
  • Year:
  • 2005

Abstract

Computerized documents are created with a wide variety of software and exist in many formats, which makes them difficult to handle for any application system intended to process documents after their initial creation. Extracting text data from files in various formats and placing it in a designated template file for further processing enables companies to manage documents more efficiently and effectively. The current Syntel™ AutoMail® System extracts and processes data from ASCII text and database files. The main goal of this project is to develop a system, suitable for installation on end-user machines, that hooks into the Windows XP print processing system and extracts data from the print data stream stored in the spool directory. The software then lets the user view the pages as they will appear on the printer and select the portions of each page to export to a file in a Syntel™ format for further processing, such as insertion of additional text and barcodes into the file and post-verification of the data.
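
The extraction step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the spooler's default directory on a standard Windows installation, assumes spooled job data sits in `.SPL` files, and simply pulls runs of printable ASCII out of each file. The names `DEFAULT_SPOOL_DIR`, `extract_text_runs`, and `scan_spool_dir` are hypothetical. Real spool data may be EMF or a raw printer language, and text in it may use other encodings, so an ASCII scan is only a simplification.

```python
import re
from pathlib import Path

# Assumption: default spool directory on a standard Windows installation.
# The spooler can be configured to use a different location.
DEFAULT_SPOOL_DIR = Path(r"C:\Windows\System32\spool\PRINTERS")


def extract_text_runs(data: bytes, min_len: int = 4) -> list[str]:
    """Return runs of at least min_len printable ASCII bytes found in data."""
    pattern = re.compile(rb"[\x20-\x7e]{%d,}" % min_len)
    return [run.decode("ascii") for run in pattern.findall(data)]


def scan_spool_dir(spool_dir: Path = DEFAULT_SPOOL_DIR) -> dict[str, list[str]]:
    """Map each spool data file name in spool_dir to its extracted text runs."""
    results: dict[str, list[str]] = {}
    for path in sorted(spool_dir.glob("*.SPL")):
        results[path.name] = extract_text_runs(path.read_bytes())
    return results
```

Calling `scan_spool_dir()` with no argument reads the assumed default directory, which typically requires administrator rights; passing another `Path` lets the same scan run on copied spool files. The page preview and region selection described in the abstract would need a renderer for the spooled format and are outside the scope of this sketch.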