The elements of graphing data
C4.5: programs for machine learning
C4.5: programs for machine learning
Direct Least Square Fitting of Ellipses
IEEE Transactions on Pattern Analysis and Machine Intelligence
Visualizing Data
Layout-Based Approach for Extracting Constructive Elements of Bar-Charts
GREC '97 Selected Papers from the Second International Workshop on Graphics Recognition, Algorithms and Systems
Text/Graphics Separation Revisited
DAS '02 Proceedings of the 5th International Workshop on Document Analysis Systems V
Unsupervised learning of soft patterns for generating definitions from online news
Proceedings of the 13th international conference on World Wide Web
Evaluation of an extraction-based approach to answering definitional questions
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Learning surface text patterns for a Question Answering system
ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Multi-Level Component Grouping Algorithm and Its Applications
ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
Using string-kernels for learning semantic parsers
ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Chart Image Classification Using Multiple-Instance Learning
WACV '07 Proceedings of the Eighth IEEE Workshop on Applications of Computer Vision
NPIC: hierarchical synthetic image classification using image search and generic features
CIVR'06 Proceedings of the 5th international conference on Image and Video Retrieval
ReVision: automated classification, analysis and redesign of chart images
Proceedings of the 24th annual ACM symposium on User interface software and technology
Automatically recognizing intended messages in grouped bar charts
Diagrams'12 Proceedings of the 7th international conference on Diagrammatic Representation and Inference
Visualizing computer ethics using infographics
Proceedings of the 18th ACM conference on Innovation and technology in computer science education
Hi-index | 0.00 |
Information graphics, or infographics, are visual representations of information, data or knowledge. Understanding of infographics in documents is a relatively new research problem, which becomes more challenging when infographics appear as raster images. This paper describes technical details and practical applications of the system we built for recognizing and understanding imaged infographics located in document pages. To recognize infographics in raster form, both graphical symbol extraction and text recognition need to be performed. The two kinds of information are then auto-associated to capture and store the semantic information carried by the infographics. Two practical applications of the system are introduced in this paper, including supplement to traditional optical character recognition (OCR) system and providing enriched information for question answering (QA). To test the performance of our system, we conducted experiments using a collection of downloaded and scanned infographic images. Another set of scanned document pages from the University of Washington document image database were used to demonstrate how the system output can be used by other applications. The results obtained confirm the practical value of the system.