Document image analysis for active reading

Authors:
Claudie Faure; Nicole Vincent
Affiliations:
CNRS -- LTCI, Paris cedex; CRIP5 -- Université Paris Descartes, Paris cedex
Venue:
SADPI '07 Proceedings of the 2007 international workshop on Semantically aware document processing and indexing
Year:
2007

Abstract

A huge number of documents that were once available only in libraries are now on the web. Web access helps protect cultural heritage and facilitates the transmission of knowledge. Most of these documents are displayed as images of the original paper pages and are indexed by hand. In this paper, we present how and why Document Image Analysis contributes to building the Digital Libraries of the future. Readers expect human-centred interactive reading stations, which implies producing hyperdocuments that fit the reader's intentions and needs. Image analysis makes it possible to extract and categorize the meaningful components of a document and the relationships between them; it also provides visualisations of the original images adapted to the reader. Document Image Analysis is an essential prerequisite for enriching hyperdocuments that support content-based reader activities such as information seeking and navigation. This paper focuses on the dual function of the original image: as a reference for the reader, and as the input data processed to automatically detect what makes sense in a document.