Document logical structure analysis based on perceptive cycles

Authors:
Yves Rangoni;Abdel Belaïd
Affiliations:
Loria Research Center – Read Group, Vandœuvre-lès-Nancy, France;Loria Research Center – Read Group, Vandœuvre-lès-Nancy, France
Venue:
DAS'06 Proceedings of the 7th international conference on Document Analysis Systems
Year:
2006

Citing 3
Cited 1

Twenty Years of Document Image Analysis in PAMI

IEEE Transactions on Pattern Analysis and Machine Intelligence
An introduction to variable and feature selection

The Journal of Machine Learning Research
Data categorization for a context return applied to logical document structure recognition

ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition

Logical Structure Recovery in Scholarly Articles with Rich Document Features

International Journal of Digital Library Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper describes a Neural Network (NN) approach for logical document structure extraction. In this NN architecture, called Transparent Neural Network (TNN), the document structure is stretched along the layers, allowing an interpretation decomposition from physical (NN input) to logical (NN output) level. The intermediate layers represent successive interpretation steps. Each neuron is apparent and associated to a logical element. The recognition proceeds by repetitive perceptive cycles propagating the information through the layers. In case of low recognition rate, an enhancement is achieved by error backpropagation leading to correct or pick up a more adapted input feature subset. Several feature subsets are created using a modified filter method. The first experiments performed on scientific documents are encouraging.