Recognizing address blocks on mail pieces
AI Magazine
Classification of newspaper image blocks using texture analysis
Computer Vision, Graphics, and Image Processing
Algorithms for Graphics and Imag
Algorithms for Graphics and Imag
Multilingual Text-to-Speech Synthesis
Multilingual Text-to-Speech Synthesis
Fundamentals of Data Structures in C
Fundamentals of Data Structures in C
Multiscale Segmentation of Unstructured Document Pages Using Soft Decision Integration
IEEE Transactions on Pattern Analysis and Machine Intelligence
A Rational Design for a Weighted Finite-State Transducer Library
WIA '97 Revised Papers from the Second International Workshop on Implementing Automata
Using White Space for Automated Document Structuring
Using White Space for Automated Document Structuring
Multilingual text analysis for text-to-speech synthesis
Natural Language Engineering
A matrix grammar for the document processing
IEA/AIE'93 Proceedings of the 6th international conference on Industrial and engineering applications of artificial intelligence and expert systems
Exploring discussion lists: steps and directions
Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries
Deriving marketing intelligence from online discussion
Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
LIPTUS: associating structured and unstructured information in a banking environment
Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Segmenting email message text into zones
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
PROPOR'03 Proceedings of the 6th international conference on Computational processing of the Portuguese language
Interpreting contact details out of e-mail signature blocks
Proceedings of the 21st international conference companion on World Wide Web
Implicit bookmarking: Improving support for revisitation in within-document reading tasks
International Journal of Human-Computer Studies
Hi-index | 0.00 |
The signature block is a common structured component found in email messages. Accurate identification and analysis of signature blocks is important in many multimedia messaging and information retrieval applications such as email text-to-speech rendering, automatic construction of personal address databases, and interactive message retrieval. It is also a very challenging task, because signature blocks often appear in complex two-dimensional layouts which are guided only by loose conventions. Traditional text analysis methods designed to deal with sequential text cannot handle two-dimensional structures, while the highly unconstrained nature of signature blocks makes the application of two-dimensional grammars very difficult. In this article, we describe an algorithm for signature block analysis which combines two-dimensional structural segmentation with one-dimensional grammatical constraints. The information obtained from both layout and linguistic analysis is integrated in the form of weighted finite-state transducers. The algorithm is currently implemented as a component in a preprocessing system for email text-to-speech rendering.