While reading devices for the visually impaired have been available for many years, they are often expensive and difficult to use. The image processing required to enable the reading task is a composition of several important sub-tasks, including image capture, binarization, pyramidal representation, region segmentation, region grouping, separation of text sentences from images, and word recognition. In this paper we address some of these sub-tasks in an effort to prototype a machine (Tyflos-reader) that will read a document for a person with a visual impairment and respond to voice commands for control. The methodology used and illustrative results are provided in this paper.
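As a rough illustration of two of the sub-tasks named above, binarization and pyramidal representation, the following is a minimal sketch in plain Python. It is not the Tyflos-reader implementation: the use of Otsu's global threshold, the 2x2 mean reduction for the pyramid, and all function names (`otsu_threshold`, `binarize`, `build_pyramid`) are assumptions made only for illustration.

```python
from typing import List

# Assumption: a grayscale page is a list of rows with integer values 0..255.
Image = List[List[int]]


def otsu_threshold(img: Image) -> int:
    """Return the gray level that maximizes between-class variance (Otsu)."""
    hist = [0] * 256
    for row in img:
        for v in row:
            hist[v] += 1
    total = sum(hist)
    sum_all = sum(i * h for i, h in enumerate(hist))

    best_t, best_score = 0, -1.0
    w0 = 0      # number of pixels with value <= t
    sum0 = 0    # sum of gray values of those pixels
    for t in range(256):
        w0 += hist[t]
        sum0 += t * hist[t]
        w1 = total - w0
        if w0 == 0:
            continue        # no dark pixels yet
        if w1 == 0:
            break           # every pixel is already in the dark class
        m0 = sum0 / w0
        m1 = (sum_all - sum0) / w1
        score = w0 * w1 * (m0 - m1) ** 2
        if score > best_score:  # strict: keep the first maximizing level
            best_t, best_score = t, score
    return best_t


def binarize(img: Image, t: int) -> Image:
    """Mark dark pixels (value <= t, assumed to be ink) as 1, others as 0."""
    return [[1 if v <= t else 0 for v in row] for row in img]


def build_pyramid(img: Image) -> List[Image]:
    """Return [full resolution, half, quarter, ...] using 2x2 mean blocks.

    A trailing odd row or column is dropped at each level.
    """
    levels = [img]
    cur = img
    while len(cur) >= 2 and len(cur[0]) >= 2:
        h, w = len(cur) // 2, len(cur[0]) // 2
        cur = [
            [
                (cur[2 * y][2 * x] + cur[2 * y][2 * x + 1]
                 + cur[2 * y + 1][2 * x] + cur[2 * y + 1][2 * x + 1]) // 4
                for x in range(w)
            ]
            for y in range(h)
        ]
        levels.append(cur)
    return levels


if __name__ == "__main__":
    # Hypothetical 4x4 page: a dark 2x2 "ink" patch on a light background.
    page = [
        [10, 10, 200, 200],
        [10, 10, 200, 200],
        [200, 200, 200, 200],
        [200, 200, 200, 200],
    ]
    t = otsu_threshold(page)
    print("threshold:", t)
    print("binary:", binarize(page, t))
    print("pyramid sizes:", [(len(l), len(l[0])) for l in build_pyramid(page)])
```

In a pipeline of this kind, the coarse pyramid levels would typically be used to locate candidate text regions cheaply before the full-resolution binary image is examined; whether the prototype does so is not stated here.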