Video OCR for Digital News Archive
CAIVD '98 Proceedings of the 1998 International Workshop on Content-Based Access of Image and Video Databases (CAIVD '98)
Hi-index | 0.00 |
It is important to use pattern information (e.g. TV newscasts) and textual information (e.g. newspapers) together. For this purpose, we describe a method for aligning articles in TV newscasts and newspapers. Also, we describe a method for extracting a newspaper article and its follow-ups. In order to align articles, the alignment system uses words extracted from telops in TV newscasts. The recall and the precision of the alignment process are 97% and 89%, respectively. On the other hand, in order to obtain a newspaper and its follow-ups, the system uses typical expressions which give signs of subsequent articles. The recall and precision are 80% and 85%, respectively. Using the results of these processes, we develop a browsing and retrieval system for articles in TV newscasts and newspapers.