A spatial-temporal approach for video caption detection and recognition
IEEE Transactions on Neural Networks
Localization and recognition of the scoreboard in sports video based on SIFT point matching
MMM'11 Proceedings of the 17th international conference on Advances in multimedia modeling - Volume Part II
Hi-index | 0.00 |
This paper presents an effective and efficient detection and recognition of scoreboard caption method for baseball videos. The method first identifies the scoreboard type using template matching and then extracts the caption region of each type. Next it recognizes the extracted caption utilizing a novel digit recognition scheme which is constructed by a simple neural network classifier. It results in a much simpler method with significantly higher recognition rate over that of the universal OCR scheme. Experimental results demonstrate the effectiveness of the proposed method and indicate that it identifies twelve scoreboard types correctly and recognizes scoreboard caption over 98%.