Automatic Caption Localization in Compressed Video
IEEE Transactions on Pattern Analysis and Machine Intelligence
ICDAR 2003 Robust Reading Competitions
ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 2
Video Google: A Text Retrieval Approach to Object Matching in Videos
ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Distinctive Image Features from Scale-Invariant Keypoints
International Journal of Computer Vision
Proceedings of the 12th annual ACM international conference on Multimedia
A Performance Evaluation of Local Descriptors
IEEE Transactions on Pattern Analysis and Machine Intelligence
A Time-Efficient Cascade for Real-Time Object Detection: With applications for the visually impaired
CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops - Volume 03
Text Locating Competition Results
ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
Outdoors augmented reality on mobile phone using loxel-based visual feature organization
MIR '08 Proceedings of the 1st ACM international conference on Multimedia information retrieval
Fast and robust text detection in images and video frames
Image and Vision Computing
Detecting and reading text in natural scenes
CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
A Laplacian Approach to Multi-Oriented Text Detection in Video
IEEE Transactions on Pattern Analysis and Machine Intelligence
Building descriptive and discriminative visual codebook for large-scale image applications
Multimedia Tools and Applications
Text localization and recognition in complex scenes using local features
ACCV'10 Proceedings of the 10th Asian conference on Computer vision - Volume Part III
A method for text localization and recognition in real-world images
ACCV'10 Proceedings of the 10th Asian conference on Computer vision - Volume Part III
Large scale image search with geometric coding
MM '11 Proceedings of the 19th ACM international conference on Multimedia
ACCV'06 Proceedings of the 7th Asian conference on Computer Vision - Volume Part I
Detecting texts of arbitrary orientations in natural images
CVPR '12 Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
CARD: Compact And Real-time Descriptors
ICCV '11 Proceedings of the 2011 International Conference on Computer Vision
Scalar quantization for large scale image search
Proceedings of the 20th ACM international conference on Multimedia
Hi-index | 0.00 |
Scene text is widely observed in our daily life and has many important multimedia applications. Unlike document text, scene text usually exhibits large variations in font and language, and suffers from low resolution, occlusions and complex background. In this paper, we present a novel scale-based region growing algorithm for scene text detection. We first distinguish SIFT features in text regions from those in background by exploring the inter- and intra-statistics of SIFT features. Then scene text regions in images are identified by scale-based region growing, which explores the geometric context of SIFT keypoints in local regions. Our algorithm is very effective to detect multilingual text in various fonts, sizes, and with complex background. In addition, it offers insights on efficiently deploying local features in numerous applications, such as visual search. We evaluate our algorithm on three datasets and achieve the state-of-the-art performance.