Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary
ECCV '02 Proceedings of the 7th European Conference on Computer Vision-Part IV
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
GCap: Graph-based Automatic Image Captioning
CVPRW '04 Proceedings of the 2004 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'04) Volume 9 - Volume 09
An adaptive graph model for automatic image annotation
MIR '06 Proceedings of the 8th ACM international workshop on Multimedia information retrieval
Fast Random Walk with Restart and Its Applications
ICDM '06 Proceedings of the Sixth International Conference on Data Mining
Supervised Learning of Semantic Classes for Image Annotation and Retrieval
IEEE Transactions on Pattern Analysis and Machine Intelligence
Real-Time Computerized Annotation of Pictures
IEEE Transactions on Pattern Analysis and Machine Intelligence
Multiple Bernoulli relevance models for image and video annotation
CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
A correlation approach for automatic image annotation
ADMA'06 Proceedings of the Second international conference on Advanced Data Mining and Applications
Automatic image semantic interpretation using social action and tagging data
Multimedia Tools and Applications
Mining multi-tag association for image tagging
World Wide Web
Tagging image by exploring weighted correlation between visual features and tags
WAIM'11 Proceedings of the 12th international conference on Web-age information management
Image tagging by exploiting feature correlation
ICADL'11 Proceedings of the 13th international conference on Asia-pacific digital libraries: for cultural heritage, knowledge dissemination, and future creation
SympGraph: a framework for mining clinical notes through symptom relation graphs
Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
Improving image tags by exploiting web search results
Multimedia Tools and Applications
A spatio-temporal pyramid matching for video retrieval
Computer Vision and Image Understanding
Hi-index | 0.00 |
In this paper, we present a graph-based scheme founded on the GCap method of Pan et al. [12] to perform automatic image annotation. Our approach, namely enhanced GCap (EGCap), takes advantage of the canonical correlation analysis technique (CCA) to shorten the semantic gap in the image space and define a new metric in the text space to correlate annotations. As a result, graph linkage errors at the image level are decreased and the consistency of tags output by the system is improved. Besides, we introduce graph link weighting techniques based on inverse document frequency and CCA metric which are proved to enhance the annotation quality. Simple and self-consistent, the present approach achieves image tagging in real time due to the lightweight Local Binary Pattern image features used, the absence of image segmentation, and the reduced size of feature vectors after CCA projection. We test the proposed approach against top-grade state-of-the-art techniques on Corel and Flickr databases, and show the effectiveness of our method in terms of per-word, per-image and processing time performance indicators.