Video classification using transform coefficients

Authors:
A. Girgensohn;J. Foote
Affiliations:
FX Palo Alto Lab., CA, USA;-
Venue:
ICASSP '99 Proceedings of the Acoustics, Speech, and Signal Processing, 1999. on 1999 IEEE International Conference - Volume 06
Year:
1999

Citing 0
Cited 12

Video Manga: generating semantically meaningful video summaries

MULTIMEDIA '99 Proceedings of the seventh ACM international conference on Multimedia (Part 1)
Automatically linking multimedia meeting documents by image matching

HYPERTEXT '00 Proceedings of the eleventh ACM on Hypertext and hypermedia
Time-Constrained Keyframe Selection Technique

Multimedia Tools and Applications
VideoCube: A Novel Tool for Video Mining and Classification

ICADL '02 Proceedings of the 5th International Conference on Asian Digital Libraries: Digital Libraries: People, Knowledge, and Technology
Hierarchical video content description and summarization using unified semantic and visual similarity

Multimedia Systems
A Survey of MPEG-1 Audio, Video and Semantic Analysis Techniques

Multimedia Tools and Applications
Supervised tensor learning

Knowledge and Information Systems
A Novel Video Classification Method Based on Hybrid Generative/Discriminative Models

SSPR & SPR '08 Proceedings of the 2008 Joint IAPR International Workshop on Structural, Syntactic, and Statistical Pattern Recognition
Text-based video content classification for online video-sharing sites

Journal of the American Society for Information Science and Technology
A method of generating table of contents for educational videos

PCM'05 Proceedings of the 6th Pacific-Rim conference on Advances in Multimedia Information Processing - Volume Part II
Video summarization: techniques and classification

ICCVG'12 Proceedings of the 2012 international conference on Computer Vision and Graphics
Video genre classification using weighted kernel logistic regression

Advances in Multimedia - Special issue on Multimedia Applications for Smart Device in Ubiquitous Environments

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper describes techniques for classifying video frames using statistical models of reduced DCT or Hadamard transform coefficients. When decimated in time and reduced using truncation or principal component analysis, transform coefficients taken across an entire frame image allow rapid modeling, segmentation and similarity calculation. Unlike color-histogram metrics, this approach models image composition and works on grayscale images. Modeling the statistics of the transformed video frame images gives a likelihood measure that allows video to be segmented, classified, and ranked by similarity for retrieval. Experiments are presented that show an 87% correct classification rate for different classes. Applications are presented including a content-aware video browser.