Text-based video content classification for online video-sharing sites

Authors:
Chunneng Huang;Tianjun Fu;Hsinchun Chen
Affiliations:
AI Lab, Department of Management Information Systems, University of Arizona, Tucson, AZ 85721;AI Lab, Department of Management Information Systems, University of Arizona, Tucson, AZ 85721;AI Lab, Department of Management Information Systems, University of Arizona, Tucson, AZ 85721
Venue:
Journal of the American Society for Information Science and Technology
Year:
2010

Citing 49
Cited 4

A comparative study of ID3 and backpropagation for English text-to-speech mapping

Proceedings of the seventh international conference (1990) on Machine learning
Automatic recognition and analysis of human faces and facial expressions: a survey

Pattern Recognition
Content-Based Video Indexing and Retrieval

IEEE MultiMedia
The nature of statistical learning theory

The nature of statistical learning theory
Automatic recognition of film genres

Proceedings of the third ACM international conference on Multimedia
A machine learning approach to inductive query by examples: an experiment using relevance feedback, ID3, genetic algorithms, and simulated annealing

Journal of the American Society for Information Science
Trawling the Web for emerging cyber-communities

WWW '99 Proceedings of the eighth international conference on World Wide Web
Rule-based video classification system for basketball video indexing

MULTIMEDIA '00 Proceedings of the 2000 ACM workshops on Multimedia
Classification of summarized videos using hidden markov models on compressed chromaticity signatures

MULTIMEDIA '01 Proceedings of the ninth ACM international conference on Multimedia
Guest Editor's Introduction: Content-Based Multimedia Indexing and Retrieval

IEEE MultiMedia
Induction of Decision Trees

Machine Learning
News video classification using SVM-based multimodal classifiers and combination strategies

Proceedings of the tenth ACM international conference on Multimedia
Naive (Bayes) at Forty: The Independence Assumption in Information Retrieval

ECML '98 Proceedings of the 10th European Conference on Machine Learning
A Comparative Study on Feature Selection in Text Categorization

ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
VideoCube: A Novel Tool for Video Mining and Classification

ICADL '02 Proceedings of the 5th International Conference on Asian Digital Libraries: Digital Libraries: People, Knowledge, and Technology
Authorship Attribution with Support Vector Machines

Applied Intelligence
Style mining of electronic messages for multiple authorship discrimination: first results

Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
A mid-level representation framework for semantic sports video analysis

MULTIMEDIA '03 Proceedings of the eleventh ACM international conference on Multimedia
A multi-modal system for the retrieval of semantic video events

Computer Vision and Image Understanding - Special issue on event detection in video
Automatic Image Orientation Detection via Confidence-Based Integration of Low-Level and Semantic Cues

IEEE Transactions on Pattern Analysis and Machine Intelligence
Language independent authorship attribution using character level language models

EACL '03 Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - Volume 1
Applying Authorship Analysis to Extremist-Group Web Forum Messages

IEEE Intelligent Systems
A framework for authorship identification of online messages: Writing-style features and classification techniques

Journal of the American Society for Information Science and Technology
Content-based multimedia information retrieval: State of the art and challenges

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Sports video classification using HMMS

ICME '03 Proceedings of the 2003 International Conference on Multimedia and Expo - Volume 1
Video classification using spatial-temporal features and PCA

ICME '03 Proceedings of the 2003 International Conference on Multimedia and Expo - Volume 3 (ICME '03) - Volume 03
Mining communities and their relationships in blogs: A study of online hate groups

International Journal of Human-Computer Studies
Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study

International Journal of Computer Vision
Tagging video: conventions and strategies of the YouTube community

Proceedings of the 7th ACM/IEEE-CS joint conference on Digital libraries
Content-based video indexing of TV broadcast news using hidden Markov models

ICASSP '99 Proceedings of the Acoustics, Speech, and Signal Processing, 1999. on 1999 IEEE International Conference - Volume 06
Video classification using transform coefficients

ICASSP '99 Proceedings of the Acoustics, Speech, and Signal Processing, 1999. on 1999 IEEE International Conference - Volume 06
Motion pattern-based video classification and retrieval

EURASIP Journal on Applied Signal Processing
Automatic Genre Classification of TV Programmes Using Gaussian Mixture Models and Neural Networks

DEXA '07 Proceedings of the 18th International Conference on Database and Expert Systems Applications
Rule-based Event Detection of Broadcast Baseball Videos Using Mid-level Cues

ICICIC '07 Proceedings of the Second International Conference on Innovative Computing, Informatio and Control
Writeprints: A stylometric approach to identity-level identification and similarity detection in cyberspace

ACM Transactions on Information Systems (TOIS)
Sentiment analysis in multiple languages: Feature selection for opinion classification in Web forums

ACM Transactions on Information Systems (TOIS)
A hybrid approach to Web forum interactional coherence analysis

Journal of the American Society for Information Science and Technology
Video Event Recognition Using Kernel Methods with Multilevel Temporal Alignment

IEEE Transactions on Pattern Analysis and Machine Intelligence
Stylometric Identification in Electronic Markets: Scalability and Robustness

Journal of Management Information Systems
Computational methods in authorship attribution

Journal of the American Society for Information Science and Technology
Yahoo! for Amazon: Sentiment Extraction from Small Talk on the Web

Management Science
Perspectives on social tagging

Journal of the American Society for Information Science and Technology
Visual cue cluster construction via information bottleneck principle and kernel density estimation

CIVR'05 Proceedings of the 4th international conference on Image and Video Retrieval
Estimation of fuzzy Gaussian mixture and unsupervised statistical image segmentation

IEEE Transactions on Image Processing
Statistical models of video structure for content analysis and characterization

IEEE Transactions on Image Processing
Estimation of generalized mixture in the case of correlated sensors

IEEE Transactions on Image Processing
An efficient and effective region-based image retrieval framework

IEEE Transactions on Image Processing
Learning Midlevel Image Features for Natural Scene and Texture Classification

IEEE Transactions on Circuits and Systems for Video Technology

Improved video categorization from text metadata and user comments

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
An evaluation of classification models for question topic categorization

Journal of the American Society for Information Science and Technology
Artificial immune system for illicit content identification in social media

Journal of the American Society for Information Science and Technology
Enriching media fragments with named entities for video classification

Proceedings of the 22nd international conference on World Wide Web companion

Quantified Score

Hi-index	0.00

Visualization

Abstract

With the emergence of Web 2.0, sharing personal content, communicating ideas, and interacting with other online users in Web 2.0 communities have become daily routines for online users. User-generated data from Web 2.0 sites provide rich personal information (e.g., personal preferences and interests) and can be utilized to obtain insight about cyber communities and their social networks. Many studies have focused on leveraging user-generated information to analyze blogs and forums, but few studies have applied this approach to video-sharing Web sites. In this study, we propose a text-based framework for video content classification of online-video sharing Web sites. Different types of user-generated data (e.g., titles, descriptions, and comments) were used as proxies for online videos, and three types of text features (lexical, syntactic, and content-specific features) were extracted. Three feature-based classification techniques (C4.5, Naïve Bayes, and Support Vector Machine) were used to classify videos. To evaluate the proposed framework, user-generated data from candidate videos, which were identified by searching user-given keywords on YouTube, were first collected. Then, a subset of the collected data was randomly selected and manually tagged by users as our experiment data. The experimental results showed that the proposed approach was able to classify online videos based on users' interests with accuracy rates up to 87.2%, and all three types of text features contributed to discriminating videos. Support Vector Machine outperformed C4.5 and Naïve Bayes techniques in our experiments. In addition, our case study further demonstrated that accurate video-classification results are very useful for identifying implicit cyber communities on video-sharing Web sites. © 2010 Wiley Periodicals, Inc.