SVM Classification Using Sequences of Phonemes and Syllables

Authors:
Gerhard Paass;Edda Leopold;Martha Larson;Jörg Kindermann;Stefan Eickeler
Affiliations:
-;-;-;-;-
Venue:
PKDD '02 Proceedings of the 6th European Conference on Principles of Data Mining and Knowledge Discovery
Year:
2002

Citing 6
Cited 9

A system for retrieving speech documents

SIGIR '92 Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval
Inductive learning algorithms and representations for text categorization

Proceedings of the seventh international conference on Information and knowledge management
Foundations of statistical natural language processing

Foundations of statistical natural language processing
Text Categorization with Support Vector Machines. How to Represent Texts in Input Space?

Machine Learning
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features

ECML '98 Proceedings of the 10th European Conference on Machine Learning
Support vector machines for spam categorization

IEEE Transactions on Neural Networks

A survey of kernels for structured data

ACM SIGKDD Explorations Newsletter
On optimal degree selection for polynomial kernel with support vector machines: Theoretical and empirical investigations

International Journal of Knowledge-based and Intelligent Engineering Systems
Support for seamless data exchanges between web services through information mapping analysis using kernel methods

Expert Systems with Applications: An International Journal
PodCred: a framework for analyzing podcast preference

Proceedings of the 2nd ACM workshop on Information credibility on the web
Overview of VideoCLEF 2008: automatic generation of topic-based feeds for dual language audio-visual content

CLEF'08 Proceedings of the 9th Cross-language evaluation forum conference on Evaluating systems for multilingual and multimodal information access
Metadata and multilinguality in video classification

CLEF'08 Proceedings of the 9th Cross-language evaluation forum conference on Evaluating systems for multilingual and multimodal information access
Automatic tagging and geotagging in video collections and communities

Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Spoken Content Retrieval: A Survey of Techniques and Technologies

Foundations and Trends in Information Retrieval
Generating web-based corpora for video transcripts categorization

Expert Systems with Applications: An International Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper we use SVMs to classify spoken and written documents. We show that classification accuracy for written material is improved by the utilization of strings of sub-word units with dramatic gains for small topic categories. The classification of spoken documents for large categories using sub-word units is only slightly worse than for written material, with a larger drop for small topicc ategories. Finally it is possible, without loss, to train SVMs on syllables generated from written material and use them to classify audio documents. Our results confirm the strong promise that SVMs hold for robust audio document classification, and suggest that SVMs can compensate for speech recognition error to an extent that allows a significant degree of topic independence to be introduced into the system.