Using One-Class Classification Techniques in the Anti-phoneme Problem

Authors:
Gábor Gosztolya;András Bánhalmi;László Tóth
Affiliations:
MTA-SZTE Research Group on Artificial Intelligence, of the Hungarian Academy of Sciences and University of Szeged, Szeged, Hungary H-6720;MTA-SZTE Research Group on Artificial Intelligence, of the Hungarian Academy of Sciences and University of Szeged, Szeged, Hungary H-6720;MTA-SZTE Research Group on Artificial Intelligence, of the Hungarian Academy of Sciences and University of Szeged, Szeged, Hungary H-6720
Venue:
IbPRIA '09 Proceedings of the 4th Iberian Conference on Pattern Recognition and Image Analysis
Year:
2009

Abstract

In this paper we focus on the anti-phoneme modelling part of segment-based speech recognition, where real phonemes have to be distinguished from anything else that may appear (such as parts of phonemes, several consecutive phonemes, and noise). Because this has to be done with only samples of correct phonemes available, it is an instance of one-class classification. To solve this problem, all phonemes are first modelled with a number of Gaussian distributions; the problem is then converted into a two-class classification task by generating counter-examples, so that a standard machine learning algorithm (such as an ANN) can be used to separate the two classes. We tested two methods for generating counter-examples in this way: one was a solution specific to the anti-phoneme problem, while the other was a general algorithm. By modifying the latter to reduce its time requirements, we achieved an improvement in the recognition scores of over 60% compared to having no anti-phoneme model at all, and it performed considerably better than the other two methods.
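
The general idea of turning a one-class problem into a two-class one can be illustrated with a minimal sketch. It is not the method evaluated in the paper: it assumes toy 2-D feature vectors, fits a single diagonal Gaussian instead of several, generates counter-examples by simple rejection sampling outside a 3-sigma region, and uses a k-nearest-neighbour vote as a stand-in for the ANN. All function names, thresholds, and sample counts are hypothetical choices made for illustration.

```python
import math
import random


def fit_diagonal_gaussian(samples):
    """Estimate the per-dimension mean and standard deviation."""
    n, dim = len(samples), len(samples[0])
    mean = [sum(s[d] for s in samples) / n for d in range(dim)]
    std = [math.sqrt(sum((s[d] - mean[d]) ** 2 for s in samples) / n)
           for d in range(dim)]
    return mean, std


def squared_distance(x, mean, std):
    """Squared Mahalanobis distance under a diagonal covariance."""
    return sum(((xi - m) / s) ** 2 for xi, m, s in zip(x, mean, std))


def generate_counter_examples(mean, std, count, rng, threshold=3.0, spread=6.0):
    """Rejection-sample points lying outside the modelled region.

    Assumption: candidates are drawn uniformly from a box of +/- `spread`
    standard deviations and kept only if they are farther than `threshold`
    standard deviations from the fitted mean.
    """
    negatives = []
    while len(negatives) < count:
        candidate = [rng.uniform(m - spread * s, m + spread * s)
                     for m, s in zip(mean, std)]
        if squared_distance(candidate, mean, std) > threshold ** 2:
            negatives.append(candidate)
    return negatives


def knn_predict(x, positives, negatives, k=5):
    """Return True if most of the k nearest training points are positive."""
    labelled = [(math.dist(x, p), True) for p in positives]
    labelled += [(math.dist(x, q), False) for q in negatives]
    labelled.sort(key=lambda pair: pair[0])
    votes = sum(1 for _, is_positive in labelled[:k] if is_positive)
    return votes > k // 2


rng = random.Random(0)
# Toy "real phoneme" samples: the only labelled data available.
positives = [[rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)] for _ in range(200)]
mean, std = fit_diagonal_gaussian(positives)
# Synthetic "anti-phoneme" samples generated from the fitted model.
negatives = generate_counter_examples(mean, std, count=200, rng=rng)

print(knn_predict([0.0, 0.0], positives, negatives))  # point near the data
print(knn_predict([5.0, 5.0], positives, negatives))  # point far from the data
```

In this sketch the classifier never sees a real negative sample; its decision boundary comes entirely from where the counter-examples are placed. This is why the choice of generation method matters for the final recognition scores.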