Incorporating external information in bayesian classifiers via linear feature transformations

Authors:
Tapio Pahikkala;Jorma Boberg;Aleksandr Mylläri;Tapio Salakoski
Affiliations:
Turku Centre for Computer Science (TUCS), Department of Information Technology, University of Turku, Turku, Finland;Turku Centre for Computer Science (TUCS), Department of Information Technology, University of Turku, Turku, Finland;Turku Centre for Computer Science (TUCS), Department of Information Technology, University of Turku, Turku, Finland;Turku Centre for Computer Science (TUCS), Department of Information Technology, University of Turku, Turku, Finland
Venue:
FinTAL'06 Proceedings of the 5th international conference on Advances in Natural Language Processing
Year:
2006

Citing 4
Cited 2

A Winnow-Based Approach to Context-Sensitive Spelling Correction

Machine Learning - Special issue on natural language learning
Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition

Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition
An empirical study of smoothing techniques for language modeling

ACL '96 Proceedings of the 34th annual meeting on Association for Computational Linguistics
Estimating continuous distributions in Bayesian classifiers

UAI'95 Proceedings of the Eleventh conference on Uncertainty in artificial intelligence

Matrix representations, linear transformations, and kernels for disambiguation in natural language

Machine Learning
Locality kernels for sequential data and their applications to parse ranking

Applied Intelligence

Quantified Score

Hi-index	0.00

Visualization

Abstract

Naive Bayes classifier is a frequently used method in various natural language processing tasks. Inspired by a modified version of the method called the flexible Bayes classifier, we explore the use of linear feature transformations together with the Bayesian classifiers, because it provides us an elegant way to endow the classifier with an external information that is relevant to the task. While the flexible Bayes classifier is based on the idea of using kernel density estimation to obtain the class conditional probabilities of continuously valued attributes, we use the linear transformations to smooth the feature frequency counts of discrete valued attributes. We evaluate the method on the context sensitive spelling error correction problem using the Reuters corpus. For this particular task, we define a positional feature transformation and a word feature transformation that take advantage of the positional information of the context words and the part-of-speech information of words, respectively. Our experimental results show that the performance of the Bayesian classifiers in the natural language disambiguation tasks can be improved with the proposed transformations and that the incorporation of external information via the linear feature transformations is a promising research direction.