Neural Learning from Unbalanced Data Using Noise Modeling

Authors:
Hong Guo;Yi Lu Murphey
Affiliations:
-;-
Venue:
Proceedings of the 14th International conference on Industrial and engineering applications of artificial intelligence and expert systems: engineering of intelligent systems
Year:
2001

Citing 6
Cited 0

The appeal of parallel distributed processing

Parallel distributed processing: explorations in the microstructure of cognition, vol. 1
Neural networks and fuzzy systems: a dynamical systems approach to machine intelligence

Neural networks and fuzzy systems: a dynamical systems approach to machine intelligence
Bayesian methods for adaptive models

Bayesian methods for adaptive models
Artificial Neural Networks for Intelligent Manufacturing

Artificial Neural Networks for Intelligent Manufacturing
Simulation and the Monte Carlo Method

Simulation and the Monte Carlo Method
On the Generalization Ability of Neural Network Classifiers

IEEE Transactions on Pattern Analysis and Machine Intelligence

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper describes the result of our study on neural learning to solve the classification problems in which data is unbalanced and noisy. We use multidimensional Gaussian distribution to analyze the separation of different class samples in a training data set, and then generate artificial noise samples in the training set using a noise modeling algorithm. The noise analysis allows us to identify special densities in the feature space that are prone to prediction error. We argue that by properly generate extra training data samples around the noise densities, we can train a neural network to have stronger capability of generalization and control the classification error of the trained neural network. In particular, we focus on the problems that require a neural network to make favorable classification to a particular class. The noise modeling algorithm has been implemented to solve a classification problem of good(pass) and bad(fail) vehicles in test sites of automobile assembly plants and a multi-layered Back Propagation neural network has been used in our experiments. The experimental results showed that the noise modeling algorithm was very effective in generate extra data samples that can be used to train a neural network to make favorable decisions to a minority class and to have increased generalization capability.