Determination of sample size using power analysis and optimum bin size of histogram features

  • Authors:
  • V. Indira;R. Vasanthakumari;N. R. Sakthivel;V. Sugumaran

  • Affiliations:
  • Department of Mathematics, Sri Manakula Vinayagar Engineering College, Madagadipet, Puducherry – 605107, India.;Department of Mathematics, Kasthurba College for Women, Villianur, Puducherry, India.;Department of Mechanical Engineering, Amrita School of Engineering, Amrita Vishwa Vidyapeetham, Ettimadai, Coimbatore, India.;Department of Mechatronics Engineering, SRM University, Kattankulathur, Kanchepuram Dt., India

  • Venue:
  • International Journal of Data Analysis Techniques and Strategies
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Vibration signals are used in fault diagnosis of rotary machines as a source of information. Lots of work have been reported on identification of faults in roller bearing by using many techniques. Of late, application of machine learning approach in fault diagnosis is gaining momentum. Machine learning approach consists of chain of activities like, data acquisition, feature extraction, feature selection and feature classification. While histogram features are used, there are still a few questions to be answered such as how many histogram bins are to be used to extract features and how many samples to be used to train the classifier. This paper provides a mathematical study to choose the bin size and the minimum sample size to train the classifier using power analysis with statistical stability. A typical bearing fault diagnosis problem is taken as a case for illustration and the results are compared with that of entropy based algorithm (J48) for determining minimum sample size and bin size.