A data mining approach considering missing values for the optimization of semiconductor-manufacturing processes

Authors:
Doh-Soon Kwak;Kwang-Jae Kim
Affiliations:
Samsung Electronics, San 61, Banwol-Dong, Hwasung, Gyeonggi 445-701, Republic of Korea;Division of Mechanical and Industrial Engineering, Pohang University of Science and Technology, San 31, Hyoja-Dong, Nam-Gu, Pohang, Kyungbuk 790-784, Republic of Korea
Venue:
Expert Systems with Applications: An International Journal
Year:
2012

Citing 4
Cited 2

Bump hunting in high-dimensional data

Statistics and Computing
Handling Missing Data in Trees: Surrogate Splits or Statistical Imputation

PKDD '99 Proceedings of the Third European Conference on Principles of Data Mining and Knowledge Discovery
Exploratory Data Mining and Data Cleaning

Exploratory Data Mining and Data Cleaning
Flexible patient rule induction method for optimizing process variables in discrete type

Expert Systems with Applications: An International Journal

An analysis on the use of pre-processing methods in evolutionary fuzzy systems for subgroup discovery

Expert Systems with Applications: An International Journal
A data mining driven risk profiling method for road asset management

Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining

Quantified Score

Hi-index	12.05

Visualization

Abstract

Due to the rapid development of information technologies, abundant data have become readily available. Data mining techniques have been used for process optimization in many manufacturing processes in automotive, LCD, semiconductor, and steel production, among others. However, a large amount of missing values occurs in the data set due to several causes (e.g., data discarded by gross measurement errors, measurement machine breakdown, routine maintenance, sampling inspection, and sensor failure), which frequently complicate the application of data mining to the data set. This study proposes a new procedure for optimizing processes called missing values-Patient Rule Induction Method (m-PRIM), which handles the missing-values problem systematically and yields considerable process improvement, even if a significant portion of the data set has missing values. A case study in a semiconductor manufacturing process is conducted to illustrate the proposed procedure.