Outlier effects on databases

Authors:
Ahmet Kaya
Affiliations:
Tire Kutsan Post, Secondary Vocational School, Ege University, Tire, İzmir, Turkey
Venue:
ADVIS'04 Proceedings of the Third international conference on Advances in Information Systems
Year:
2004

Abstract

Real data and databases always contain some kind of heterogeneity or contamination, commonly referred to as “outliers”. Outliers are the few observations or records that appear inconsistent with the remainder of the sample and that exert a disproportionate influence on predicted values. Isolated outliers may also have a positive impact on the results of data analysis and data mining. In this study, we are concerned with outliers in time series, for which we consider two special cases: the innovational outlier (IO) and the additive outlier (AO). The occurrence of an AO indicates that action is required, possibly to adjust the measuring instrument or at least to write an error message to the database. If an IO occurs, however, no adjustment of the measurement operation is required. At the end of the study, we present the results of the simulation and variance analysis on the generated data sets.
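
The distinction between the two outlier types can be illustrated with a small simulation. The following is a minimal sketch, not the simulation used in the study: it assumes a first-order autoregressive model, and the function name `simulate_ar1`, the coefficient 0.7, the outlier position 20, and the outlier size 5.0 are all hypothetical choices made for illustration. Under the usual definitions, an AO perturbs only the recorded value at one time point, whereas an IO perturbs the innovation and therefore carries over into later values.

```python
import random


def simulate_ar1(n, phi, outlier_at=None, size=0.0, kind=None, seed=0):
    """Simulate x_t = phi * x_{t-1} + e_t and return the observed series.

    kind="AO": the shock is added to the recorded value only, as a
               measurement error would be; the underlying process is untouched.
    kind="IO": the shock is added to the innovation e_t, so it is carried
               forward by the autoregressive dynamics.
    (Illustrative assumption: AR(1) with standard normal innovations.)
    """
    rng = random.Random(seed)
    x_prev = 0.0
    observed = []
    for t in range(n):
        e = rng.gauss(0.0, 1.0)
        if kind == "IO" and t == outlier_at:
            e += size
        x = phi * x_prev + e          # underlying process value
        y = x                         # value written to the database
        if kind == "AO" and t == outlier_at:
            y += size
        observed.append(y)
        x_prev = x
    return observed


# Same seed, so all three series share identical innovations.
clean = simulate_ar1(50, 0.7)
ao = simulate_ar1(50, 0.7, outlier_at=20, size=5.0, kind="AO")
io = simulate_ar1(50, 0.7, outlier_at=20, size=5.0, kind="IO")

# Difference from the clean series isolates the effect of each outlier type.
ao_effect = [a - c for a, c in zip(ao, clean)]
io_effect = [i - c for i, c in zip(io, clean)]

print([round(v, 3) for v in ao_effect[19:24]])
print([round(v, 3) for v in io_effect[19:24]])
```

In this sketch the AO effect is confined to the single record at position 20, which is consistent with treating it as a measurement problem to be corrected or flagged. The IO effect starts at the same position and then decays geometrically with the autoregressive coefficient, since it is part of the process itself rather than of the measurement.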