An extensible metadata framework for data quality assessment of composite structures

Authors:
José Farinha;Maria José Trigueiros
Affiliations:
ISCTE/ADETTI, Department of Science and Information Technology, Lisbon, Portugal;ISCTE/ADETTI, Department of Science and Information Technology, Lisbon, Portugal
Venue:
DaWaK'07 Proceedings of the 9th international conference on Data Warehousing and Knowledge Discovery
Year:
2007

Citing 9
Cited 1

The Unified Modeling Language user guide

The Unified Modeling Language user guide
AJAX: an extensible data cleaning tool

SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Data Quality: The Accuracy Dimension

Data Quality: The Accuracy Dimension
Potter's Wheel: An Interactive Data Cleaning System

Proceedings of the 27th International Conference on Very Large Data Bases
Exploratory Data Mining and Data Cleaning

Exploratory Data Mining and Data Cleaning
Data quality through knowledge engineering

Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
A data quality metamodel extension to CWM

APCCM '07 Proceedings of the fourth Asia-Pacific conference on Comceptual modelling - Volume 67
A metamodel for OCL

UML'99 Proceedings of the 2nd international conference on The unified modeling language: beyond the standard
A library of OCL specification patterns for behavioral specification of software components

CAiSE'06 Proceedings of the 18th international conference on Advanced Information Systems Engineering

Using inheritance in a metadata based approach to data quality assessment

Proceedings of the first international workshop on Model driven service engineering and data quality and security

Quantified Score

Hi-index	0.00

Visualization

Abstract

Data quality is a critical issue both in operational databases and in data warehouse systems. Data quality assessment is a strong requirement regarding the ETL subsystem, since bad data may destroy data warehouse credibility. During the last two decades, research and development efforts in the data quality field have produced techniques for data profiling and cleaning, which focus on detecting and correcting bad values in data. Little efforts have been done considering data quality when it relates to the well-formedness of coarse grained data structures resulting from the assembly of linked data records. This paper proposes a metadata model that supports the structural validation of linked data records, from a data quality point of view. The metamodel is built on top of the CWM standard and it supports the specification of data structure quality rules in a high level of abstraction, as well as by means of very specific fine grained business rules.