The Unified Modeling Language user guide
The Unified Modeling Language user guide
AJAX: an extensible data cleaning tool
SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Data Quality: The Accuracy Dimension
Data Quality: The Accuracy Dimension
Potter's Wheel: An Interactive Data Cleaning System
Proceedings of the 27th International Conference on Very Large Data Bases
Exploratory Data Mining and Data Cleaning
Exploratory Data Mining and Data Cleaning
Data quality through knowledge engineering
Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
A data quality metamodel extension to CWM
APCCM '07 Proceedings of the fourth Asia-Pacific conference on Comceptual modelling - Volume 67
UML'99 Proceedings of the 2nd international conference on The unified modeling language: beyond the standard
A library of OCL specification patterns for behavioral specification of software components
CAiSE'06 Proceedings of the 18th international conference on Advanced Information Systems Engineering
Using inheritance in a metadata based approach to data quality assessment
Proceedings of the first international workshop on Model driven service engineering and data quality and security
Hi-index | 0.00 |
Data quality is a critical issue both in operational databases and in data warehouse systems. Data quality assessment is a strong requirement regarding the ETL subsystem, since bad data may destroy data warehouse credibility. During the last two decades, research and development efforts in the data quality field have produced techniques for data profiling and cleaning, which focus on detecting and correcting bad values in data. Little efforts have been done considering data quality when it relates to the well-formedness of coarse grained data structures resulting from the assembly of linked data records. This paper proposes a metadata model that supports the structural validation of linked data records, from a data quality point of view. The metamodel is built on top of the CWM standard and it supports the specification of data structure quality rules in a high level of abstraction, as well as by means of very specific fine grained business rules.