Service-Oriented Architecture (SOA) is a new paradigm for integrating distributed software, especially e-Business applications. Exchanging reliable data between services is essential. In this paper, we propose a methodology for detecting and cleansing dirty data exchanged between services, a task that differs from cleansing large, static data sets in database systems. We also develop a data cleansing service based on SOA. By cleansing the data exchanged between services, this service makes it possible to improve the quality of services and to manage data effectively for a variety of SOA-based applications. As an empirical study, we applied the service to clean dirty data exchanged between CRM and ERP services and showed that the dirty data rate could be reduced by more than 30%.