@@@@ groups together matching pairs with a high similarity value by applying a given grouping criterion (e.g., transitive closure). Finally, merging collapses each individual cluster into a tuple of the resulting data source. AJAX provides @@@@ for specifying data cleaning programs, which consists of SQL statements enriched with a set of specific primitives to express these transformations. AJAX also @@@@. It allows the user to interact with an executing data cleaning program to handle exceptional cases and to inspect intermediate results. Finally, AJAX provides @@@@ @@@@ that permits users to determine the source and processing of data for debugging purposes. We will present the AJAX system applied to two real-world problems: the consolidation of a telecommunication database, and the conversion of a dirty database of bibliographic references into a set of clean, normalized, and redundancy-free relational tables maintaining the same data.
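The grouping of matching pairs by transitive closure, and the collapsing of each cluster into one tuple, can be illustrated with a minimal Python sketch. This is not AJAX's language or implementation: the function names, the similarity threshold, the sample records, and the "keep the longest value per field" merge policy are all assumptions made for illustration.

```python
from collections import defaultdict


def cluster_by_transitive_closure(record_ids, matching_pairs, threshold):
    """Group record ids linked, directly or indirectly, by high-similarity pairs.

    Hypothetical helper: matching_pairs holds (left_id, right_id, similarity).
    """
    record_ids = list(record_ids)
    parent = {rid: rid for rid in record_ids}

    def find(rid):
        # Walk up to the set representative, shortening the path on the way.
        while parent[rid] != rid:
            parent[rid] = parent[parent[rid]]
            rid = parent[rid]
        return rid

    for left, right, similarity in matching_pairs:
        if similarity >= threshold:
            # Union the two sets: this is what makes the grouping transitive.
            parent[find(left)] = find(right)

    clusters = defaultdict(list)
    for rid in record_ids:
        clusters[find(rid)].append(rid)
    return sorted(sorted(members) for members in clusters.values())


def merge_cluster(records, members):
    """Collapse one cluster into a single tuple.

    Assumed policy (not from the source): keep the longest value per field.
    """
    fields = records[members[0]].keys()
    return {f: max((records[m][f] for m in members), key=len) for f in fields}


# Made-up sample data for illustration only.
records = {
    1: {"name": "J. Smith", "city": "Paris"},
    2: {"name": "John Smith", "city": ""},
    3: {"name": "John Smith", "city": "Paris"},
    4: {"name": "A. Jones", "city": "Lyon"},
}
pairs = [(1, 2, 0.90), (2, 3, 0.95), (1, 4, 0.20)]

clusters = cluster_by_transitive_closure(records, pairs, threshold=0.8)
merged = [merge_cluster(records, members) for members in clusters]
```

Records 1 and 3 are never compared directly in this sample, yet they land in the same cluster because both match record 2. That is the effect of choosing transitive closure as the grouping criterion; a stricter criterion would need a different clustering step.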