We propose a class of integrity constraints for relational databases, referred to as conditional functional dependencies (CFDs), and study their applications in data cleaning. In contrast to traditional functional dependencies (FDs), which were developed mainly for schema design, CFDs aim at capturing the consistency of data by enforcing bindings of semantically related values. For static analysis of CFDs, we investigate the consistency problem, which is to determine whether or not there exists a nonempty database satisfying a given set of CFDs, and the implication problem, which is to decide whether or not a set of CFDs entails another CFD. We show that while any set of traditional FDs is trivially consistent, the consistency problem is NP-complete for CFDs, but it is in PTIME when either the database schema is predefined or no attributes involved in the CFDs have a finite domain. For the implication analysis of CFDs, we provide an inference system analogous to Armstrong's axioms for FDs, and show that the implication problem is coNP-complete for CFDs, in contrast to the linear-time complexity for their traditional counterpart. We also present an algorithm for computing a minimal cover of a set of CFDs. Since CFDs allow data bindings, in some cases CFDs may be physically large, complicating the detection of constraint violations. We develop techniques for detecting CFD violations in SQL as well as novel techniques for checking multiple constraints by a single query. We also provide incremental methods for checking CFDs in response to changes to the database. We experimentally verify the effectiveness of our CFD-based methods for inconsistency detection. This work not only yields a constraint theory for CFDs but is also a step toward a practical constraint-based method for improving data quality.
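To make the idea of "enforcing bindings of semantically related values" concrete, the following is a minimal in-memory sketch of CFD violation detection. It is not the SQL-based technique of the paper. It assumes a CFD is written as an embedded FD X → Y plus one pattern tuple, where `_` is a wildcard and any other entry is a constant the data must match. The relation, the attribute names (`CC`, `AC`, `ZIP`, `STR`, `CT`), the sample rows, and the helper names `matches` and `cfd_violations` are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict

WILDCARD = "_"  # assumed notation for "any value" in a pattern tuple


def matches(values, pattern):
    """True if every value equals its pattern entry or the entry is a wildcard."""
    return all(p == WILDCARD or v == p for v, p in zip(values, pattern))


def cfd_violations(rows, lhs, rhs, lhs_pattern, rhs_pattern):
    """Check one CFD (lhs -> rhs, lhs_pattern || rhs_pattern) against rows.

    Returns a pair (single, multi):
      single: indexes of rows that match lhs_pattern but whose rhs values
              do not match rhs_pattern (a single tuple breaks the binding)
      multi:  lhs value combinations that match lhs_pattern and occur with
              more than one distinct rhs value (tuples disagree on rhs)
    """
    single = []
    groups = defaultdict(set)
    for i, row in enumerate(rows):
        x = tuple(row[a] for a in lhs)
        if not matches(x, lhs_pattern):
            continue  # the condition does not apply to this row
        y = tuple(row[a] for a in rhs)
        if not matches(y, rhs_pattern):
            single.append(i)
        groups[x].add(y)
    multi = sorted(x for x, ys in groups.items() if len(ys) > 1)
    return single, multi


# Hypothetical customer rows (country code, area code, zip, street, city).
rows = [
    {"CC": "44", "AC": "131", "ZIP": "EH4 1DT", "STR": "Mayfield", "CT": "EDI"},
    {"CC": "44", "AC": "131", "ZIP": "EH4 1DT", "STR": "Crichton", "CT": "EDI"},
    {"CC": "01", "AC": "908", "ZIP": "07974", "STR": "Mtn Ave", "CT": "NYC"},
    {"CC": "01", "AC": "908", "ZIP": "07974", "STR": "Main St", "CT": "MH"},
]

# CFD 1: when CC = 44, ZIP determines STR.
cfd1 = cfd_violations(rows, ["CC", "ZIP"], ["STR"], ("44", "_"), ("_",))

# CFD 2: when CC = 01 and AC = 908, CT must be MH.
cfd2 = cfd_violations(rows, ["CC", "AC"], ["CT"], ("01", "908"), ("MH",))
```

Under these assumptions, the first CFD flags only the two rows with `CC = 44`, which share a zip code but disagree on the street. The two `CC = 01` rows also share a zip code with different streets, yet they are not flagged, because the condition in the pattern tuple does not apply to them. A plain FD `[CC, ZIP] → [STR]` would have flagged both groups. The second CFD shows a constant binding: the row with city `NYC` violates it on its own, without reference to any other tuple.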