Consistent query answers in inconsistent databases
PODS '99 Proceedings of the eighteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
AJAX: an extensible data cleaning tool
SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
On propagation of deletions and annotations through views
Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Real-world Data is Dirty: Data Cleansing and The Merge/Purge Problem
Data Mining and Knowledge Discovery
Condensed Representation of Database Repairs for Consistent Query Answering
ICDT '03 Proceedings of the 9th International Conference on Database Theory
Declarative Data Cleaning: Language, Model, and Algorithms
Proceedings of the 27th International Conference on Very Large Data Bases
Potter's Wheel: An Interactive Data Cleaning System
Proceedings of the 27th International Conference on Very Large Data Bases
A survey of approaches to automatic schema matching
The VLDB Journal — The International Journal on Very Large Data Bases
On the decidability and complexity of query answering over inconsistent and incomplete databases
Proceedings of the twenty-second ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
A Logical Framework for Querying and Repairing Inconsistent Databases
IEEE Transactions on Knowledge and Data Engineering
Minimal-change integrity maintenance using tuple deletions
Information and Computation
Logic programs for consistently querying data integration systems
IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Query rewriting and answering under constraints in data integration systems
IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
On the computational complexity of minimal-change integrity maintenance in relational databases
Inconsistency Tolerance
An intensional approach to the specification of test cases for database applications
Proceedings of the 28th international conference on Software engineering
Describing differences between databases
CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Leveraging aggregate constraints for deduplication
Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Consistent data for inconsistent XML document
Information and Software Technology
Referential integrity quality metrics
Decision Support Systems
Extending dependencies with conditions
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Improving data quality: consistency and accuracy
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
A three-valued semantics for querying and repairing inconsistent databases
Annals of Mathematics and Artificial Intelligence
Conditional functional dependencies for capturing data inconsistencies
ACM Transactions on Database Systems (TODS)
Dependencies revisited for improving data quality
Proceedings of the twenty-seventh ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
World-set decompositions: Expressiveness and efficient algorithms
Theoretical Computer Science
Preferred Database Repairs Under Aggregate Constraints
SUM '07 Proceedings of the 1st international conference on Scalable Uncertainty Management
Reconciling Inconsistent Data in Probabilistic XML Data Integration
BNCOD '08 Proceedings of the 25th British national conference on Databases: Sharing Data, Information and Knowledge
Consistent Query Answering: The First Ten Years
SUM '08 Proceedings of the 2nd international conference on Scalable Uncertainty Management
Semandaq: a data quality system based on conditional functional dependencies
Proceedings of the VLDB Endowment
ACM Computing Surveys (CSUR)
How Dirty Is Your Relational Database? An Axiomatic Approach
ECSQARU '07 Proceedings of the 9th European Conference on Symbolic and Quantitative Approaches to Reasoning with Uncertainty
Approximate Probabilistic Query Answering over Inconsistent Databases
ER '08 Proceedings of the 27th International Conference on Conceptual Modeling
Repair checking in inconsistent databases: algorithms and complexity
Proceedings of the 12th International Conference on Database Theory
On approximating optimum repairs for functional dependency violations
Proceedings of the 12th International Conference on Database Theory
Conditional Dependencies: A Principled Approach to Improving Data Quality
BNCOD 26 Proceedings of the 26th British National Conference on Databases: Dataspace: The Final Frontier
Analyses and Validation of Conditional Dependencies with Built-in Predicates
DEXA '09 Proceedings of the 20th International Conference on Database and Expert Systems Applications
Consistent query answers in the presence of universal constraints
Information Systems
The VLDB Journal — The International Journal on Very Large Data Bases
Generic entity resolution with negative rules
The VLDB Journal — The International Journal on Very Large Data Bases
Querying and repairing inconsistent numerical databases
ACM Transactions on Database Systems (TODS)
Computing repairs for inconsistent XML document using chase
APWeb/WAIM'07 Proceedings of the joint 9th Asia-Pacific web and 8th international conference on web-age information management conference on Advances in data and web management
The consistency extractor system: Answer set programs for consistent query answering in databases
Data & Knowledge Engineering
Polynomial time queries over inconsistent databases with functional dependencies and foreign keys
Data & Knowledge Engineering
ERACER: a database approach for statistical inference and data cleaning
Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
Consistent query answers in inconsistent probabilistic databases
Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
GDR: a system for guided data repair
Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
Development of foundation models for Internet of Things
Frontiers of Computer Science in China
Exploiting conflict structures in inconsistent databases
ADBIS'10 Proceedings of the 14th east European conference on Advances in databases and information systems
Consistent answers to boolean aggregate queries under aggregate constraints
DEXA'10 Proceedings of the 21st international conference on Database and expert systems applications: Part II
Towards certain fixes with editing rules and master data
Proceedings of the VLDB Endowment
Sampling the repairs of functional dependency violations under hard constraints
Proceedings of the VLDB Endowment
Record linkage with uniqueness constraints and erroneous values
Proceedings of the VLDB Endowment
Range-consistent answers of aggregate queries under aggregate constraints
SUM'10 Proceedings of the 4th international conference on Scalable uncertainty management
Efficient policy-based inconsistency management in relational knowledge bases
SUM'10 Proceedings of the 4th international conference on Scalable uncertainty management
Handling dirty databases: from user warning to data cleaning -- towards an interactive approach
SUM'10 Proceedings of the 4th international conference on Scalable uncertainty management
Evaluation query answer over inconsistent database with annotations
WISM'10 Proceedings of the 2010 international conference on Web information systems and mining
Annotation based query answer over inconsistent database
Journal of Computer Science and Technology
Proceedings of the VLDB Endowment
Interaction between record matching and data repairing
Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
Improving XML data quality with functional dependencies
DASFAA'11 Proceedings of the 16th international conference on Database systems for advanced applications - Volume Part I
Detecting and repairing anomalous evolutions in noisy environments
Annals of Mathematics and Artificial Intelligence
Conflict-aware historical data fusion
SUM'11 Proceedings of the 5th international conference on Scalable uncertainty management
Extending functional dependency to detect abnormal data in RDF graphs
ISWC'11 Proceedings of the 10th international conference on The semantic web - Volume Part I
Cost-efficient repair in inconsistent probabilistic databases
Proceedings of the 20th ACM international conference on Information and knowledge management
Consistent query answering: five easy pieces
ICDT'07 Proceedings of the 11th international conference on Database Theory
World-set decompositions: expressiveness and efficient algorithms
ICDT'07 Proceedings of the 11th international conference on Database Theory
Complexity and approximation of fixing numerical attributes in databases under integrity constraints
DBPL'05 Proceedings of the 10th international conference on Database Programming Languages
Consistent query answers on numerical databases under aggregate constraints
DBPL'05 Proceedings of the 10th international conference on Database Programming Languages
Improving data quality by source analysis
Journal of Data and Information Quality (JDIQ)
Repairing inconsistent XML documents
KSEM'06 Proceedings of the First international conference on Knowledge Science, Engineering and Management
Validity-sensitive querying of XML databases
EDBT'06 Proceedings of the 2006 international conference on Current Trends in Database Technology
DART: a data acquisition and repairing tool
EDBT'06 Proceedings of the 2006 international conference on Current Trends in Database Technology
Preference-driven querying of inconsistent relational databases
EDBT'06 Proceedings of the 2006 international conference on Current Trends in Database Technology
Project-Join-Repair: an approach to consistent query answering under functional dependencies
FQAS'06 Proceedings of the 7th international conference on Flexible Query Answering Systems
Towards certain fixes with editing rules and master data
The VLDB Journal — The International Journal on Very Large Data Bases
Detecting suspect answers in the presence of inconsistent information
FoIKS'12 Proceedings of the 7th international conference on Foundations of Information and Knowledge Systems
Contributions to personalizable knowledge integration
IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
Repairing XML functional dependency violations
Information Sciences: an International Journal
Probabilistic query answering over inconsistent databases
Annals of Mathematics and Artificial Intelligence
The data analytics group at the qatar computing research institute
ACM SIGMOD Record
NADEEF: a commodity data cleaning system
Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
A data cleaning framework based on user feedback
WAIM'13 Proceedings of the 14th international conference on Web-Age Information Management
Minimal spatio-temporal database repairs
Proceedings of the 21st ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems
Efficient recovery of missing events
Proceedings of the VLDB Endowment
NADEEF: a generalized data cleaning system
Proceedings of the VLDB Endowment
On repairing structural problems in semi-structured data
Proceedings of the VLDB Endowment
The LLUNATIC data-cleaning framework
Proceedings of the VLDB Endowment
Extending inclusion dependencies with conditions
Theoretical Computer Science
Policy-based inconsistency management in relational databases
International Journal of Approximate Reasoning
Query answering under probabilistic uncertainty in Datalog+ / - ontologies
Annals of Mathematics and Artificial Intelligence
Sampling from repairs of conditional functional dependency violations
The VLDB Journal — The International Journal on Very Large Data Bases
Hi-index | 0.00 |
Data integrated from multiple sources may contain inconsistencies that violate integrity constraints. The constraint repair problem attempts to find "low cost" changes that, when applied, will cause the constraints to be satisfied. While in most previous work repair cost is stated in terms of tuple insertions and deletions, we follow recent work to define a database repair as a set of value modifications. In this context, we introduce a novel cost framework that allows for the application of techniques from record-linkage to the search for good repairs. We prove that finding minimal-cost repairs in this model is NP-complete in the size of the database, and introduce an approach to heuristic repair-construction based on equivalence classes of attribute values. Following this approach, we define two greedy algorithms. While these simple algorithms take time cubic in the size of the database, we develop optimizations inspired by algorithms for duplicate-record detection that greatly improve scalability. We evaluate our framework and algorithms on synthetic and real data, and show that our proposed optimizations greatly improve performance at little or no cost in repair quality.