The development of the Internet in recent years has made it possible and useful to access many different information systems anywhere in the world. While there is much research on the integration of heterogeneous information systems, most commercial systems stop short of actually integrating the available data. Data fusion is the process of fusing multiple records representing the same real-world object into a single, consistent, and clean representation. This article places data fusion into the greater context of data integration, precisely defines the goals of data fusion, namely, complete, concise, and consistent data, and highlights the challenges of data fusion, namely, uncertain and conflicting data values. We give an overview and classification of different ways of fusing data and present several techniques based on standard and advanced operators of the relational algebra and SQL. Finally, the article features a comprehensive survey of data integration systems from academia and industry, showing whether and how data fusion is performed in each.
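To make the definition of data fusion concrete, the following minimal Python sketch fuses several records that describe the same real-world object into one record. It is an illustration under stated assumptions, not a technique taken from the article: the names `fuse` and `resolve_vote` are hypothetical, duplicates are assumed to have been identified already and to share an `id` value, and majority voting is only one of many possible conflict-resolution strategies.

```python
from collections import Counter, defaultdict


def resolve_vote(values):
    """Hypothetical resolution function: return the most frequent value.

    Ties are broken in favor of the value that appears first.
    """
    counts = Counter(values)
    best = max(counts.values())
    for value in values:
        if counts[value] == best:
            return value


def fuse(records, key="id", resolve=resolve_vote):
    """Fuse records that share the same key into one record per object.

    Assumes duplicate detection has already assigned a common key value
    to all records describing the same real-world object.
    """
    groups = defaultdict(list)
    for record in records:
        groups[record[key]].append(record)

    fused = []
    for object_id, group in groups.items():
        # Completeness: collect every attribute seen in any source record.
        attributes = []
        for record in group:
            for name in record:
                if name != key and name not in attributes:
                    attributes.append(name)

        # Conciseness and consistency: one record, one value per attribute.
        result = {key: object_id}
        for name in attributes:
            values = [r[name] for r in group if r.get(name) is not None]
            result[name] = resolve(values) if values else None
        fused.append(result)
    return fused


records = [
    {"id": 1, "name": "Alice Smith", "city": "Berlin", "phone": None},
    {"id": 1, "name": "Alice Smith", "city": "Potsdam", "phone": "555-0100"},
    {"id": 1, "name": "A. Smith", "city": "Berlin"},
    {"id": 2, "name": "Bob Jones", "email": "bob@example.org"},
]

for row in fuse(records):
    print(row)
```

In this sketch, missing values (uncertainty) are filled from whichever record supplies one, while contradicting values (conflicts) are settled by the resolution function. Passing a different `resolve` callable, for example one that prefers the longest string, changes the strategy without touching the grouping logic.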