Update propagation protocols for replicated databates
SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
GlOSS: text-source discovery over the Internet
ACM Transactions on Database Systems (TODS)
EXPRESS: a data EXtraction, Processing, and Restructuring System
ACM Transactions on Database Systems (TODS)
The impact of database selection on distributed searching
SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
An architecture for transparent access to diverse data sources
Component database systems
A new approach to developing and implementing eager database replication protocols
ACM Transactions on Database Systems (TODS)
Capabilities-based query rewriting in mediator systems
DIS '96 Proceedings of the fourth international conference on on Parallel and distributed information systems
Building efficient and effective metasearch engines
ACM Computing Surveys (CSUR)
Data integration: a theoretical perspective
Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Real-world Data is Dirty: Data Cleansing and The Merge/Purge Problem
Data Mining and Knowledge Discovery
Optimizing Queries Across Diverse Data Sources
VLDB '97 Proceedings of the 23rd International Conference on Very Large Data Bases
Optimization of Nested Queries in a Distributed Relational Database
VLDB '84 Proceedings of the 10th International Conference on Very Large Data Bases
Schema Mapping as Query Discovery
VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
A Data Transformation System for Biological Data Sources
VLDB '95 Proceedings of the 21th International Conference on Very Large Data Bases
Querying Heterogeneous Information Sources Using Source Descriptions
VLDB '96 Proceedings of the 22th International Conference on Very Large Data Bases
A survey of approaches to automatic schema matching
The VLDB Journal — The International Journal on Very Large Data Bases
Exploratory Data Mining and Data Cleaning
Exploratory Data Mining and Data Cleaning
Flexible Support of Team Processes by Adaptive Workflow Systems
Distributed and Parallel Databases
CORDS: automatic discovery of correlations and soft functional dependencies
SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
Advances in dataflow programming languages
ACM Computing Surveys (CSUR)
Natural Language Engineering
Schema mappings, data exchange, and metadata management
Proceedings of the twenty-fourth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Clio grows up: from research prototype to industrial tool
Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Data exchange: semantics and query answering
Theoretical Computer Science - Database theory
Inverted files for text search engines
ACM Computing Surveys (CSUR)
Principles of dataspace systems
Proceedings of the twenty-fifth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Managing information extraction: state of the art and research directions
Proceedings of the 2006 ACM SIGMOD international conference on Management of data
Record linkage: similarity measures and algorithms
Proceedings of the 2006 ACM SIGMOD international conference on Management of data
Accessing the web: from search to integration
Proceedings of the 2006 ACM SIGMOD international conference on Management of data
Beyond the data deluge: data integration and bio-ontologies
Journal of Biomedical Informatics - Special issue: Biomedical ontologies
Data Integration in the Life Sciences: Third International Workshop, DILS 2006, Hinxton, UK, July 20-22, 2006, Proceedings (Lecture Notes in Computer Science)
DB2 design advisor: integrated automatic physical database design
VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Model management 2.0: manipulating richer mappings
Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Data integration with uncertainty
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Discovering topical structures of databases
Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Information integration in the enterprise
Communications of the ACM - Enterprise information integration: and other tools for merging data
From Schema and Model Translation to a Model Management System
BNCOD '08 Proceedings of the 25th British national conference on Databases: Sharing Data, Information and Knowledge
STBenchmark: towards a benchmark for mapping systems
Proceedings of the VLDB Endowment
Incompleteness in information integration
Proceedings of the VLDB Endowment
The Harmony Integration Workbench
Journal on Data Semantics XI
On keys, foreign keys and nullable attributes in relational mapping systems
Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
ICAC '09 Proceedings of the 6th international conference on Autonomic computing
Clio: Schema Mapping Creation and Data Exchange
Conceptual Modeling: Foundations and Applications
New Challenges in Information Integration
DaWaK '09 Proceedings of the 11th International Conference on Data Warehousing and Knowledge Discovery
Structural characterizations of schema-mapping languages
Communications of the ACM - Amir Pnueli: Ahead of His Time
Data fusion: resolving data conflicts for integration
Proceedings of the VLDB Endowment
Schema AND Data: A Holistic Approach to Mapping, Resolution and Fusion in Information Integration
ER '09 Proceedings of the 28th International Conference on Conceptual Modeling
Schema and data translation: a personal perspective
ADBIS'07 Proceedings of the 11th East European conference on Advances in databases and information systems
LIVE: a lineage-supported versioned DBMS
SSDBM'10 Proceedings of the 22nd international conference on Scientific and statistical database management
Semantic annotation of web objects using constrained conditional random fields
WAIM'10 Proceedings of the 11th international conference on Web-age information management
Foundations of uncertain-data integration
Proceedings of the VLDB Endowment
2D correlative-chain conditional random fields for semantic annotation of web objects
Journal of Computer Science and Technology
Conflict-aware historical data fusion
SUM'11 Proceedings of the 5th international conference on Scalable uncertainty management
Adapting Searchy to extract data using evolved wrappers
Expert Systems with Applications: An International Journal
DSToolkit: an architecture for flexible dataspace management
Transactions on Large-Scale Data- and Knowledge-Centered Systems V
What is the IQ of your data transformation system?
Proceedings of the 21st ACM international conference on Information and knowledge management
On the foundations of probabilistic information integration
Proceedings of the 21st ACM international conference on Information and knowledge management
A compact representation for efficient uncertain-information integration
Proceedings of the 17th International Database Engineering & Applications Symposium
MatchBench: benchmarking schema matching algorithms for schematic correspondences
BNCOD'13 Proceedings of the 29th British National conference on Big Data
Hi-index | 0.01 |
Information integration is becoming a critical problem for businesses and individuals alike. Data volumes are sky-rocketing, and new sources and types of information are proliferating. This paper briefly reviews some of the key research accomplishments in information integration (theory and systems), then describes the current state-of-the-art in commercial practice, and the challenges (still) faced by CIOs and application developers. One critical challenge is choosing the right combination of tools and technologies to do the integration. Although each has been studied separately, we lack a unified (and certainly, a unifying) understanding of these various approaches to integration. Experience with a variety of integration projects suggests that we need a broader framework, perhaps even a theory, which explicitly takes into account requirements on the result of the integration, and considers the entire end-to-end integration process.