Capturing more meaning in databases
Journal of Management Information Systems
Stategic alternatives and inter-organizational system implementations: an overview
Journal of Management Information Systems
Evolution towards strategic applications of databases through composite information systems
Journal of Management Information Systems
Information resource management: a metadata perspective
Journal of Management Information Systems
Journal of Management Information Systems - Special issue: Decision support and knowledge-based systems
Automated resolution of semantic heterogeneity in multidatabases
ACM Transactions on Database Systems (TODS)
The relationship between recall and precision
Journal of the American Society for Information Science
String searching algorithms
Overview of the second text retrieval conference (TREC-2)
TREC-2 Proceedings of the second conference on Text retrieval conference
Semantic similarity relations and computation in schema integration
Data & Knowledge Engineering
The linguistic level: contribution for conceptual design, view integration, reuse and documentation
Data & Knowledge Engineering - Special issue natural language for data bases
Supporting schema integration by linguistic instruments
Data & Knowledge Engineering - Special issue natural language for data bases
Semantic integration of conceptual schemas
Data & Knowledge Engineering - Special issue natural language for data bases
Schema integration: past, present, and future
Management of heterogeneous and autonomous database systems
Managing heterogeneous information systems through discovery and retrieval of generic concepts
Journal of the American Society for Information Science
Data & Knowledge Engineering
Intensional and extensional integration and abstraction of heterogeneous databases
Data & Knowledge Engineering
Matching records in a national medical patient index
Communications of the ACM
Discovering and reconciling value conflicts for numerical data integration
Information Systems - Data extraction, cleaning and reconciliation
DIRECT: a system for mining data value conversion rules from disparate data sources
Decision Support Systems
Multi-User View Integration System (MUVIS): An Expert System for View Integration
Proceedings of the Sixth International Conference on Data Engineering
Generic Schema Matching with Cupid
Proceedings of the 27th International Conference on Very Large Data Bases
Reducing Inconsistency in Integrating Data From Different Sources
IDEAS '01 Proceedings of the International Database Engineering & Applications Symposium
Asessing Semnatic Similarities among Geospatial Feature Class Definitions
INTEROP '99 Proceedings of the Second International Conference on Interoperating Geographic Information Systems
Automatic Classification of Semantic Concepts in View Specifications
DEXA '96 Proceedings of the 7th International Conference on Database and Expert Systems Applications
Semantic Based Schema Analysis
DEXA '98 Proceedings of the 9th International Conference on Database and Expert Systems Applications
Information Systems Research
An Approach for Measuring Semantic Similarity between Words Using Multiple Information Sources
IEEE Transactions on Knowledge and Data Engineering
On schema matching with opaque column names and data values
Proceedings of the 2003 ACM SIGMOD international conference on Management of data
iMAP: discovering complex semantic matches between database schemas
SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
Journal of Management Information Systems - Special section: Strategic and competitive information systems
Journal of Management Information Systems
Information Exploitation and Interorganizational Systems Ownership
Journal of Management Information Systems
Coordinating for Flexibility in e-Business Supply Chains
Journal of Management Information Systems
Journal of Management Information Systems
Integration in Electronic Exchange Environments
Journal of Management Information Systems
Journal of Management Information Systems
Using information content to evaluate semantic similarity in a taxonomy
IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 1
Design science in information systems research
MIS Quarterly
Information about information: a taxonomy of views
MIS Quarterly
Matching Attributes across Overlapping Heterogeneous Data Sources Using Mutual Information
Journal of Database Management
Hi-index | 0.00 |
Identifying attribute correspondences across heterogeneous databases is a critical and time-consuming step in integrating the databases. Past research has applied correlation analysis techniques to explore correspondences between attributes. These techniques, however, are appropriate for numeric attributes that are linearly related. This paper proposes an information-theoretic approach to exploring correspondences between attributes in heterogeneous databases. The proposed approach is applicable to character attributes, as well as to numeric attributes, regardless whether or not they are linearly related. It overcomes some serious shortcomings of previous approaches based on correlation analysis and has much broader applicability. The proposed procedure samples both matching and nonmatching pairs of records from the databases under consideration, applies matching functions to compare pairs of attributes, and then uses the mutual information to measure the dependency between a matching function as applied to a pair of attributes and the class (i.e., matching or nonmatching) of a pair of records. A high mutual information index implies a potential attribute correspondence, which is presented to the analyst for further evaluation. The paper also presents some empirical results demonstrating the utility of the proposed approach.