Schema matching is the problem of identifying corresponding elements in different schemas. Discovering these correspondences, or matches, is inherently difficult to automate. Past solutions have proposed a principled combination of multiple algorithms. However, these solutions sometimes perform rather poorly due to the lack of sufficient evidence in the schemas being matched. In this paper we show how a corpus of schemas and mappings can be used to augment the evidence about the schemas being matched, so that they can be matched more accurately. Such a corpus typically contains multiple schemas that model similar concepts and hence enables us to learn variations in the elements and their properties. We exploit such a corpus in two ways. First, we increase the evidence about each element being matched by including evidence from similar elements in the corpus. Second, we learn statistics about elements and their relationships and use them to infer constraints that we use to prune candidate mappings. We also describe how to use known mappings to learn the importance of domain and generic constraints. We present experimental results that demonstrate that corpus-based matching outperforms direct matching (without the benefit of a corpus) in multiple domains.
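
The first use of the corpus, augmenting the evidence about an element with evidence from similar corpus elements, can be illustrated with a minimal sketch. It is not the paper's method: it assumes that an element's only evidence is the set of word tokens in its name, and it uses plain Jaccard overlap in place of the learned matchers the abstract alludes to. The function names, the threshold, and the toy corpus are all hypothetical.

```python
import re

def tokens(name):
    """Split an element name into a set of lowercase word tokens."""
    return set(re.findall(r"[a-z]+", name.lower()))

def jaccard(a, b):
    """Jaccard similarity between two token sets."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def augment(element, corpus_concepts, threshold=0.2):
    """Return the element's tokens plus tokens of similar corpus concepts.

    Each corpus concept is a list of element names that past mappings
    (hypothetically) identified as modeling the same thing.
    """
    own = tokens(element)
    evidence = set(own)
    for variants in corpus_concepts:
        concept_tokens = set().union(*(tokens(v) for v in variants))
        if jaccard(own, concept_tokens) >= threshold:
            evidence |= concept_tokens
    return evidence

# Toy corpus (assumed): two concepts with their known name variations.
corpus_concepts = [
    ["phone", "telephone", "phone_no"],
    ["price", "cost", "unit_price"],
]

# Direct matching: the two names share no token, so similarity is zero.
direct = jaccard(tokens("contact_phone"), tokens("telephone_no"))

# Corpus-based matching: both elements resemble the phone concept,
# so both inherit its tokens and now overlap.
augmented = jaccard(
    augment("contact_phone", corpus_concepts),
    augment("telephone_no", corpus_concepts),
)

print(direct)     # 0.0
print(augmented)  # 0.75
```

In this toy case the corpus supplies the missing link between "phone" and "telephone" that neither schema provides on its own. The second use described in the abstract, learning constraints from corpus statistics to prune candidate mappings, is not covered by this sketch.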