A comparative analysis of methodologies for database schema integration
ACM Computing Surveys (CSUR)
Database techniques for the World-Wide Web: a survey
ACM SIGMOD Record
Automatic discovery of language models for text databases
SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
IEPAD: information extraction based on pattern discovery
Proceedings of the 10th international conference on World Wide Web
Reconciling schemas of disparate data sources: a machine-learning approach
SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Modern Information Retrieval
Computers and Intractability: A Guide to the Theory of NP-Completeness
Computers and Intractability: A Guide to the Theory of NP-Completeness
Global Viewing of Heterogeneous Data Sources
IEEE Transactions on Knowledge and Data Engineering
Proceedings of the 27th International Conference on Very Large Data Bases
Generic Schema Matching with Cupid
Proceedings of the 27th International Conference on Very Large Data Bases
RoadRunner: Towards Automatic Data Extraction from Large Web Sites
Proceedings of the 27th International Conference on Very Large Data Bases
Semantic Integration in Heterogeneous Databases Using Neural Networks
VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
Global Schema Generation Using Formal Ontologies
ER '02 Proceedings of the 21st International Conference on Conceptual Modeling
A survey of approaches to automatic schema matching
The VLDB Journal — The International Journal on Very Large Data Bases
Data extraction and label assignment for web databases
WWW '03 Proceedings of the 12th international conference on World Wide Web
Statistical schema matching across web query interfaces
Proceedings of the 2003 ACM SIGMOD international conference on Management of data
Extracting structured data from Web pages
Proceedings of the 2003 ACM SIGMOD international conference on Management of data
Synthesizing an Integrated Ontology
IEEE Internet Computing
COMA: a system for flexible combination of schema matching approaches
VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
Wise-integrator: an automatic integrator of web search interfaces for E-commerce
VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
Mining structures for semantics
ACM SIGKDD Explorations Newsletter
ICDE '05 Proceedings of the 21st International Conference on Data Engineering
Object-level ranking: bringing order to Web objects
WWW '05 Proceedings of the 14th international conference on World Wide Web
QA-Pagelet: Data Preparation Techniques for Large-Scale Data Analysis of the Deep Web
IEEE Transactions on Knowledge and Data Engineering
VLDB '05 Proceedings of the 31st international conference on Very large data bases
Automatic complex schema matching across Web query interfaces: A correlation mining approach
ACM Transactions on Database Systems (TODS)
Matching large schemas: Approaches and evaluation
Information Systems
Towards a global schema for web entities
Proceedings of the 17th international conference on World Wide Web
Bootstrapping pay-as-you-go data integration systems
Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Schema Matching across Query Interfaces on the Deep Web
BNCOD '08 Proceedings of the 25th British national conference on Databases: Sharing Data, Information and Knowledge
Proceedings of the VLDB Endowment
Integrating web query results: holistic schema matching
Proceedings of the 17th ACM conference on Information and knowledge management
ODE: Ontology-assisted data extraction
ACM Transactions on Database Systems (TODS)
A Prioritized Collective Selection Strategy for Schema Matching across Query Interfaces
BNCOD 26 Proceedings of the 26th British National Conference on Databases: Dataspace: The Final Frontier
An instance-based approach for domain-independent schema matching
Proceedings of the 46th Annual Southeast Regional Conference on XX
Site-Wide Wrapper Induction for Life Science Deep Web Databases
DILS '09 Proceedings of the 6th International Workshop on Data Integration in the Life Sciences
Deriving Customized Integrated Web Query Interfaces
WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
An empirical study on using hidden markov model for search interface segmentation
Proceedings of the 18th ACM conference on Information and knowledge management
An evidential approach to query interface matching on the deep Web
Information Systems
Category mapping for the automatic integration of category-constrained web search
International Journal of Business Intelligence and Data Mining
Kosmix: high-performance topic exploration using the deep web
Proceedings of the VLDB Endowment
A hierarchical approach to model web query interfaces for web source integration
Proceedings of the VLDB Endowment
Stop word and related problems in web interface integration
Proceedings of the VLDB Endowment
Wrapping of Web Sources with restricted Query Interfaces by Query Tunneling
Electronic Notes in Theoretical Computer Science (ENTCS)
Schema mapping in p2p networks based on classification and probing
DASFAA'07 Proceedings of the 12th international conference on Database systems for advanced applications
An instance-based schema matching method with attributes ranking and classification
FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 5
Interoperability by design using the StdTrip tool: an a priori approach
Proceedings of the 6th International Conference on Semantic Systems
Understanding deep web search interfaces: a survey
ACM SIGMOD Record
PruSM: a prudent schema matching approach for web forms
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Web database schema identification through simple query interface
RED'09 Proceedings of the 2nd international conference on Resource discovery
Designing service marts for engineering search computing applications
ICWE'10 Proceedings of the 10th international conference on Web engineering
Instance discovery and schema matching with applications to biological deep web data integration
DILS'10 Proceedings of the 7th international conference on Data integration in the life sciences
Editorial: Revising the constraints of lightweight mediated schemas
Data & Knowledge Engineering
A probabilistic approach for automatically filling form-based web interfaces
Proceedings of the VLDB Endowment
Materializing multi-relational databases from the web using taxonomic queries
Proceedings of the fourth ACM international conference on Web search and data mining
On-line web database integration
Proceedings of the International Conference on Management of Emergent Digital EcoSystems
SourceRank: relevance and trust assessment for deep web sources based on inter-source agreement
Proceedings of the 20th international conference on World wide web
Extracting data records from query result pages based on visual features
BNCOD'11 Proceedings of the 28th British national conference on Advances in databases
Holistic schema matching for web query interfaces
EDBT'06 Proceedings of the 10th international conference on Advances in Database Technology
CCWrapper: adaptive predefined schema guided web extraction
WAIM '06 Proceedings of the 7th international conference on Advances in Web-Age Information Management
Automatic generation of data types for classification of deep web sources
DILS'05 Proceedings of the Second international conference on Data Integration in the Life Sciences
Matching Attributes across Overlapping Heterogeneous Data Sources Using Mutual Information
Journal of Database Management
Assessing relevance and trust of the deep web sources and results based on inter-source agreement
ACM Transactions on the Web (TWEB)
Schema matching prediction with applications to data source discovery and dynamic ensembling
The VLDB Journal — The International Journal on Very Large Data Bases
The ontological key: automatically understanding and integrating forms to access the deep Web
The VLDB Journal — The International Journal on Very Large Data Bases
Hi-index | 0.00 |
In a Web database that dynamically provides information in response to user queries, two distinct schemas, interface schema (the schema users can query) and result schema (the schema users can browse), are presented to users. Each partially reflects the actual schema of the Web database. Most previous work only studied the problem of schema matching across query interfaces of Web databases. In this paper, we propose a novel schema model that distinguishes the interface and the result schema of a Web database in a specific domain. In this model, we address two significant Web database schema-matching problems: intra-site and inter-site. The first problem is crucial in automatically extracting data from Web databases, while the second problem plays a significant role in meta-retrieving and integrating data from different Web databases. We also investigate a unified solution to the two problems based on query probing and instance-based schema matching techniques. Using the model, a cross validation technique is also proposed to improve the accuracy of the schema matching. Our experiments on real Web databases demonstrate that the two problems can be solved simultaneously with high precision and recall.