A Theory of Attributed Equivalence in Databases with Application to Schema Integration
IEEE Transactions on Software Engineering
Infomaster: an information integration system
SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
The TSIMMIS Approach to Mediation: Data Models and Languages
Journal of Intelligent Information Systems - Special issue: next generation information technologies and systems
Extracting schema from semistructured data
SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
An adaptive query execution system for data integration
SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
Reconciling schemas of disparate data sources: a machine-learning approach
SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Clio: a semi-automatic tool for schema mapping
SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Scaling Access to Heterogeneous Data Sources with DISCO
IEEE Transactions on Knowledge and Data Engineering
Global Viewing of Heterogeneous Data Sources
IEEE Transactions on Knowledge and Data Engineering
A Structure Based Schema Integration Methodology
ICDE '95 Proceedings of the Eleventh International Conference on Data Engineering
ICDT '97 Proceedings of the 6th International Conference on Database Theory
DataGuides: Enabling Query Formulation and Optimization in Semistructured Databases
VLDB '97 Proceedings of the 23rd International Conference on Very Large Data Bases
Using Schema Matching to Simplify Heterogeneous Data Translation
VLDB '98 Proceedings of the 24rd International Conference on Very Large Data Bases
Generic Schema Matching with Cupid
Proceedings of the 27th International Conference on Very Large Data Bases
Querying Heterogeneous Information Sources Using Source Descriptions
VLDB '96 Proceedings of the 22th International Conference on Very Large Data Bases
Everything You Ever Wanted to Know About DTDs, But Were Afraid to Ask (Extended Abstract)
Selected papers from the Third International Workshop WebDB 2000 on The World Wide Web and Databases
Quilt: An XML Query Language for Heterogeneous Data Sources
Selected papers from the Third International Workshop WebDB 2000 on The World Wide Web and Databases
Semantic and schematic similarities between database objects: a context-based approach
The VLDB Journal — The International Journal on Very Large Data Bases
Information Systems - Special issue on web data integration
Finding an optimum edit script between an XML document and a DTD
Proceedings of the 2005 ACM symposium on Applied computing
Peer-to-peer management of XML data: issues and research challenges
ACM SIGMOD Record
Schema matching for transforming structured documents
Proceedings of the 2005 ACM symposium on Document engineering
Integration of XML schemas at various "severity" levels
Information Systems
Dealing with semantic heterogeneity for improving web usage
Data & Knowledge Engineering - Special issue: ER 2004
Data & Knowledge Engineering - Special issue: WIDM 2004
XML schema clustering with semantic and hierarchical similarity measures
Knowledge-Based Systems
Matching large schemas: Approaches and evaluation
Information Systems
Xproj: a framework for projected structural clustering of xml documents
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Measuring the structural similarity of semistructured documents using entropy
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Measuring the structural similarity among XML documents and DTDs
Journal of Intelligent Information Systems
Computing structural similarity of source XML schemas against domain XML schema
ADC '08 Proceedings of the nineteenth conference on Australasian database - Volume 75
Similarity of XML schema definitions
Proceedings of the eighth ACM symposium on Document engineering
PORSCHE: Performance ORiented SCHEma mediation
Information Systems
Document Clustering Using Incremental and Pairwise Approaches
Focused Access to XML Documents
An Effective Data Processing Method for Fast Clustering
ICCSA '08 Proceedings of the international conference on Computational Science and Its Applications, Part II
Expert Systems with Applications: An International Journal
Equivalence of XSD Constructs and Its Exploitation in Similarity Evaluation
OTM '08 Proceedings of the OTM 2008 Confederated International Conferences, CoopIS, DOA, GADA, IS, and ODBASE 2008. Part II on On the Move to Meaningful Internet Systems
A schema matching-based approach to XML schema clustering
Proceedings of the 10th International Conference on Information Integration and Web-based Applications & Services
APWeb/WAIM '09 Proceedings of the Joint International Conferences on Advances in Data and Web Management
Extension of Schema Matching Platform ASMADE to Constraints and Mapping Expression
Advanced Internet Based Systems and Applications
Improving XML schema matching performance using Prüfer sequences
Data & Knowledge Engineering
A gauss function based approach for unbalanced ontology matching
Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
Data Discovery and Related Factors of Documents on the Web and the Network
ICCSA '09 Proceedings of the International Conference on Computational Science and Its Applications: Part I
A cluster-based approach to XML similarity joins
IDEAS '09 Proceedings of the 2009 International Database Engineering & Applications Symposium
Semantic clustering of XML documents
ACM Transactions on Information Systems (TOIS)
Actively Learning Ontology Matching via User Interaction
ISWC '09 Proceedings of the 8th International Semantic Web Conference
XML Schema Element Similarity Measures: A Schema Matching Context
OTM '09 Proceedings of the Confederated International Conferences, CoopIS, DOA, IS, and ODBASE 2009 on On the Move to Meaningful Internet Systems: Part II
Extensible User-Based XML Grammar Matching
ER '09 Proceedings of the 28th International Conference on Conceptual Modeling
Semantic Structural Similarity Measure for Clustering XML Documents
WISM '09 Proceedings of the International Conference on Web Information Systems and Mining
A Bloom Filter Based Approach for Evaluating Structural Similarity of XML Documents
WISM '09 Proceedings of the International Conference on Web Information Systems and Mining
Return specification inference and result clustering for keyword search on XML
ACM Transactions on Database Systems (TODS)
Structural and semantic aspects of similarity of Document Type Definitions and XML schemas
Information Sciences: an International Journal
Structural similarity evaluation between XML documents and DTDs
WISE'07 Proceedings of the 8th international conference on Web information systems engineering
Semantics-guided clustering of heterogeneous XML schemas
Journal on data semantics IX
An approach for measuring similarity between XML documents
FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 7
An effective detection method for clustering similar XML DTDs using tag sequences
ICCSA'07 Proceedings of the 2007 international conference on Computational science and Its applications - Volume Part II
A weighted common structure based clustering technique for XML documents
Journal of Systems and Software
Improving XML search by generating and utilizing informative result snippets
ACM Transactions on Database Systems (TODS)
Element similarity measures in XML schema matching
Information Sciences: an International Journal
Transforming XML documents as schemas evolve
Proceedings of the VLDB Endowment
A bounded distance metric for comparing tree structure
Information Systems
Highly efficient algorithms for structural clustering of large websites
Proceedings of the 20th international conference on World wide web
Multimedia metadata mapping: towards helping developers in their integration task
Proceedings of the 8th International Conference on Advances in Mobile Computing and Multimedia
XML data clustering: An overview
ACM Computing Surveys (CSUR)
MuMIe: a new system for multimedia metadata interoperability
Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Transactions on large-scale data- and knowledge-centered systems III
Transactions on computational collective intelligence V
A new sequential mining approach to XML document clustering*
APWeb'05 Proceedings of the 7th Asia-Pacific web conference on Web Technologies Research and Development
An approach for clustering semantically heterogeneous XML schemas
OTM'05 Proceedings of the 2005 Confederated international conference on On the Move to Meaningful Internet Systems - Volume >Part I
Clustering XML documents by structure based on common neighbor
CIS'05 Proceedings of the 2005 international conference on Computational Intelligence and Security - Volume Part I
Clustering and retrieval of XML documents by structure
ICCSA'05 Proceedings of the 2005 international conference on Computational Science and Its Applications - Volume Part II
Clustering OWL documents based on semantic analysis
WAIM'05 Proceedings of the 6th international conference on Advances in Web-Age Information Management
XMine: a methodology for mining XML structure
APWeb'06 Proceedings of the 8th Asia-Pacific Web conference on Frontiers of WWW Research and Development
XML clustering based on common neighbor
APWeb'06 Proceedings of the 2006 international conference on Advanced Web and Network Technologies, and Applications
Querying tree-structured data using dimension graphs
CAiSE'05 Proceedings of the 17th international conference on Advanced Information Systems Engineering
Automatic generation of semantic fields for resource discovery in the semantic web
DEXA'05 Proceedings of the 16th international conference on Database and Expert Systems Applications
An experiment on the matching and reuse of XML schemas
ICWE'05 Proceedings of the 5th international conference on Web Engineering
Semantic integration of tree-structured data using dimension graphs
Journal on Data Semantics IV
LAX: an efficient approximate XML join based on clustered leaf nodes for XML data integration
BNCOD'05 Proceedings of the 22nd British National conference on Databases: enterprise, Skills and Innovation
A framework for integrating XML transformations
ER'06 Proceedings of the 25th international conference on Conceptual Modeling
Clustering large scale of XML documents
GPC'06 Proceedings of the First international conference on Advances in Grid and Pervasive Computing
Machine learning models: combining evidence of similarity for XML schema matching
KDXD'06 Proceedings of the First international conference on Knowledge Discovery from XML Documents
Minimizing user effort in XML grammar matching
Information Sciences: an International Journal
Measuring structural similarity of semistructured data based on information-theoretic approaches
The VLDB Journal — The International Journal on Very Large Data Bases
Hierarchical clustering of XML documents focused on structural components
Data & Knowledge Engineering
Combining structure and content similarities for XML document clustering
AusDM '08 Proceedings of the 7th Australasian Data Mining Conference - Volume 87
Proceedings of the 2013 Research in Adaptive and Convergent Systems
Schema matching prediction with applications to data source discovery and dynamic ensembling
The VLDB Journal — The International Journal on Very Large Data Bases
Personal and Ubiquitous Computing
Hi-index | 0.00 |
It is increasingly important to develop scalable integration techniques for the growing number of XML data sources. A practical starting point for the integration of large numbers of Document Type Definitions (DTDs) of XML sources would be to first find clusters of DTDs that are similar in structure and semantics. Reconciling similar DTDs within such a cluster will be an easier task than reconciling DTDs that are different in structure and semantics as the latter would involve more restructuring. We introduce XClust, a novel integration strategy that involves the clustering of DTDs. A matching algorithm based on the semantics, immediate descendents and leaf-context similarity of DTD elements is developed. Our experiments to integrate real world DTDs demonstrate the effectiveness of the XClust approach.