As the internet has grown in popularity, more and more data are generated online. Because Extensible Markup Language (XML) is easy to use, a growing share of these data is organized as XML documents. Because XML is also flexible, XML data come in a wide variety of organizational formats, which makes data management considerably harder. In particular, large-scale operations on XML data, such as data integration and model change, run into many problems. One current approach is to carry out these operations through data exchange. Previous work has mainly analyzed the characteristics of schema mappings over XML and established data exchange rules. These rules consider only the integrity and reliability of the data, not the quality of the data after conversion. This paper proposes the concept of quality assurance mechanisms. First, we present a new data exchange model with quality assurance and provide a suitable method for this model. Then, on the basis of the schema, we propose a strategy of weak-branch convergence. Finally, theoretical analysis and experimental results show that the method is correct and feasible.