PODS '97 Proceedings of the sixteenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
Extracting schema from semistructured data
SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Ontology-based extraction and structuring of information from data-rich unstructured documents
Proceedings of the seventh international conference on Information and knowledge management
Discovering typical structures of documents: a road map approach
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
DTD inference for views of XML data
PODS '00 Proceedings of the nineteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
A Web Odyssey: from Codd to XML
PODS '01 Proceedings of the twentieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Querying websites using compact skeletons
PODS '01 Proceedings of the twentieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Algorithmics and applications of tree and graph searching
Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
APEX: an adaptive path index for XML data
Proceedings of the 2002 ACM SIGMOD international conference on Management of data
Covering indexes for branching path queries
Proceedings of the 2002 ACM SIGMOD international conference on Management of data
Information organization and databases
Extracting indexing information from XML DTDs
Information Processing Letters
Discovering Structural Association of Semistructured Data
IEEE Transactions on Knowledge and Data Engineering
Approximate Graph Schema Extraction for Semi-Structured Data
EDBT '00 Proceedings of the 7th International Conference on Extending Database Technology: Advances in Database Technology
Index Structures for Path Expressions
ICDT '99 Proceedings of the 7th International Conference on Database Theory
DataGuides: Enabling Query Formulation and Optimization in Semistructured Databases
VLDB '97 Proceedings of the 23rd International Conference on Very Large Data Bases
Incremental Maintenance for Materialized Views over Semistructured Data
VLDB '98 Proceedings of the 24rd International Conference on Very Large Data Bases
Storage and Retrieval of XML Data Using Relational Databases
Proceedings of the 27th International Conference on Very Large Data Bases
A General Architecture for Finding Structural Regularities on the Web
AIMSA '00 Proceedings of the 9th International Conference on Artificial Intelligence: Methodology, Systems, and Applications
Research Issues in Web Data Mining
DaWaK '99 Proceedings of the First International Conference on Data Warehousing and Knowledge Discovery
Schema Discovery of the Semi-structured and Hierarchical Data
IDEAL '02 Proceedings of the Third International Conference on Intelligent Data Engineering and Automated Learning
XML-based Components for Federating Multiple Heterogeneous Data Sources
ER '99 Proceedings of the 18th International Conference on Conceptual Modeling
Designing Good Semi-Structured Databases and Conceptual Modeling
ER '99 Proceedings of the 18th International Conference on Conceptual Modeling
SEuS: Structure Extraction Using Summaries
DS '02 Proceedings of the 5th International Conference on Discovery Science
A two phase optimization technique for XML queries with multiple regular path expressions
Journal of Systems and Software
XML query processing using document type definitions
Journal of Systems and Software
An effective query pruning technique for multiple regular path expressions
Journal of Systems and Software
A Web odyssey: from codd to XML
ACM SIGMOD Record
Handbook of massive data sets
Algebraic rewritings for optimizing regular path queries
Theoretical Computer Science - Database theory
Techniques for the evaluation of XML queries: a survey
Data & Knowledge Engineering
Querying websites using compact skeletons
Journal of Computer and System Sciences - Special issu on PODS 2001
Object-Oriented Mediator Queries to XML Data
WISE '00 Proceedings of the First International Conference on Web Information Systems Engineering (WISE'00)-Volume 2 - Volume 2
A structural adviser for the XML document authoring
Proceedings of the 2003 ACM symposium on Document engineering
Path sharing and predicate evaluation for high-performance XML filtering
ACM Transactions on Database Systems (TODS)
PRIX: Indexing And Querying XML Using Prüfer Sequences
ICDE '04 Proceedings of the 20th International Conference on Data Engineering
Ctree: a compact tree for indexing XML data
Proceedings of the 6th annual ACM international workshop on Web information and data management
A partition index for XML and semi-structured data
Data & Knowledge Engineering
Using Evolutionary Algorithms for Defining the Sampling Policy of Complex N-Partite Networks
IEEE Transactions on Knowledge and Data Engineering
Multimedia Tools and Applications
An adaptive path index for XML data using the query workload
Information Systems
QED: a novel quaternary encoding to completely avoid re-labeling in XML updates
Proceedings of the 14th ACM international conference on Information and knowledge management
Knowledge and Information Systems
Sequencing XML data and query twigs for fast pattern matching
ACM Transactions on Database Systems (TODS)
Inference of concise DTDs from XML data
VLDB '06 Proceedings of the 32nd international conference on Very large data bases
VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
Efficient structural joins on indexed XML documents
VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
A framework for using materialized XPath views in XML query processing
VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Inferring XML schema definitions from XML data
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Faster path indexes for search in XML data
ADC '08 Proceedings of the nineteenth conference on Australasian database - Volume 75
Temporal XML: modeling, indexing, and query processing
The VLDB Journal — The International Journal on Very Large Data Bases
Proceedings of the VLDB Endowment
Computer Languages, Systems and Structures
An adaptive path index for XML data using the query workload
Information Systems
Inference of concise regular expressions and DTDs
ACM Transactions on Database Systems (TODS)
Tracking hidden groups using communications
ISI'03 Proceedings of the 1st NSF/NIJ conference on Intelligence and security informatics
Exploring XML web collections with DescribeX
ACM Transactions on the Web (TWEB)
Extraction and exploitation of intensional knowledge from heterogeneous information sources: semi-automatic approaches and tools
Database and information retrieval techniques for XML
ASIAN'05 Proceedings of the 10th Asian Computing Science conference on Advances in computer science: data management on the web
Reuse or never reuse the deleted labels in XML query processing based on labeling schemes
DASFAA'06 Proceedings of the 11th international conference on Database Systems for Advanced Applications
An improved prefix labeling scheme: a binary string approach for dynamic ordered XML
DASFAA'05 Proceedings of the 10th international conference on Database Systems for Advanced Applications
Information retrieval from distributed semistructured documents using metadata interface
KDXD'06 Proceedings of the First international conference on Knowledge Discovery from XML Documents
Fast answering of XPath query workloads on web collections
XSym'07 Proceedings of the 5th international conference on Database and XML Technologies
Energy and Latency Efficient Access of Wireless XML Stream
Journal of Database Management
Hi-index | 0.00 |
Introduces the concept of representative objects, which uncover the inherent schema(s) in semi-structured, hierarchical data sources and provide a concise description of the structure of the data. Semi-structured data, unlike data stored in typical relational or object-oriented databases, does not have a fixed schema that is known in advance and stored separately from the data. With the rapid growth of the World Wide Web, semi-structured hierarchical data sources are becoming widely available to the casual user. The lack of external schema information currently makes browsing and querying these data sources inefficient at best, and impossible at worst. We show how representative objects make schema discovery efficient and facilitate the generation of meaningful queries over the data.