ViST: a dynamic index method for querying XML data by tree structures

Authors:
Haixun Wang;Sanghyun Park;Wei Fan;Philip S. Yu
Affiliations:
IBM Thomas J. Watson Research Center, Hawthorne, NY;POSTECH, Pohang, Korea;IBM Thomas J. Watson Research Center, Hawthorne, NY;IBM Thomas J. Watson Research Center, Hawthorne, NY
Venue:
Proceedings of the 2003 ACM SIGMOD international conference on Management of data
Year:
2003

Citing 14
Cited 127

A query language for XML

WWW '99 Proceedings of the eighth international conference on World Wide Web
A Space-Economical Suffix Tree Construction Algorithm

Journal of the ACM (JACM)
Data on the Web: from relations to semistructured data and XML

Data on the Web: from relations to semistructured data and XML
Compact labeling schemes for ancestor queries

SODA '01 Proceedings of the twelfth annual ACM-SIAM symposium on Discrete algorithms
Labeling dynamic XML trees

Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Improved labeling scheme for ancestor queries

SODA '02 Proceedings of the thirteenth annual ACM-SIAM symposium on Discrete algorithms
A comparison of labeling schemes for ancestor queries

SODA '02 Proceedings of the thirteenth annual ACM-SIAM symposium on Discrete algorithms
APEX: an adaptive path index for XML data

Proceedings of the 2002 ACM SIGMOD international conference on Management of data
Covering indexes for branching path queries

Proceedings of the 2002 ACM SIGMOD international conference on Management of data
DataGuides: Enabling Query Formulation and Optimization in Semistructured Databases

VLDB '97 Proceedings of the 23rd International Conference on Very Large Data Bases
Indexing and Querying XML Data for Regular Path Expressions

Proceedings of the 27th International Conference on Very Large Data Bases
A Fast Index for Semistructured Data

Proceedings of the 27th International Conference on Very Large Data Bases
Quilt: An XML Query Language for Heterogeneous Data Sources

Selected papers from the Third International Workshop WebDB 2000 on The World Wide Web and Databases
The XML benchmark project

The XML benchmark project

Recent progress on selected topics in database research: a report by nine young Chinese researchers working in the United States

Journal of Computer Science and Technology
PRIX: Indexing And Querying XML Using Prüfer Sequences

ICDE '04 Proceedings of the 20th International Conference on Data Engineering
A Succinct Physical Storage Scheme for Efficient Evaluation of Path Queries in XML

ICDE '04 Proceedings of the 20th International Conference on Data Engineering
Efficient processing of XML twig queries with OR-predicates

SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
On the integration of structure indexes and inverted lists

SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
XSeq: an indexing infrastructure for tree pattern queries

SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
Twig query processing over graph-structured XML data

Proceedings of the 7th International Workshop on the Web and Databases: colocated with ACM SIGMOD/PODS 2004
An optimal algorithm for querying tree structures and its applications in bioinformatics

ACM SIGMOD Record
Virtual cursors for XML joins

Proceedings of the thirteenth ACM international conference on Information and knowledge management
Ctree: a compact tree for indexing XML data

Proceedings of the 6th annual ACM international workshop on Web information and data management
Event-based modeling and processing of digital media

Proceedings of the 1st international workshop on Computer vision meets databases
On the Sequencing of Tree Structures for XML Indexing

ICDE '05 Proceedings of the 21st International Conference on Data Engineering
Incremental maintenance of path-expression views

Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Rewriting XPath queries using materialized views

VLDB '05 Proceedings of the 31st international conference on Very large data bases
Benefits of path summaries in an XML query optimizer supporting multiple access methods

VLDB '05 Proceedings of the 31st international conference on Very large data bases
From region encoding to extended dewey: on efficient processing of XML twig pattern matching

VLDB '05 Proceedings of the 31st international conference on Very large data bases
Tree-pattern queries on a lightweight XML processor

VLDB '05 Proceedings of the 31st international conference on Very large data bases
Accelerating queries by pruning XML documents

Data & Knowledge Engineering
XML Document Indexes: A Classification

IEEE Internet Computing
A path-based node filtering method for efficient structural joins

Information Processing Letters
Storing XML (with XSD) in SQL Databases: Interplay of Logical and Physical Designs

IEEE Transactions on Knowledge and Data Engineering
Efficient indexing and querying of XML data using modified Prüfer sequences

Proceedings of the 14th ACM international conference on Information and knowledge management
Optimizing cursor movement in holistic twig joins

Proceedings of the 14th ACM international conference on Information and knowledge management
Supporting complex queries on multiversion XML documents

ACM Transactions on Internet Technology (TOIT)
Sequencing XML data and query twigs for fast pattern matching

ACM Transactions on Database Systems (TODS)
Compressing and searching XML data via two zips

Proceedings of the 15th international conference on World Wide Web
MTree: an XML XPath graph index

Proceedings of the 2006 ACM symposium on Applied computing
Exploit sequencing to accelerate hot XML query pattern mining

Proceedings of the 2006 ACM symposium on Applied computing
xSpace: a tuple space for XML & its application in orchestration of web services

Proceedings of the 2006 ACM symposium on Applied computing
Tree inclusion algorithm, signatures and evaluation of path-oriented queries

Proceedings of the 2006 ACM symposium on Applied computing
Processing queries on tree-structured data efficiently

Proceedings of the twenty-fifth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Meta-data indexing for XPath location steps

Proceedings of the 2006 ACM SIGMOD international conference on Management of data
TWIX: twig structure and content matching of selective queries using binary labeling

InfoScale '06 Proceedings of the 1st international conference on Scalable information systems
FIX: feature-based indexing technique for XML documents

VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Answering tree pattern queries using views

VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Efficient processing of XPath queries using indexes

Information Systems
Schema-conscious XML indexing

Information Systems
Index structures for matching XML twigs using relational query processors

Data & Knowledge Engineering
The linguist's search engine: an overview

ACLdemo '05 Proceedings of the ACL 2005 on Interactive poster and demonstration sessions
Indexing dataspaces

Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Optimizing XPath queries on streaming XML data

ADC '07 Proceedings of the eighteenth conference on Australasian database - Volume 63
Efficiently Querying Large XML Data Repositories: A Survey

IEEE Transactions on Knowledge and Data Engineering
XFlat: Query-friendly encrypted XML view publishing

Information Sciences: an International Journal
Taming XPath queries by minimizing wildcard steps

VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Efficient evaluation of high-selective xml twig patterns with parent child edges in tree-unaware rdbms

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Filtering unsatisfiable XPath queries

Data & Knowledge Engineering
LCS-TRIM: dynamic programming meets XML indexing and querying

VLDB '07 Proceedings of the 33rd international conference on Very large data bases
XML twig pattern matching using version tree

Data & Knowledge Engineering
Efficient updates in dynamic XML data: from binary string to quaternary string

The VLDB Journal — The International Journal on Very Large Data Bases
An efficient numbering scheme and query algorithms for XML

International Journal of Computational Science and Engineering
Efficient processing of branch queries for high-performance XML filtering

Proceedings of the 2nd international conference on Scalable information systems
Efficient storage scheme and query processing for supply chain management using RFID

Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Faster path indexes for search in XML data

ADC '08 Proceedings of the nineteenth conference on Australasian database - Volume 75
Structural summaries for efficient XML query processing

Ph.D. '08 Proceedings of the 2008 EDBT Ph.D. workshop
Rules for query rewrite in native XML databases

DataX '08 Proceedings of the 2008 EDBT workshop on Database technologies for handling XML information on the web
An Efficient XML Index Structure with Bottom-Up Query Processing

ICCS '07 Proceedings of the 7th international conference on Computational Science, Part III: ICCS 2007
On the efficient search of an XML twig query in large DataGuide trees

IDEAS '08 Proceedings of the 2008 international symposium on Database engineering & applications
Hash-base subgraph query processing method for graph-structured XML documents

Proceedings of the VLDB Endowment
Prefix based numbering schemes for XML: techniques, applications and performances

Proceedings of the VLDB Endowment
RRSi: indexing XML data for proximity twig queries

Knowledge and Information Systems
FMware: middleware for efficient filtering and matching of XML messages with local data

Proceedings of the ACM/IFIP/USENIX 2006 International Conference on Middleware
XML data partitioning strategies to improve parallelism in parallel holistic twig joins

Proceedings of the 3rd International Conference on Ubiquitous Information Management and Communication
Incremental sequence-based frequent query pattern mining from XML queries

Data Mining and Knowledge Discovery
Extending path summary and region encoding for efficient structural query processing in native XML databases

Journal of Systems and Software
Binding Structural Properties to Node and Path Constraints in XML Path Retrieval

Advanced Internet Based Systems and Applications
XPath query evaluation based on the stack encoding

C3S2E '09 Proceedings of the 2nd Canadian Conference on Computer Science and Software Engineering
Compressing and indexing labeled trees, with applications

Journal of the ACM (JACM)
BPI-TWIG: XML Twig Query Evaluation

XSym '09 Proceedings of the 6th International XML Database Symposium on Database and XML Technologies
A new algorithm for tree mapping in XML databases

IMSA '07 Proceedings of the Eleventh IASTED International Conference on Internet and Multimedia Systems and Applications
Principles of Holism for sequential twig pattern matching

The VLDB Journal — The International Journal on Very Large Data Bases
OTwig: An Optimised Twig Pattern Matching Approach for XML Databases

SOFSEM '10 Proceedings of the 36th Conference on Current Trends in Theory and Practice of Computer Science
BPI: XML query evaluation using bitmapped path indices

Proceedings of the 2009 EDBT/ICDT Workshops
A path-based node filtering method for efficient structural joins

Information Processing Letters
Xistree: bottom-up method of XML indexing

BIS'07 Proceedings of the 10th international conference on Business information systems
Parameterized XPath views

BNCOD'07 Proceedings of the 24th British national conference on Databases
Effective pruning for XML structural match queries

Data & Knowledge Engineering
VERT: a semantic approach for content search and content extraction in XML query processing

ER'07 Proceedings of the 26th international conference on Conceptual modeling
An approach for XML similarity join using tree serialization

DASFAA'08 Proceedings of the 13th international conference on Database systems for advanced applications
Exploring XML web collections with DescribeX

ACM Transactions on the Web (TWEB)
Data sources selection for XML data sources

International Journal of Intelligent Information and Database Systems
XML: some papers in a haystack

ACM SIGMOD Record
Efficient XQuery join processing in publish/subscribe systems

ADC '09 Proceedings of the Twentieth Australasian Conference on Australasian Database - Volume 92
LTIX: a compact level-based tree to index XML databases

Proceedings of the Fourteenth International Database Engineering & Applications Symposium
Benchmarking holistic approaches to XML tree pattern query processing

DASFAA'10 Proceedings of the 15th international conference on Database systems for advanced applications
KWilt: a semantic patchwork for flexible access to heterogeneous knowledge

RR'10 Proceedings of the Fourth international conference on Web reasoning and rule systems
Indexing and querying XML using extended Dewey labeling scheme

Data & Knowledge Engineering
XPath query processing improvements

Proceedings of the 2010 Conference of the Center for Advanced Studies on Collaborative Research
LLS: level-based labeling scheme for XML databases

Proceedings of the 2010 Conference of the Center for Advanced Studies on Collaborative Research
Labeling Dynamic XML Trees

SIAM Journal on Computing
On the twig joins

ICCOMP'06 Proceedings of the 10th WSEAS international conference on Computers
Key concepts for native XML processing

From active data management to event-based systems and more
TwigTable: using semantics in XML twig pattern query processing

Journal on data semantics XV
An efficient algorithm of frequent XML query pattern mining for ebXML applications in e-commerce

Expert Systems with Applications: An International Journal
Database and information retrieval techniques for XML

ASIAN'05 Proceedings of the 10th Asian Computing Science conference on Advances in computer science: data management on the web
Mining interesting XML-enabled association rules with templates

KDID'04 Proceedings of the Third international conference on Knowledge Discovery in Inductive Databases
LMIX: a dynamic XML index method using line model

APWeb'05 Proceedings of the 7th Asia-Pacific web conference on Web Technologies Research and Development
FMware: middleware for efficient filtering and matching of XML messages with local data

Middleware'06 Proceedings of the 7th ACM/IFIP/USENIX international conference on Middleware
Developing an XML document retrieval system for a digital museum

ICCSA'05 Proceedings of the 2005 international conference on Computational Science and its Applications - Volume Part I
TwigStackList ¬: a holistic twig join algorithm for twig query with not-predicates on XML data

DASFAA'06 Proceedings of the 11th international conference on Database Systems for Advanced Applications
Efficient schemes of executing star operators in XPath query expressions

DASFAA'06 Proceedings of the 11th international conference on Database Systems for Advanced Applications
Exploit sequencing to accelerate XML twig query answering

DASFAA'06 Proceedings of the 11th international conference on Database Systems for Advanced Applications
XML document retrieval system based on document structure and image content for digital museum

APWeb'06 Proceedings of the 2006 international conference on Advanced Web and Network Technologies, and Applications
Efficient dissemination of filtered data in XML-Based SDI

DEXA'05 Proceedings of the 16th international conference on Database and Expert Systems Applications
SIOUX: an efficient index for processing structural XQueries

DEXA'05 Proceedings of the 16th international conference on Database and Expert Systems Applications
Efficiently coding and querying XML document

DNIS'05 Proceedings of the 4th international conference on Databases in Networked Information Systems
Searching web data: An entity retrieval and high-performance indexing model

Web Semantics: Science, Services and Agents on the World Wide Web
PathStack¬: a holistic path join algorithm for path query with not-predicates on XML data

DASFAA'05 Proceedings of the 10th international conference on Database Systems for Advanced Applications
Efficiently coding and indexing XML document

DASFAA'05 Proceedings of the 10th international conference on Database Systems for Advanced Applications
Efficient XPath evaluation

ADBIS'05 Proceedings of the 9th East European conference on Advances in Databases and Information Systems
A path-based labeling scheme for efficient structural join

XSym'05 Proceedings of the Third international conference on Database and XML Technologies
Relational index support for XPath axes

XSym'05 Proceedings of the Third international conference on Database and XML Technologies
A logic-based approach to cache answerability for XPath queries

XSym'06 Proceedings of the 4th international conference on Database and XML Technologies
FLUX: content and structure matching of XPath queries with range predicates

XSym'06 Proceedings of the 4th international conference on Database and XML Technologies
On the query evaluation in document DBs

DEXA'06 Proceedings of the 17th international conference on Database and Expert Systems Applications
XML document retrieval for digital museum

ICADL'05 Proceedings of the 8th international conference on Asian Digital Libraries: implementing strategies and sharing experiences
Adding logical operators to tree pattern queries on graph-structured data

Proceedings of the VLDB Endowment
Examining the impact of data-access cost on XML twig pattern matching

Information Sciences: an International Journal
XML filtering with XPath expressions containing parent and ancestor axes

Information Sciences: an International Journal
OXDP & OXiP: the notion of objects for efficient large XML data queries

International Journal of Grid and Utility Computing
XML query processing: efficiency and optimality

Proceedings of the 16th International Database Engineering & Applications Sysmposium
On the efficient processing regular path expressions of an enormous volume of XML data

DEXA'07 Proceedings of the 18th international conference on Database and Expert Systems Applications
Efficient evaluation of nearest common ancestor in XML twig queries using tree-unaware RDBMS

DEXA'07 Proceedings of the 18th international conference on Database and Expert Systems Applications
An XML data query method based on structure-encoded

WISM'12 Proceedings of the 2012 international conference on Web Information Systems and Mining
Optimizing XML queries: Bitmapped materialized views vs. indexes

Information Systems
Optimal and efficient generalized twig pattern processing: a combination of preorder and postorder filterings

The VLDB Journal — The International Journal on Very Large Data Bases
Semantic-based construction of content and structure XML index

ADC '13 Proceedings of the Twenty-Fourth Australasian Database Conference - Volume 137
Temporal and multi-versioned XML documents: A survey

Information Processing and Management: an International Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

With the growing importance of XML in data exchange, much research has been done in providing flexible query facilities to extract data from structured XML documents. In this paper, we propose ViST, a novel index structure for searching XML documents. By representing both XML documents and XML queries in structure-encoded sequences, we show that querying XML data is equivalent to finding subsequence matches. Unlike index methods that disassemble a query into multiple sub-queries, and then join the results of these sub-queries to provide the final answers, ViST uses tree structures as the basic unit of query to avoid expensive join operations. Furthermore, ViST provides a unified index on both content and structure of the XML documents, hence it has a performance advantage over methods indexing either just content or structure. ViST supports dynamic index update, and it relies solely on B+ Trees without using any specialized data structures that are not well supported by DBMSs. Our experiments show that ViST is effective, scalable, and efficient in supporting structural queries.