The tree inclusion problem: in optimal space and faster

Authors: Philip Bille; Inge Li Gørtz
Affiliations: Department of Theoretical Computer Science, The IT University of Copenhagen, Copenhagen S, Denmark (both authors)
Venue: ICALP'05 Proceedings of the 32nd international conference on Automata, Languages and Programming
Year: 2005

References (14)

Fast algorithms for finding nearest common ancestors. SIAM Journal on Computing.
On query languages for the P-string data model. Information modelling and knowledge bases.
Retrieval from hierarchical texts by partial patterns. SIGIR '93 Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval.
Ordered and Unordered Tree Inclusion. SIAM Journal on Computing.
More efficient algorithm for ordered tree inclusion. Journal of Algorithms.
Time and space efficient method-lookup for object-oriented programs. Proceedings of the seventh annual ACM-SIAM symposium on Discrete algorithms.
Querying and ranking XML documents. Journal of the American Society for Information Science and Technology - XML.
Deterministic dictionaries. Journal of Algorithms.
Fully Persistent Arrays (Extended Abstract). WADS '89 Proceedings of the Workshop on Algorithms and Data Structures.
Minimizing Diameters of Dynamic Trees. ICALP '97 Proceedings of the 24th International Colloquium on Automata, Languages and Programming.
A New Algorithm for the Ordered Tree Inclusion Problem. CPM '97 Proceedings of the 8th Annual Symposium on Combinatorial Pattern Matching.
On the Tree Inclusion Problem. MFCS '93 Proceedings of the 18th International Symposium on Mathematical Foundations of Computer Science.
Finding hot query patterns over an XQuery stream. The VLDB Journal - The International Journal on Very Large Data Bases.
Efficient mining of XML query patterns for caching. VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29.

Cited by (8)

Generalized LCS. Theoretical Computer Science.
Matching subsequences in trees. Journal of Discrete Algorithms.
Fast Arc-Annotated Subsequence Matching in Linear Space. SOFSEM '10 Proceedings of the 36th Conference on Current Trends in Theory and Practice of Computer Science.
Exploiting the structure of update fragments for efficient XML index maintenance. APWeb/WAIM'07 Proceedings of the joint 9th Asia-Pacific web and 8th international conference on web-age information management conference on Advances in data and web management.
The tree inclusion problem: In linear space and faster. ACM Transactions on Algorithms (TALG).
Faster bit-parallel algorithms for unordered pseudo-tree matching and tree homeomorphism. IWOCA'10 Proceedings of the 21st international conference on Combinatorial algorithms.
Weak inclusion for XML types. CIAA'11 Proceedings of the 16th international conference on Implementation and application of automata.
Weak inclusion for recursive XML types. CIAA'12 Proceedings of the 17th international conference on Implementation and Application of Automata.

Abstract

Given two rooted, ordered, and labeled trees P and T, the tree inclusion problem is to determine whether P can be obtained from T by deleting nodes in T. This problem has recently been recognized as an important query primitive in XML databases. Kilpeläinen and Mannila (SIAM J. of Comp. 1995) presented the first polynomial time algorithm, which uses quadratic time and space. Since then, several improved results have been obtained for special cases in which P and T have a small number of leaves or small depth. However, in the worst case these algorithms still use quadratic time and space. In this paper we present a new approach to the problem that leads to an algorithm using optimal linear space and subquadratic running time. Our algorithm improves all previous time and space bounds. Most importantly, the space is improved by a linear factor. This will make it possible to query larger XML databases and will speed up queries, since more of the computation can be kept in main memory.
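
To make the problem statement concrete, the following Python sketch checks ordered tree inclusion on small examples. It is not the algorithm of the paper and makes no attempt at linear space or subquadratic time; it is a plain memoized recursion on ordered forests. The tuple encoding of trees, the function names forest_included and tree_included, and the convention that the root of T may also be deleted (as long as a single tree remains) are assumptions made for this illustration.

```python
from functools import lru_cache

# Hypothetical encoding for this sketch:
# a tree is (label, children), where children is a tuple of trees;
# a forest is a tuple of trees, ordered left to right.


@lru_cache(maxsize=None)
def forest_included(p_forest, t_forest):
    """Return True if p_forest can be obtained from t_forest by deleting nodes.

    Deleting a node attaches its children, in order, in its place.
    """
    if not p_forest:
        return True   # delete every node that is left in t_forest
    if not t_forest:
        return False  # pattern nodes remain but nothing is left to match

    (t_label, t_children), t_rest = t_forest[0], t_forest[1:]

    # Option 1: delete the root of the leftmost tree of T;
    # its children take its place in the forest.
    if forest_included(p_forest, t_children + t_rest):
        return True

    # Option 2: keep that root. It is then the leftmost root of the result,
    # so it must match the root of the leftmost tree of P.
    (p_label, p_children), p_rest = p_forest[0], p_forest[1:]
    return (
        p_label == t_label
        and forest_included(p_children, t_children)
        and forest_included(p_rest, t_rest)
    )


def tree_included(p, t):
    """Return True if tree p is included in tree t (ordered inclusion)."""
    return forest_included((p,), (t,))


if __name__ == "__main__":
    leaf = lambda label: (label, ())
    # T = a(b(c, d), e)
    t = ("a", (("b", (leaf("c"), leaf("d"))), leaf("e")))
    # Deleting b and d from T leaves a(c, e), so this pattern is included.
    print(tree_included(("a", (leaf("c"), leaf("e"))), t))
    # The same labels in the opposite order violate the left-to-right order.
    print(tree_included(("a", (leaf("e"), leaf("c"))), t))
```

The recursion only ever deletes or matches the leftmost root of T, which is enough because a kept leftmost root must stay the leftmost root after any further deletions. Memoization keeps repeated subproblems from being recomputed, but the number of distinct subforests can still be large, which is the kind of space cost the abstract is concerned with.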