Query evaluation over probabilistic XML

Authors:
Benny Kimelfeld;Yuri Kosharovsky;Yehoshua Sagiv
Affiliations:
IBM Almaden Research Center, San Jose, USA;The Hebrew University, Jerusalem, Israel;The Hebrew University, Jerusalem, Israel
Venue:
The VLDB Journal — The International Journal on Very Large Data Bases
Year:
2009

Citing 34
Cited 22

On generating all maximal independent sets

Information Processing Letters
Probabilistic quantifiers and games

Journal of Computer and System Sciences - Structure in Complexity Theory Conference, June 2-5, 1986
Monte-Carlo approximation algorithms for enumeration problems

Journal of Algorithms
Counting classes are at least as hard as the polynomial-time hierarchy

SIAM Journal on Computing
Memoing for logic programs

Communications of the ACM
Fixed-Parameter Tractability and Completeness I: Basic Results

SIAM Journal on Computing
On the hardness of approximate reasoning

Artificial Intelligence
A probabilistic relational model and algebra

ACM Transactions on Database Systems (TODS)
A probabilistic relational algebra for the integration of information retrieval and database systems

ACM Transactions on Information Systems (TOIS)
ProbView: a flexible probabilistic database system

ACM Transactions on Database Systems (TODS)
The complexity of query reliability

PODS '98 Proceedings of the seventeenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
On the complexity of database queries

Journal of Computer and System Sciences
EquiX---a search and query language for XML

Journal of the American Society for Information Science and Technology - XML
Containment and equivalence for an XPath fragment

Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Holistic twig joins: optimal XML pattern matching

Proceedings of the 2002 ACM SIGMOD international conference on Management of data
The Management of Probabilistic Data

IEEE Transactions on Knowledge and Data Engineering
An Algebra for Probabilistic Databases

IEEE Transactions on Knowledge and Data Engineering
The Complexity of First-Order and Monadic Second-Order Logic Revisited

LICS '02 Proceedings of the 17th Annual IEEE Symposium on Logic in Computer Science
The complexity of relational query languages (Extended Abstract)

STOC '82 Proceedings of the fourteenth annual ACM symposium on Theory of computing
Optimal implementation of conjunctive queries in relational data bases

STOC '77 Proceedings of the ninth annual ACM symposium on Theory of computing
A Probabilistic XML Approach to Data Integration

ICDE '05 Proceedings of the 21st International Conference on Data Engineering
PEPX: a query-friendly probabilistic XML database

CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
On the complexity of managing probabilistic XML data

Proceedings of the twenty-sixth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
The dichotomy of conjunctive queries on probabilistic structures

Proceedings of the twenty-sixth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Probabilistic interval XML

ACM Transactions on Computational Logic (TOCL)
Efficient query evaluation on probabilistic databases

The VLDB Journal — The International Journal on Very Large Data Bases
ProTDB: probabilistic data in XML

VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
Matching twigs in probabilistic XML

VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Revisiting redundancy and minimization in an XPath fragment

EDBT '08 Proceedings of the 11th international conference on Extending database technology: Advances in database technology
Query efficiency in probabilistic XML models

Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Incorporating constraints in probabilistic XML

Proceedings of the twenty-seventh ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Running tree automata on probabilistic XML

Proceedings of the twenty-eighth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
On the expressiveness of probabilistic XML models

The VLDB Journal — The International Journal on Very Large Data Bases
Querying and updating probabilistic information in XML

EDBT'06 Proceedings of the 10th international conference on Advances in Database Technology

On the expressiveness of probabilistic XML models

The VLDB Journal — The International Journal on Very Large Data Bases
Updating probabilistic XML

Proceedings of the 2010 EDBT/ICDT Workshops
Aggregate queries for discrete and continuous probabilistic XML

Proceedings of the 13th International Conference on Database Theory
Probabilistic XML via Markov Chains

Proceedings of the VLDB Endowment
Tractability in probabilistic databases

Proceedings of the 14th International Conference on Database Theory
A probabilistic XML merging tool

Proceedings of the 14th International Conference on Extending Database Technology
Value joins are expensive over (probabilistic) XML

Proceedings of the 4th International Workshop on Logic in Databases
Efficient query evaluation over probabilistic XML with long-distance dependencies

Proceedings of the 2011 Joint EDBT/ICDT Ph.D. Workshop
ProApproX: a lightweight approximation query processor over probabilistic trees

Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
The monte carlo database system: Stochastic analysis close to the data

ACM Transactions on Database Systems (TODS)
Boosting twig joins in probabilistic XML

DEXA'11 Proceedings of the 22nd international conference on Database and expert systems applications - Volume Part II
Capturing continuous data and answering aggregate queries in probabilistic XML

ACM Transactions on Database Systems (TODS)
Bayesian network-based probabilistic XML keywords filtering

DASFAA'12 Proceedings of the 17th international conference on Database Systems for Advanced Applications
Answering queries using views over probabilistic XML: complexity and tractability

Proceedings of the VLDB Endowment
Efficient probabilistic XML query processing using an extended labeling scheme and a lightweight index

Information Processing and Management: an International Journal
Demonstrating ProApproX 2.0: a predictive query engine for probabilistic XML

Proceedings of the 21st ACM international conference on Information and knowledge management
ELCA evaluation for keyword search on probabilistic XML data

World Wide Web
Uncertain version control in open collaborative editing of tree-structured documents

Proceedings of the 2013 ACM symposium on Document engineering
On the connections between relational and XML probabilistic data models

BNCOD'13 Proceedings of the 29th British National conference on Big Data
Entity resolution for distributed probabilistic data

Distributed and Parallel Databases
Dynamically querying possibilistic XML data

Information Sciences: an International Journal
Formal transformation from fuzzy object-oriented databases to fuzzy XML

Applied Intelligence

Quantified Score

Hi-index	0.00

Visualization

Abstract

Query evaluation over probabilistic XML is explored. The queries are twig patterns with projection, and the data is represented in terms of three models of probabilistic XML (that extend existing ones in the literature). The first model makes an assumption of independence among the probabilistic junctions, whereas the second model can encode probabilistic dependencies. The third model combines the first two and, hence, is the most general. An efficient algorithm (under data complexity) is given for query evaluation in the first model. In addition, various optimizations are proposed, and their effectiveness is shown both analytically and experimentally. For the other two models, it is shown that every query is either intractable or trivial. Nonetheless, efficient (additive and multiplicative) approximation algorithms are given for these two models. Finally, Boolean queries are enriched by allowing disjunctions and negations of branches. The above algorithm for the first model is extended to handle these queries. For the other two models, there is an efficient additive approximation, and a multiplicative one also exists if there is no negation; in addition, it is shown that if the query is non-monotonic, then no efficient multiplicative approximation exists unless NP = RP.