ProTDB: probabilistic data in XML

Authors:
Andrew Nierman;H. V. Jagadish
Affiliations:
Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, MI;Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, MI
Venue:
VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
Year:
2002

Citing 14
Cited 66

A probabilistic relational model and algebra

ACM Transactions on Database Systems (TODS)
A probabilistic relational algebra for the integration of information retrieval and database systems

ACM Transactions on Information Systems (TOIS)
Query evaluation in probabilistic relational databases

Selected papers from the international workshop on Uncertainty in databases and deductive systems
Dempster-Shafer's theory of evidence applied to structured documents: modelling uncertainty

Proceedings of the 20th annual international ACM SIGIR conference on Research and development in information retrieval
ProbView: a flexible probabilistic database system

ACM Transactions on Database Systems (TODS)
PSQL: a query language for probabilistic relational data

Data & Knowledge Engineering - Special issue on ER '97
XIRQL: a query language for information retrieval in XML documents

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Getting answers to natural language questions on the web

Journal of the American Society for Information Science and Technology
Probabilistic question answering on the web

Proceedings of the 11th international conference on World Wide Web
The Management of Probabilistic Data

IEEE Transactions on Knowledge and Data Engineering
An Algebra for Probabilistic Databases

IEEE Transactions on Knowledge and Data Engineering
The Index-Based XXL Search Engine for Querying XML Data with Relevance Ranking

EDBT '02 Proceedings of the 8th International Conference on Extending Database Technology: Advances in Database Technology
The Theory of Probabilistic Databases

VLDB '87 Proceedings of the 13th International Conference on Very Large Data Bases
TAX: A Tree Algebra for XML

DBPL '01 Revised Papers from the 8th International Workshop on Database Programming Languages

Probabilistic Interval XML

ICDT '03 Proceedings of the 9th International Conference on Database Theory
Querying structured text in an XML database

Proceedings of the 2003 ACM SIGMOD international conference on Management of data
TIMBER: a native system for querying XML

Proceedings of the 2003 ACM SIGMOD international conference on Management of data
A Probabilistic XML Approach to Data Integration

ICDE '05 Proceedings of the 21st International Conference on Data Engineering
A Framework for Management of Semistructured Probabilistic Data

Journal of Intelligent Information Systems
Merging uncertain information with semantic heterogeneity in XML

Knowledge and Information Systems
Efficient join processing over uncertain data

CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
PEPX: a query-friendly probabilistic XML database

CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Fusion rules for merging uncertain information

Information Fusion
On the complexity of managing probabilistic XML data

Proceedings of the twenty-sixth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Maximally joining probabilistic data

Proceedings of the twenty-sixth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Range search on multidimensional uncertain data

ACM Transactions on Database Systems (TODS)
Probabilistic interval XML

ACM Transactions on Computational Logic (TOCL)
Efficient query evaluation on probabilistic databases

The VLDB Journal — The International Journal on Very Large Data Bases
Fuzzy XML data modeling with the UML and relational data models

Data & Knowledge Engineering
Matching twigs in probabilistic XML

VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Query efficiency in probabilistic XML models

Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Incorporating constraints in probabilistic XML

Proceedings of the twenty-seventh ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Annotated XML: queries and provenance

Proceedings of the twenty-seventh ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Query Selectivity Estimation for Uncertain Data

SSDBM '08 Proceedings of the 20th international conference on Scientific and Statistical Database Management
Optimization of Queries over Interval Probabilistic Data

SUM '08 Proceedings of the 2nd international conference on Scalable Uncertainty Management
Query ranking in probabilistic XML data

Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
Modeling and querying probabilistic XML data

ACM SIGMOD Record
Fuzzy data modeling based on XML schema

Proceedings of the 2009 ACM symposium on Applied Computing
Probabilistic databases: diamonds in the dirt

Communications of the ACM - Barbara Liskov: ACM's A.M. Turing Award Winner
Running tree automata on probabilistic XML

Proceedings of the twenty-eighth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Incorporating constraints in probabilistic XML

ACM Transactions on Database Systems (TODS)
Information integration with uncertainty

IDEAS '09 Proceedings of the 2009 International Database Engineering & Applications Symposium
On the expressiveness of probabilistic XML models

The VLDB Journal — The International Journal on Very Large Data Bases
Query evaluation over probabilistic XML

The VLDB Journal — The International Journal on Very Large Data Bases
Efficient processing of twig pattern matching in fuzzy XML

Proceedings of the 18th ACM conference on Information and knowledge management
Updating probabilistic XML

Proceedings of the 2010 EDBT/ICDT Workshops
Aggregate queries for discrete and continuous probabilistic XML

Proceedings of the 13th International Conference on Database Theory
Querying parse trees of stochastic context-free grammars

Proceedings of the 13th International Conference on Database Theory
A Survey on Uncertainty Management in Data Integration

Journal of Data and Information Quality (JDIQ)
Matching twigs in fuzzy XML

Information Sciences: an International Journal
Tractability in probabilistic databases

Proceedings of the 14th International Conference on Database Theory
Value joins are expensive over (probabilistic) XML

Proceedings of the 4th International Workshop on Logic in Databases
Efficient query answering in probabilistic RDF graphs

Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
A hybrid algorithm for finding top-k twig answers in probabilistic XML

DASFAA'11 Proceedings of the 16th international conference on Database systems for advanced applications - Volume Part I
Edit distance between XML and probabilistic XML documents

DEXA'11 Proceedings of the 22nd international conference on Database and expert systems applications - Volume Part I
Capturing continuous data and answering aggregate queries in probabilistic XML

ACM Transactions on Database Systems (TODS)
Querying and updating probabilistic information in XML

EDBT'06 Proceedings of the 10th international conference on Advances in Database Technology
Matching top-k answers of twig patterns in probabilistic XML

DASFAA'10 Proceedings of the 15th international conference on Database Systems for Advanced Applications - Volume Part I
Measuring the quality of uncertain information using possibilistic logic

ECSQARU'05 Proceedings of the 8th European conference on Symbolic and Quantitative Approaches to Reasoning with Uncertainty
Ontology-Based user context management: the challenges of imperfection and time-dependence

ODBASE'06/OTM'06 Proceedings of the 2006 Confederated international conference on On the Move to Meaningful Internet Systems: CoopIS, DOA, GADA, and ODBASE - Volume Part I
Keywords filtering over probabilistic XML data

APWeb'12 Proceedings of the 14th Asia-Pacific international conference on Web Technologies and Applications
Bayesian network-based probabilistic XML keywords filtering

DASFAA'12 Proceedings of the 17th international conference on Database Systems for Advanced Applications
AN EFFICIENT REPRESENTATION MODEL OF DISTANCE DISTRIBUTION BETWEEN UNCERTAIN OBJECTS

Computational Intelligence
Answering queries using views over probabilistic XML: complexity and tractability

Proceedings of the VLDB Endowment
Efficient probabilistic XML query processing using an extended labeling scheme and a lightweight index

Information Processing and Management: an International Journal
On the foundations of probabilistic information integration

Proceedings of the 21st ACM international conference on Information and knowledge management
ELCA evaluation for keyword search on probabilistic XML data

World Wide Web
Construction of fuzzy ontologies from fuzzy XML models

Knowledge-Based Systems
Querying and ranking incomplete twigs in probabilistic XML

World Wide Web
Efficient processing of top-k twig queries over probabilistic XML data

World Wide Web
Search and result presentation in scientific workflow repositories

Proceedings of the 25th International Conference on Scientific and Statistical Database Management
Uncertain version control in open collaborative editing of tree-structured documents

Proceedings of the 2013 ACM symposium on Document engineering
Formal approach for reengineering fuzzy XML in fuzzy object-oriented databases

Applied Intelligence
Efficient processing of twig query with compound predicates in fuzzy XML

Fuzzy Sets and Systems
On the connections between relational and XML probabilistic data models

BNCOD'13 Proceedings of the 29th British National conference on Big Data
Storing and querying fuzzy XML data in relational databases

Applied Intelligence
Dynamically querying possibilistic XML data

Information Sciences: an International Journal
Formal translation from fuzzy EER model to fuzzy XML model

Expert Systems with Applications: An International Journal
Formal transformation from fuzzy object-oriented databases to fuzzy XML

Applied Intelligence
Incorporating fuzzy information into the formal mapping from web data model to extended entity-relationship model

Integrated Computer-Aided Engineering

Quantified Score

Hi-index	0.00

Visualization

Abstract

Where as traditional databases manage only deterministic information, many applications that use databases involve uncertain data. This paper presents a Probabilistic Tree Data Base (ProTDB) to manage probabilistic data, represented in XML. Our approach differs from previous efforts to develop probabilistic relational systems in that we build a probabilistic XML database. This design is driven by application needs that involve data not readily amenable to a relational representation. XML data poses several modeling challenges: due to its structure, due to the possibility of uncertainty association at multiple granularities, and due to the possibility of missing and repeated sub-elements. We present a probabilistic XML model that addresses all of these challenges. We devise an implementation of XML query operations using our probability model, and demonstrate the efficiency of our implementation experimentally. We have used ProTDB to manage data from two application areas: protein chemistry data from the bioinformatics domain, and information extraction data obtained from the web using a natural language analysis system. We present a brief case study of the latter to demonstrate the value of probabilistic XML data management.