Bayesian network-based probabilistic XML keywords filtering

  • Authors:
  • Chenjing Zhang;Kun Yue;Jinghua Zhu;Xiaoling Wang;Aoying Zhou

  • Affiliations:
  • College of Information Technology, Shanghai Ocean University, Shanghai, China and School of Computer Science and Technology, Fudan University, Shanghai, China;School of Information Science and Engineering, Yunnan University, Kunming, China;School of Computer Science and Technology, Fudan University, Shanghai, China;Shanghai Key Laboratory of Trustworthy Computing, Software Engineering Institute, East China Normal University, Shanghai, China;Shanghai Key Laboratory of Trustworthy Computing, Software Engineering Institute, East China Normal University, Shanghai, China

  • Venue:
  • DASFAA'12 Proceedings of the 17th international conference on Database Systems for Advanced Applications
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Data uncertainty appears in many important XML applications. Recent probabilistic XML models represent different dependency correlations of sibling nodes by adding various kinds of distributional nodes, while there does not exist a uniform probability calculation method for different dependency correlations. Since Bayesian Networks can denote various dependency correlations among nodes just by conditional probability table(CPT), this paper proposes the Bayesian Networks based probabilistic XML model PrXML-BN, and combines SLCA semantic meaning of keyword query into Bayesian Networks, then implements keywords filtering on SLCA semantic meaning. To optimize the performance of keywords filtering, two optimization strategies are proposed in this paper. In the end, experiments verify the performance of keywords filtering algorithm based on SLCA in model PrXML-BN.