Discovering Relations Among Entities from XML Documents

  • Authors:
  • Yangyang Wu;Qing Lei;Wei Luo;Harou Yokota

  • Affiliations:
  • Department of Computer Science, Huaqiao University, Quanzhou Fujian, China;Department of Computer Science, Huaqiao University, Quanzhou Fujian, China;Department of Computer Science, Huaqiao University, Quanzhou Fujian, China;Department of Computer Science, Tokyo Institute of Technology, Tokyo, Japan

  • Venue:
  • MLDM '07 Proceedings of the 5th international conference on Machine Learning and Data Mining in Pattern Recognition
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper addresses relation information extraction problem and proposes a method of discovering relations among entities which is buried in different nest structures of XML documents. The method first identifies and collects XML fragments that contain all types of entities given by users, then computes similarity between fragments based on semantics of their tags and their structures, and clusters fragments by similarity so that the fragments containing the same relation are clustered together, finally extracts relation instances and patterns of their occurrences from each cluster. The results of experiments show that the method can identify and extract relation information among given types of entities correctly from all kinds of XML documents with meaningful tags.