Knowledge Discovery across Documents through Concept Chain Queries

  • Authors:
  • Wei Jin;Rohini K. Srihari

  • Affiliations:
  • University at Buffalo, State University of New York;University at Buffalo, State University of New York

  • Venue:
  • ICDMW '06 Proceedings of the Sixth IEEE International Conference on Data Mining - Workshops
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper focuses on detecting links between two concepts across text documents (e.g. two persons). We interpret such a query as finding the most meaningful evidence trail across documents that connect these two concepts. Here we propose a fast and efficient algorithm to perform this task. It is based on the idea of hypothesis generation originated by Swanson called "complementary structures in disjoint literatures" (CSD). We adapted the technique by (i) developing an alternate method of generating semantic profiles and (ii) extending the technique to generate concept chains. Counterterrorism corpus is used to evaluate the performance of this approach and demonstrates the effectiveness of our algorithm.