Text Schema Mining Using Graphs and Formal Concept Analysis

  • Authors:
  • Felix H. Gatzemeier; Oliver Meyer

  • Affiliations:
  • -;-

  • Venue:
  • ICCS '02 Proceedings of the 10th International Conference on Conceptual Structures: Integration and Interfaces
  • Year:
  • 2002

Abstract

This paper presents an investigation into finding and evaluating schemata through formal concept analysis. Schemata are used in conceptual authoring support to provide proven building blocks of text structures. As only a few schemata are available so far, ways to mine them from the structures of existing texts seem worthwhile. The general process begins with the structure of a text as a graph, transforms this graph into a formal context, and examines the formal concept lattice for this context. Formal concepts with large extents, in particular, may be candidates for schemata. Three alternative kinds of transformations are presented: (1) Wille's natural transformation, which produces contexts based mainly on type and connection information; (2) schema-derived transformations, which derive attributes identifying partial or complete instances from a set of schemata; and (3) informal transformations, in which, starting from a set of schemata, conditions are formulated manually that may be present in the instance graph and contribute to the presence of such schemata. We have considered document structures consisting of a hierarchy of sections and subsections, which may import and export topics. The topics are interconnected in a conceptual graph called the topic map. Results of processing two such structures with the natural transformation and an informal one are reported. Some notes on the implementation in the Chasid prototype are given.
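The step from a formal context to its formal concepts can be illustrated with a small sketch. This is not the Chasid implementation and does not reproduce any of the three transformations from the paper. It assumes a context has already been derived from a document graph, and it enumerates concepts by brute force, which is only practical for tiny contexts. The object names (`sec1`, `sec2.1`, and so on), the attribute names (`section`, `imports_topic`, `exports_topic`, `has_subsection`), and the helper `schema_candidates` are hypothetical and chosen only for illustration.

```python
from itertools import combinations

# Hypothetical formal context: objects are nodes of a document graph,
# attributes are type/connection properties (assumed, not from the paper).
context = {
    "sec1": {"section", "exports_topic", "has_subsection"},
    "sec1.1": {"section", "exports_topic"},
    "sec2": {"section", "imports_topic", "has_subsection"},
    "sec2.1": {"section", "imports_topic", "exports_topic"},
}


def common_attributes(ctx, objects):
    """Return the attributes shared by all given objects.

    For an empty set of objects this is the full attribute set.
    """
    shared = set().union(*ctx.values())
    for obj in objects:
        shared &= ctx[obj]
    return frozenset(shared)


def common_objects(ctx, attributes):
    """Return the objects that have every one of the given attributes."""
    return frozenset(o for o, attrs in ctx.items() if attributes <= attrs)


def formal_concepts(ctx):
    """Enumerate all formal concepts (extent, intent) by brute force.

    Every subset of objects is closed: its shared attributes form the
    intent, and all objects carrying that intent form the extent.
    Exponential in the number of objects, so for small examples only.
    """
    concepts = set()
    objects = list(ctx)
    for size in range(len(objects) + 1):
        for subset in combinations(objects, size):
            intent = common_attributes(ctx, subset)
            extent = common_objects(ctx, intent)
            concepts.add((extent, intent))
    return concepts


def schema_candidates(concepts, min_extent):
    """Keep concepts whose extent has at least `min_extent` objects,
    ordered from largest to smallest extent."""
    kept = [c for c in concepts if len(c[0]) >= min_extent]
    return sorted(kept, key=lambda c: len(c[0]), reverse=True)


concepts = formal_concepts(context)
candidates = schema_candidates(concepts, min_extent=2)

for extent, intent in candidates:
    print(sorted(extent), sorted(intent))
```

Ranking by extent size mirrors the abstract's remark that concepts with large extents may be schema candidates. In practice the top concept (all objects, here sharing only `section`) is usually too general to be useful, so further filtering would be needed.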