From text to hypertext by indexing

  • Authors:
  • Airi Salminen;Jean Tague-Sutcliffe;Charles McClellan

  • Affiliations:
  • Univ. of Jyväskylä, Jyväskylä, Finland;Univ. of Western Ontario, London, Ont., Canada;Univ. of Western Ontario, London, Ont., Canada

  • Venue:
  • ACM Transactions on Information Systems (TOIS)
  • Year:
  • 1995

Abstract

A model is presented for converting a collection of documents to hypertext by means of indexing. The documents are assumed to be semistructured, i.e., their text is a hierarchy of parts, and some of the parts consist of natural language. The model is intended as a framework for specifying hypertextual reading capabilities for specific application areas and for developing new automated tools for the conversion of semistructured text to hypertext. In the model, two well-known paradigms, formal grammars and document indexing, are combined. The structure of the source text is defined by a schema that is a constrained context-free grammar. The hierarchic structure of the source may thus be modeled by a parse tree for the grammar. The effect of indexing is described by grammar transformations. The new grammar, called an indexing schema, is associated with a new parse tree where some text parts are index elements. The indexing schema may hide some parts of the original documents or the structure of some parts. For information retrieval, parts of the indexed text are considered to be nodes of a hypergraph. In the hypergraph-based information access, the navigation capabilities of the hypertext systems are combined with the querying capabilities of information retrieval systems.
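
The pipeline outlined in the abstract (a hierarchy of text parts, indexing of selected parts, and a hypergraph over the indexed parts) can be illustrated with a small Python sketch. This is not the authors' formalism: the `Part` class, the functions `index_parts` and `build_hypergraph`, the word-matching indexing rule, and the choice to treat each index term as a hyperedge joining the parts that carry it are all assumptions made for illustration. The abstract does not say how hyperedges are defined.

```python
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class Part:
    """One node of the parse tree: a named part of a semistructured document."""
    label: str                                    # nonterminal name (hypothetical schema)
    text: str = ""                                # natural-language content of a leaf part
    children: list = field(default_factory=list)  # sub-parts, in document order
    index_terms: set = field(default_factory=set) # filled in by index_parts()


def walk(part):
    """Yield every part of the tree in document order (pre-order)."""
    yield part
    for child in part.children:
        yield from walk(child)


def index_parts(root, indexed_labels, vocabulary):
    """Attach index terms to the parts whose label is selected for indexing.

    Assumption: a term indexes a part if it occurs as a word in the part's text.
    """
    for part in walk(root):
        if part.label in indexed_labels:
            words = {w.strip(".,;").lower() for w in part.text.split()}
            part.index_terms = words & vocabulary


def build_hypergraph(root):
    """Map each index term (a hyperedge, by assumption) to the parts it connects."""
    edges = defaultdict(list)
    for part in walk(root):
        for term in sorted(part.index_terms):
            edges[term].append(part)
    return dict(edges)


# A toy document: the tree plays the role of the parse tree for the schema.
doc = Part("article", children=[
    Part("title", "From text to hypertext by indexing"),
    Part("section", children=[
        Part("paragraph", "The schema is a context-free grammar."),
        Part("paragraph", "Indexing transforms the grammar."),
    ]),
])

index_parts(doc, {"title", "paragraph"}, {"grammar", "indexing", "hypertext"})
graph = build_hypergraph(doc)

# Querying by term returns the connected parts, which a reader could then navigate.
for term in sorted(graph):
    print(term, [p.label for p in graph[term]])
```

In this sketch, looking up a term in `graph` stands in for querying, and moving between the parts that share a term stands in for navigation. Hiding parts or their structure, which the indexing schema allows, is not modeled here.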