Managing texts and facts in a mixed data base environment
Proc. of the ICOD-2 workshop on New applications of data bases
Restructuring of complex objects and office forms
Proceedings on International conference on database theory
A domain theoretic approach to higher-order relations
Proceedings on International conference on database theory
SIGMOD '86 Proceedings of the 1986 ACM SIGMOD international conference on Management of data
Document processing in a relational database system
ACM Transactions on Information Systems (TOIS)
Principles of Database Systems
Principles of Database Systems
The document concept in a data base
SIGMOD '82 Proceedings of the 1982 ACM SIGMOD international conference on Management of data
Non first normal form relations to represent hierarchically organized data
PODS '84 Proceedings of the 3rd ACM SIGACT-SIGMOD symposium on Principles of database systems
Relational algebras, logic, and functional programming
SIGMOD '84 Proceedings of the 1984 ACM SIGMOD international conference on Management of data
Data Structures for an Integrated Data Base Management and Information Retrieval System
VLDB '82 Proceedings of the 8th International Conference on Very Large Data Bases
OTTER - An information retrieval system for office automation
COCS '84 Proceedings of the second ACM-SIGOA conference on Office information systems
An object-oriented Office Document Architecture model for processing and interchange of documents
COCS '84 Proceedings of the second ACM-SIGOA conference on Office information systems
The structure of abstract document objects
COCS '84 Proceedings of the second ACM-SIGOA conference on Office information systems
Sql/nf: a Query Language for ~1NF Relational Databases
Sql/nf: a Query Language for ~1NF Relational Databases
A non-first-normal-form relational database model
A non-first-normal-form relational database model
Concept and prototype of a collaborative business process environment for document processing
Data & Knowledge Engineering - Special issue: Collaborative business process technologies
Report on the DB/IR panel at SIGMOD 2005
ACM SIGMOD Record
Expressiveness and performance of full-text search languages
EDBT'06 Proceedings of the 10th international conference on Advances in Database Technology
Hi-index | 0.00 |
The logical structure of a document is usually a tree in which the order of the nodes is important at least at some level of the tree. We call a document unstructured if its structure is a single-level ordered tree. The purpose of this paper is to present a many-sorted algebra for handling unstructured documents. The documents in the model are represented by relations. An algebra for handling documents of one type can be extended to an algebra for handling documents of several types. Further, an algebra for handling documents can be extended by the relational algebra for handling documents and relations in a common algebra. The model of this paper can be regarded as a part of a general document model. On the other hand, unstructured documents themselves are an important group of documents. We will show by examples that the simple model covers a wide range of document handling and information retrieval problems.