Knowledge organization and access in a conceptual information system
Information Processing and Management: an International Journal - Artificial Intelligence and Information Retrieval
Semantic database modeling: survey, applications, and research issues
ACM Computing Surveys (CSUR)
MAFIA—an active mail-filter-agent for an intelligent document processing support
Proceedings of the IFIP WG 8.4 confernece on Multi-user interfaces and applications
Toward principles for the design of ontologies used for knowledge sharing
International Journal of Human-Computer Studies - Special issue: the role of formal ontology in the information technology
Electronic markets for learning: education brokerages on the Internet
Communications of the ACM
An abductive, linguistic approach to model retrieval
Data & Knowledge Engineering - Special issue: natural language for data bases (workshop 1996)
A Methodology for the Design of Distributed Web Systems
CAiSE '97 Proceedings of the 9th International Conference on Advanced Information Systems Engineering
Content-Based Organization of the Information Space in Multi-Database Networks
CAiSE '98 Proceedings of the 10th International Conference on Advanced Information Systems Engineering
ICE: an object oriented toolkit for tailoring collaborative
Proceedings of the IFIP TC8/WG8.1 Working Conference on Information Systems in the WWW Environment
A Conceptual-Modeling Approach to Extracting Data from the Web
ER '98 Proceedings of the 17th International Conference on Conceptual Modeling
Structure-Based Queries over the World Wide Web
ER '98 Proceedings of the 17th International Conference on Conceptual Modeling
Constructing common information spaces
ECSCW'97 Proceedings of the fifth conference on European Conference on Computer-Supported Cooperative Work
Hi-index | 0.00 |
When publishing documents on the Web, the user needs to describe and classify her documents for the benefit of later retrieval and use. This paper presents an approach to semantic document classification and retrieval based on Natural Language Processing and Conceptual Modeling. The Referent Model language is used in combination with a lexical analysis tool to define a controlled vocabulary for classifying documents. Documents are classified by means of sentences that contain the high frequency words in the document that also occur in the domain model defining the vocabulary. The sentences are parsed using a DCG-like grammar, mapped into a Referent Model fragment and stored along with the document using RDF-XML syntax. The model fragment represents the connection between the document and the domain model and serves as a document index. The approach is being implemented for a document collection published by the Norwegian Center for Medical Informatics (KITH).