The SGML handbook
OHSUMED: an interactive retrieval evaluation and new large test collection for research
SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
Designing a top-level ontology of human beings: a multi-perspective approach
Journal of Computer Science and Technology
TERMINAE: A Linguistic-Based Tool for the Building of a Domain Ontology
EKAW '99 Proceedings of the 11th European Workshop on Knowledge Acquisition, Modeling and Management
Layout & language: preliminary experiments in assigning logical structure to table cells
ANLC '97 Proceedings of the fifth conference on Applied natural language processing
Using SGML as a basis for data-intensive NLP
ANLC '97 Proceedings of the fifth conference on Applied natural language processing
A workbench for finding structure in texts
ANLC '97 Proceedings of the fifth conference on Applied natural language processing
Mining tables from large scale HTML texts
COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 1
Information extraction with automatic knowledge expansion
Information Processing and Management: an International Journal
Multilingual question answering with high portability on relational databases
MultiSumQA '02 proceedings of the 2002 conference on multilingual summarization and question answering - Volume 19
Pattern mining across domain-specific text collections
MLDM'05 Proceedings of the 4th international conference on Machine Learning and Data Mining in Pattern Recognition
Automatic ontology extraction from unstructured texts
OTM'05 Proceedings of the 2005 OTM Confederated international conference on On the Move to Meaningful Internet Systems: CoopIS, COA, and ODBASE - Volume Part II
A rational agent for the construction of a semantic model
Proceedings of the COLING-2000 Workshop on Using Toolsets and Architectures To Build NLP Systems
Hi-index | 0.00 |
In this paper we describe an architecture and functionality of main components of a workbench for an acquisition of domain knowledge from large text corpora. The workbench supports an incremental process of corpus analysis starting from a rough automatic extraction and organization of lexico-semantic regularities and ending with a computer supported analysis of extracted data and a semiautomatic refinement of obtained hypotheses. For doing this the workbench employs methods from computational linguistics, information retrieval and knowledge engineering. Although the work-bench is currently under implementation some of its components are already implemented and their performance is illustrated with samples from engineering for a medical domain.