A workbench for finding structure in texts

Authors:
Andrei Mikheev;Steven Finch
Affiliations:
University of Edinburgh, Edinburgh, UK;Thomson Technical Labs, Rockville, Maryland
Venue:
ANLC '97 Proceedings of the fifth conference on Applied natural language processing
Year:
1997

Citing 4
Cited 12

The SGML handbook

The SGML handbook
Retrieving collocations from text: Xtract

Computational Linguistics - Special issue on using large corpora: I
Using SGML as a basis for data-intensive NLP

ANLC '97 Proceedings of the fifth conference on Applied natural language processing
Towards a workbench for acquisition of domain knowledge from natural language

EACL '95 Proceedings of the seventh conference on European chapter of the Association for Computational Linguistics

Methods for precise named entity matching in digital collections

Proceedings of the 3rd ACM/IEEE-CS joint conference on Digital libraries
Evolving GATE to meet new challenges in language engineering

Natural Language Engineering
An overview of methods and tools for ontology learning from texts

The Knowledge Engineering Review
GATE: an architecture for development of robust HLT applications

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Building automatically a business registration ontology

dg.o '02 Proceedings of the 2002 annual national conference on Digital government research
Learning meronyms from biomedical text

ACLstudent '05 Proceedings of the ACL Student Research Workshop
IntEx: a syntactic role driven protein-protein interaction extractor for bio-medical text

ISMB '05 Proceedings of the ACL-ISMB Workshop on Linking Biological Literature, Ontologies and Databases: Mining Biological Semantics
Merging stories with shallow semantics

KRAQ '06 Proceedings of the Workshop KRAQ'06 on Knowledge and Reasoning for Language Processing
Biomedical question answering: A survey

Computer Methods and Programs in Biomedicine
A Dempster-Shafer model for document retrieval using noun phrases

IRSG'98 Proceedings of the 20th Annual BCS-IRSG conference on Information Retrieval Research
Relation mining experiments in the pharmacogenomics domain

Journal of Biomedical Informatics
An automatic approach for ontology-based feature extraction from heterogeneous textualresources

Engineering Applications of Artificial Intelligence

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper we report on a set of computational tools with (n)SGML pipeline data flow for uncovering internal structure in natural language texts. The main idea behind the workbench is the independence of the text representation and text analysis phases. At the representation phase the text is converted from a sequence of characters to features of interest by means of the annotation tools. At the analysis phase those features are used by statistics gathering and inference tools for finding significant correlations in the texts. The analysis tools are independent of particular assumptions about the nature of the feature-set and work on the abstract level of feature-elements represented as SGML items.