The SGML handbook
Retrieving collocations from text: Xtract
Computational Linguistics - Special issue on using large corpora: I
Using SGML as a basis for data-intensive NLP
ANLC '97 Proceedings of the fifth conference on Applied natural language processing
Towards a workbench for acquisition of domain knowledge from natural language
EACL '95 Proceedings of the seventh conference on European chapter of the Association for Computational Linguistics
Methods for precise named entity matching in digital collections
Proceedings of the 3rd ACM/IEEE-CS joint conference on Digital libraries
Evolving GATE to meet new challenges in language engineering
Natural Language Engineering
An overview of methods and tools for ontology learning from texts
The Knowledge Engineering Review
GATE: an architecture for development of robust HLT applications
ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Building automatically a business registration ontology
dg.o '02 Proceedings of the 2002 annual national conference on Digital government research
Learning meronyms from biomedical text
ACLstudent '05 Proceedings of the ACL Student Research Workshop
IntEx: a syntactic role driven protein-protein interaction extractor for bio-medical text
ISMB '05 Proceedings of the ACL-ISMB Workshop on Linking Biological Literature, Ontologies and Databases: Mining Biological Semantics
Merging stories with shallow semantics
KRAQ '06 Proceedings of the Workshop KRAQ'06 on Knowledge and Reasoning for Language Processing
Biomedical question answering: A survey
Computer Methods and Programs in Biomedicine
A Dempster-Shafer model for document retrieval using noun phrases
IRSG'98 Proceedings of the 20th Annual BCS-IRSG conference on Information Retrieval Research
Relation mining experiments in the pharmacogenomics domain
Journal of Biomedical Informatics
An automatic approach for ontology-based feature extraction from heterogeneous textualresources
Engineering Applications of Artificial Intelligence
Hi-index | 0.00 |
In this paper we report on a set of computational tools with (n)SGML pipeline data flow for uncovering internal structure in natural language texts. The main idea behind the workbench is the independence of the text representation and text analysis phases. At the representation phase the text is converted from a sequence of characters to features of interest by means of the annotation tools. At the analysis phase those features are used by statistics gathering and inference tools for finding significant correlations in the texts. The analysis tools are independent of particular assumptions about the nature of the feature-set and work on the abstract level of feature-elements represented as SGML items.