Facilitating the analysis of discourse phenomena in an interoperable NLP platform

Authors:
Riza Theresa Batista-Navarro;Georgios Kontonatsios;Claudiu Mihăilă;Paul Thompson;Rafal Rak;Raheel Nawaz;Ioannis Korkontzelos;Sophia Ananiadou
Affiliations:
The National Centre for Text Mining, The University of Manchester, Manchester, UK;The National Centre for Text Mining, The University of Manchester, Manchester, UK;The National Centre for Text Mining, The University of Manchester, Manchester, UK;The National Centre for Text Mining, The University of Manchester, Manchester, UK;The National Centre for Text Mining, The University of Manchester, Manchester, UK;The National Centre for Text Mining, The University of Manchester, Manchester, UK;The National Centre for Text Mining, The University of Manchester, Manchester, UK;The National Centre for Text Mining, The University of Manchester, Manchester, UK
Venue:
CICLing'13 Proceedings of the 14th international conference on Computational Linguistics and Intelligent Text Processing - Volume Part I
Year:
2013

Citing 15
Cited 0

Centering: a framework for modeling the local coherence of discourse

Computational Linguistics
The Theory and Practice of Discourse Parsing and Summarization

The Theory and Practice of Discourse Parsing and Summarization
UIMA: an architectural approach to unstructured information processing in the corporate research environment

Natural Language Engineering
Speech and Language Processing (2nd Edition)

Speech and Language Processing (2nd Edition)
Discourse processing for context question answering based on linguistic knowledge

Knowledge-Based Systems
Biomedical named entity recognition using conditional random fields and rich feature sets

JNLPBA '04 Proceedings of the International Joint Workshop on Natural Language Processing in Biomedicine and its Applications
Accelerating the annotation of sparse named entities by dynamic sentence selection

BioNLP '08 Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing
U-Compare

Bioinformatics
Middleware for creating and combining multi-dimensional NLP markup

NLPXML '06 Proceedings of the 5th Workshop on NLP and XML: Multi-Dimensional Markup in Natural Language Processing
Coreference for learning to extract relations: yes, Virginia, coreference matters

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
Building a coreference-annotated corpus from the domain of biochemistry

BioNLP '11 Proceedings of BioNLP 2011 Workshop
Methodological Review: A review of causal inference for biomedical informatics

Journal of Biomedical Informatics
BioNLP Shared Task 2011: supporting resources

BioNLP Shared Task '11 Proceedings of the BioNLP Shared Task 2011 Workshop
Scaling up high-value retrieval to medium-volume data

IRFC'10 Proceedings of the First international Information Retrieval Facility conference on Adbances in Multidisciplinary Retrieval
Identifying claimed knowledge updates in biomedical research articles

ACL '12 Proceedings of the Workshop on Detecting Structure in Scholarly Discourse

Quantified Score

Hi-index	0.00

Visualization

Abstract

The analysis of discourse phenomena is essential in many natural language processing (NLP) applications. The growing diversity of available corpora and NLP tools brings a multitude of representation formats. In order to alleviate the problem of incompatible formats when constructing complex text mining pipelines, the Unstructured Information Management Architecture (UIMA) provides a standard means of communication between tools and resources. U-Compare, a text mining workflow construction platform based on UIMA, further enhances interoperability through a shared system of data types, allowing free combination of compliant components into workflows. Although U-Compare and its type system already support syntactic and semantic analyses, support for the analysis of discourse phenomena was previously lacking. In response, we have extended the U-Compare type system with new discourse-level types. We illustrate processing and visualisation of discourse information in U-Compare by providing several new deserialisation components for corpora containing discourse annotations. The new U-Compare is downloadable from http://nactem.ac.uk/ucompare.