Facilitating the analysis of discourse phenomena in an interoperable NLP platform

  • Authors:
  • Riza Theresa Batista-Navarro;Georgios Kontonatsios;Claudiu Mihăilă;Paul Thompson;Rafal Rak;Raheel Nawaz;Ioannis Korkontzelos;Sophia Ananiadou

  • Affiliations:
  • The National Centre for Text Mining, The University of Manchester, Manchester, UK;The National Centre for Text Mining, The University of Manchester, Manchester, UK;The National Centre for Text Mining, The University of Manchester, Manchester, UK;The National Centre for Text Mining, The University of Manchester, Manchester, UK;The National Centre for Text Mining, The University of Manchester, Manchester, UK;The National Centre for Text Mining, The University of Manchester, Manchester, UK;The National Centre for Text Mining, The University of Manchester, Manchester, UK;The National Centre for Text Mining, The University of Manchester, Manchester, UK

  • Venue:
  • CICLing'13 Proceedings of the 14th international conference on Computational Linguistics and Intelligent Text Processing - Volume Part I
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

The analysis of discourse phenomena is essential in many natural language processing (NLP) applications. The growing diversity of available corpora and NLP tools brings a multitude of representation formats. In order to alleviate the problem of incompatible formats when constructing complex text mining pipelines, the Unstructured Information Management Architecture (UIMA) provides a standard means of communication between tools and resources. U-Compare, a text mining workflow construction platform based on UIMA, further enhances interoperability through a shared system of data types, allowing free combination of compliant components into workflows. Although U-Compare and its type system already support syntactic and semantic analyses, support for the analysis of discourse phenomena was previously lacking. In response, we have extended the U-Compare type system with new discourse-level types. We illustrate processing and visualisation of discourse information in U-Compare by providing several new deserialisation components for corpora containing discourse annotations. The new U-Compare is downloadable from http://nactem.ac.uk/ucompare.