Aligning predicates across monolingual comparable texts using graph-based clustering

Authors:
Michael Roth;Anette Frank
Affiliations:
Heidelberg University, Germany;Heidelberg University, Germany
Venue:
EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Year:
2012

Citing 25
Cited 0

A new approach to the maximum flow problem

STOC '86 Proceedings of the eighteenth annual ACM symposium on Theory of computing
Empirical methods for artificial intelligence

Empirical methods for artificial intelligence
Centering: a framework for modeling the local coherence of discourse

Computational Linguistics
An Information-Theoretic Definition of Similarity

ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
A systematic comparison of various statistical alignment models

Computational Linguistics
The mathematics of statistical machine translation: parameter estimation

Computational Linguistics - Special issue on using large corpora: II
The Proposition Bank: An Annotated Corpus of Semantic Roles

Computational Linguistics
Alignment by agreement

HLT-NAACL '06 Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics
Modeling local coherence: An entity-based approach

Computational Linguistics
Constructing corpora for the development and evaluation of paraphrase systems

Computational Linguistics
Anomalies in the WordNet verb hierarchy

COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Clustering and matching headlines for automatic paraphrase acquisition

ENLG '09 Proceedings of the 12th European Workshop on Natural Language Generation
A phrase-based alignment model for natural language inference

EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
WordNet::Similarity: measuring the relatedness of concepts

HLT-NAACL--Demonstrations '04 Demonstration Papers at HLT-NAACL 2004
The GREC main subject reference generation challenge 2009: overview and evaluation results

UCNLG+Sum '09 Proceedings of the 2009 Workshop on Language Generation and Summarisation
Bootstrapping semantic analyzers from non-contradictory texts

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Assessing the role of discourse references in entailment inference

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
SemEval-2010 task 10: Linking events and their participants in discourse

SemEval '10 Proceedings of the 5th International Workshop on Semantic Evaluation
Very high accuracy and fast dependency parsing is not a contradiction

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Recognising entailment within discourse

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
A high-performance syntactic and semantic dependency parser

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Demonstrations
The PASCAL recognising textual entailment challenge

MLCW'05 Proceedings of the First international conference on Machine Learning Challenges: evaluating Predictive Uncertainty Visual Object Classification, and Recognizing Textual Entailment
Semantic topic models: combining word distributional statistics and dictionary definitions

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Aligning predicate argument structures in monolingual comparable texts: a new corpus for a new task

SemEval '12 Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation
SemEval-2012 task 6: a pilot on semantic textual similarity

SemEval '12 Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation

Quantified Score

Hi-index	0.00

Visualization

Abstract

Generating coherent discourse is an important aspect in natural language generation. Our aim is to learn factors that constitute coherent discourse from data, with a focus on how to realize predicate-argument structures in a model that exceeds the sentence level. We present an important subtask for this overall goal, in which we align predicates across comparable texts, admitting partial argument structure correspondence. The contribution of this work is two-fold: We first construct a large corpus resource of comparable texts, including an evaluation set with manual predicate alignments. Secondly, we present a novel approach for aligning predicates across comparable texts using graph-based clustering with Mincuts. Our method significantly outperforms other alignment techniques when applied to this novel alignment task, by a margin of at least 6.5 percentage points in F1-score.