Summarization beyond sentence extraction: a probabilistic approach to sentence compression
Artificial Intelligence
Cohesive Generation of Syntactically Simplified Newspaper Text
TDS '00 Proceedings of the Third International Workshop on Text, Speech and Dialogue
A systematic comparison of various statistical alignment models
Computational Linguistics
BLEU: a method for automatic evaluation of machine translation
ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Accurate unlexicalized parsing
ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
Improved statistical alignment models
ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
Sentence alignment for monolingual comparable corpora
EMNLP '03 Proceedings of the 2003 conference on Empirical methods in natural language processing
Supervised and unsupervised learning for sentence compression
ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Discriminative reranking for semantic parsing
COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
Discriminative sentence compression with conditional random fields
Information Processing and Management: an International Journal
Mining wikipedia revision histories for improving sentence compression
HLT-Short '08 Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers
Moses: open source toolkit for statistical machine translation
ACL '07 Proceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions
Intersecting multilingual data for faster and better statistical translations
NAACL '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Sentence compression as tree transduction
Journal of Artificial Intelligence Research
A comparison of model free versus model intensive approaches to sentence compression
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1 - Volume 1
For the sake of simplicity: unsupervised extraction of lexical simplifications from Wikipedia
HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Learning to translate with source and target syntax
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Learning simple Wikipedia: a cogitation in ascertaining abecedarian language
CL&W '10 Proceedings of the NAACL HLT 2010 Workshop on Computational Linguistics and Writing: Writing Processes and Authoring Aids
Paraphrase generation as monolingual translation: data and evaluation
INLG '10 Proceedings of the 6th International Natural Language Generation Conference
Entity-focused sentence simplification for relation extraction
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
A monolingual tree-based translation model for sentence simplification
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Simple English Wikipedia: a new text simplification task
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
Sentence simplification by monolingual machine translation
ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
A hybrid system for Spanish text simplification
SLPAT '12 Proceedings of the Third Workshop on Speech and Language Processing for Assistive Technologies
Comparing resources for spanish lexical simplification
SLSP'13 Proceedings of the First international conference on Statistical Language and Speech Processing
Text simplification resources for Spanish
Language Resources and Evaluation
Hi-index | 0.00 |
In this paper we examine the sentence simplification problem as an English-to-English translation problem, utilizing a corpus of 137K aligned sentence pairs extracted by aligning English Wikipedia and Simple English Wikipedia. This data set contains the full range of transformation operations including rewording, reordering, insertion and deletion. We introduce a new translation model for text simplification that extends a phrase-based machine translation approach to include phrasal deletion. Evaluated based on three metrics that compare against a human reference (BLEU, word-F1 and SSA) our new approach performs significantly better than two text compression techniques (including T3) and the phrase-based translation system without deletion.