We address the problem of sentence alignment for monolingual corpora, a task distinct from sentence alignment in parallel corpora. Automatically aligning large comparable corpora would provide a valuable resource for learning text-to-text rewriting rules. We incorporate context into the search for an optimal alignment in two complementary ways: first, by learning rules for matching paragraphs based on topic structure; second, by refining that matching through local alignment to find good sentence pairs. Evaluation shows that our alignment method outperforms state-of-the-art systems developed for the same task.
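The local-alignment step mentioned above can be illustrated with a small dynamic-programming sketch. This is not the authors' implementation: the bag-of-words cosine similarity, the `threshold` value of 0.3, the zero-cost skips, and the names `cosine` and `local_align` are all assumptions chosen for illustration, and the topic-based paragraph matching is not modeled here.

```python
import math
import re
from collections import Counter


def cosine(a, b):
    """Bag-of-words cosine similarity between two sentences (illustrative choice)."""
    va = Counter(re.findall(r"\w+", a.lower()))
    vb = Counter(re.findall(r"\w+", b.lower()))
    dot = sum(va[w] * vb[w] for w in va)
    norm = math.sqrt(sum(c * c for c in va.values())) * math.sqrt(
        sum(c * c for c in vb.values())
    )
    return dot / norm if norm else 0.0


def local_align(src, tgt, threshold=0.3):
    """Return order-preserving (src_index, tgt_index) sentence pairs.

    Each matched pair contributes (similarity - threshold) to the score;
    skipping a sentence on either side costs nothing (an assumption).
    """
    n, m = len(src), len(tgt)
    score = [[0.0] * (m + 1) for _ in range(n + 1)]
    back = [[None] * (m + 1) for _ in range(n + 1)]
    best, best_cell = 0.0, None
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            gain = cosine(src[i - 1], tgt[j - 1]) - threshold
            # The match option is listed last so that ties favour skipping.
            candidates = [
                (0.0, None),                                  # restart the alignment
                (score[i - 1][j], (i - 1, j)),                # skip a source sentence
                (score[i][j - 1], (i, j - 1)),                # skip a target sentence
                (score[i - 1][j - 1] + gain, (i - 1, j - 1)),  # match the pair
            ]
            score[i][j], back[i][j] = max(candidates, key=lambda c: c[0])
            if score[i][j] > best:
                best, best_cell = score[i][j], (i, j)
    # Trace back from the best-scoring cell, keeping only the match moves.
    pairs = []
    cell = best_cell
    while cell is not None and score[cell[0]][cell[1]] > 0:
        i, j = cell
        prev = back[i][j]
        if prev == (i - 1, j - 1):
            pairs.append((i - 1, j - 1))
        cell = prev
    return pairs[::-1]


src = [
    "The cat sat on the mat.",
    "Stocks fell sharply on Monday.",
    "Rain is expected tomorrow.",
]
tgt = [
    "Unrelated opening line here.",
    "A cat sat on a mat.",
    "Rain is expected on tomorrow evening.",
]
print(local_align(src, tgt))
```

In this toy input the second source sentence has no sufficiently similar counterpart, so it is left unaligned; subtracting `threshold` from each similarity is what lets the search prefer skipping a weak pair over forcing a match.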