DIRT @SBT@discovery of inference rules from text
Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
A systematic comparison of various statistical alignment models
Computational Linguistics
The mathematics of statistical machine translation: parameter estimation
Computational Linguistics - Special issue on using large corpora: II
HMM-based word alignment in statistical translation
COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 2
Extracting paraphrases from a parallel corpus
ACL '01 Proceedings of the 39th Annual Meeting on Association for Computational Linguistics
Learning to paraphrase: an unsupervised approach using multiple-sequence alignment
NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
Improved statistical alignment models
ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
An evaluation exercise for word alignment
HLT-NAACL-PARALLEL '03 Proceedings of the HLT-NAACL 2003 Workshop on Building and using parallel texts: data driven machine translation and beyond - Volume 3
Automatic paraphrase acquisition from news articles
HLT '02 Proceedings of the second international conference on Human Language Technology Research
Collecting paraphrase corpora from volunteer contributors
Proceedings of the 3rd international conference on Knowledge capture
Beyond SumBasic: Task-focused summarization with sentence simplification and lexical expansion
Information Processing and Management: an International Journal
Semantic text similarity using corpus-based word similarity and string similarity
ACM Transactions on Knowledge Discovery from Data (TKDD)
The Evaluation of Sentence Similarity Measures
DaWaK '08 Proceedings of the 10th international conference on Data Warehousing and Knowledge Discovery
Constructing corpora for the development and evaluation of paraphrase systems
Computational Linguistics
CICLing '07 Proceedings of the 8th International Conference on Computational Linguistics and Intelligent Text Processing
PAKDD '09 Proceedings of the 13th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
Corpus-based and knowledge-based measures of text semantic similarity
AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
ParaMetric: an automatic evaluation metric for paraphrasing
COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
A local alignment kernel in the context of NLP
COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Universal Mobile Information Retrieval
UAHCI '09 Proceedings of the 5th International on ConferenceUniversal Access in Human-Computer Interaction. Part II: Intelligent and Ubiquitous Interaction Environments
Paraphrase recognition via dissimilarity significance classification
EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Is sentence compression an NLG task?
ENLG '09 Proceedings of the 12th European Workshop on Natural Language Generation
Clustering and matching headlines for automatic paraphrase acquisition
ENLG '09 Proceedings of the 12th European Workshop on Natural Language Generation
A phrase-based alignment model for natural language inference
EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
StatMT '09 Proceedings of the Fourth Workshop on Statistical Machine Translation
Recognizing entailment in intelligent tutoring systems*
Natural Language Engineering
Classification errors in a domain-independent assessment system
EANL '08 Proceedings of the Third Workshop on Innovative Use of NLP for Building Educational Applications
Answering learners' questions by retrieving question paraphrases from social Q&A sites
EANL '08 Proceedings of the Third Workshop on Innovative Use of NLP for Building Educational Applications
The distributional similarity of sub-parses
EMSEE '05 Proceedings of the ACL Workshop on Empirical Modeling of Semantic Equivalence and Entailment
Measuring the semantic similarity of texts
EMSEE '05 Proceedings of the ACL Workshop on Empirical Modeling of Semantic Equivalence and Entailment
Machine learning with semantic-based distances between sentences for textual entailment
RTE '07 Proceedings of the ACL-PASCAL Workshop on Textual Entailment and Paraphrasing
Mutaphrase: paraphrasing with FrameNet
RTE '07 Proceedings of the ACL-PASCAL Workshop on Textual Entailment and Paraphrasing
Biology based alignments of paraphrases for sentence compression
RTE '07 Proceedings of the ACL-PASCAL Workshop on Textual Entailment and Paraphrasing
Assessing Student Paraphrases Using Lexical Semantics and Word Weighting
Proceedings of the 2009 conference on Artificial Intelligence in Education: Building Learning Systems that Care: From Knowledge Representation to Affective Modelling
Paraphrase recognition using machine learning to combine similarity measures
ACLstudent '09 Proceedings of the ACL-IJCNLP 2009 Student Research Workshop
Paraphrase identification as probabilistic quasi-synchronous recognition
ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1 - Volume 1
Improved statistical machine translation using monolingually-derived paraphrases
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1 - Volume 1
Random walks for text semantic similarity
TextGraphs-4 Proceedings of the 2009 Workshop on Graph-based Methods for Natural Language Processing
Sub-sentential paraphrasing by contextual pivot translation
TextInfer '09 Proceedings of the 2009 Workshop on Applied Textual Inference
Unsupervised induction of sentence compression rules
UCNLG+Sum '09 Proceedings of the 2009 Workshop on Language Generation and Summarisation
Sentence similarity measurement based on shallow parsing
FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 7
Paraphrase identification using machine learning techniques
ICNVS'10 Proceedings of the 12th international conference on Networking, VLSI and signal processing
Interlingual annotation of parallel text corpora: A new framework for annotation and evaluation
Natural Language Engineering
Discriminative learning over constrained latent representations
HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Tree edit models for recognizing textual entailments, paraphrases, and answers to questions
HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Bootstrapping semantic analyzers from non-contradictory texts
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Identification of Sentence-to-Sentence Relations Using a Textual Entailer
Research on Language and Computation
Text relatedness based on a word thesaurus
Journal of Artificial Intelligence Research
Cross-caption coreference resolution for automatic image understanding
CoNLL '10 Proceedings of the Fourteenth Conference on Computational Natural Language Learning
Learning the relative usefulness of questions in community QA
EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Paraphrase generation as monolingual translation: data and evaluation
INLG '10 Proceedings of the 6th International Natural Language Generation Conference
Paraphrase alignment for synonym evidence discovery
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Paraphrasing with search engine query logs
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
On the limits of sentence compression by deletion
Empirical methods in natural language generation
Using machine translation systems to expand a corpus in textual entailment
IceTAL'10 Proceedings of the 7th international conference on Advances in natural language processing
Using local alignments for relation recognition
Journal of Artificial Intelligence Research
A survey of paraphrasing and textual entailment methods
Journal of Artificial Intelligence Research
Generating phrasal and sentential paraphrases: A survey of data-driven methods
Computational Linguistics
SyMSS: A syntax-based measure for short-text semantic similarity
Data & Knowledge Engineering
Developing a corpus of plagiarised short answers
Language Resources and Evaluation
Collecting highly parallel data for paraphrase evaluation
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Extracting paraphrases from definition sentences on the web
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Monolingual alignment by edit rate computation on sentential paraphrase pairs
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
Paraphrase fragment extraction from monolingual comparable corpora
BUCC '11 Proceedings of the 4th Workshop on Building and Using Comparable Corpora: Comparable Corpora and the Web
Tensor Field Model for higher-order information retrieval
Journal of Systems and Software
Filtering and clustering relations for unsupervised information extraction in open domain
Proceedings of the 20th ACM international conference on Information and knowledge management
A novel approach to update summarization using evolutionary manifold-ranking and spectral clustering
Expert Systems with Applications: An International Journal
Paraphrase identification on the basis of supervised machine learning techniques
FinTAL'06 Proceedings of the 5th international conference on Advances in Natural Language Processing
Partial predicate argument structure matching for entailment determination
MLCW'05 Proceedings of the First international conference on Machine Learning Challenges: evaluating Predictive Uncertainty Visual Object Classification, and Recognizing Textual Entailment
VENSES – a linguistically-based system for semantic evaluation
MLCW'05 Proceedings of the First international conference on Machine Learning Challenges: evaluating Predictive Uncertainty Visual Object Classification, and Recognizing Textual Entailment
Mining paraphrases from self-anchored web sentence fragments
PKDD'05 Proceedings of the 9th European conference on Principles and Practice of Knowledge Discovery in Databases
Comparing phrase-based and syntax-based paraphrase generation
MTTG '11 Proceedings of the Workshop on Monolingual Text-To-Text Generation
SPARTE, a test suite for recognising textual entailment in spanish
CICLing'06 Proceedings of the 7th international conference on Computational Linguistics and Intelligent Text Processing
Finding instance names and alternative glosses on the web: wordnet reloaded
CICLing'05 Proceedings of the 6th international conference on Computational Linguistics and Intelligent Text Processing
WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
Data-driven response generation in social media
EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Linguistic redundancy in Twitter
EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Learning sentential paraphrases from bilingual parallel corpora for text-to-text generation
EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Aligning needles in a haystack: paraphrase acquisition across the web
IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
Recognising sentence similarity using similitude and dissimilarity features
International Journal of Advanced Intelligence Paradigms
Power-law distributions for paraphrases extracted from bilingual corpora
EACL '12 Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics
Validation of sub-sentential paraphrases acquired from parallel monolingual corpora
EACL '12 Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics
Re-examining machine translation metrics for paraphrase identification
NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
SemEval-2012 task 6: a pilot on semantic textual similarity
SemEval '12 Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation
UKP: computing semantic textual similarity by combining multiple content similarity measures
SemEval '12 Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation
ETS: discriminative edit models for paraphrase scoring
SemEval '12 Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation
A simple unsupervised latent semantics based approach for sentence similarity
SemEval '12 Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation
University_of_Sheffield: two approaches to semantic text similarity
SemEval '12 Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation
Proceedings of the Seventh Workshop on Building Educational Applications Using NLP
Modeling sentences in the latent space
ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
A comparison of vector-based representations for semantic composition
EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Enlarging paraphrase collections through generalization and instantiation
EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Generalizing sub-sentential paraphrase acquisition across original signal type of text pairs
EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Using discourse information for paraphrase extraction
EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Terminological paraphrase extraction from scientific literature based on predicate argument tuples
Journal of Information Science
The automatic retrieval of news entities based on the structure of a news cluster
Scientific and Technical Information Processing
Similarity measures based on latent dirichlet allocation
CICLing'13 Proceedings of the 14th international conference on Computational Linguistics and Intelligent Text Processing - Volume Part I
Multitechnique paraphrase alignment: A contribution to pinpointing sub-sentential paraphrases
ACM Transactions on Intelligent Systems and Technology (TIST) - Special Sections on Paraphrasing; Intelligent Systems for Socially Aware Computing; Social Computing, Behavioral-Cultural Modeling, and Prediction
Experiments with semantic similarity measures based on LDA and LSA
SLSP'13 Proceedings of the First international conference on Statistical Language and Speech Processing
Hi-index | 0.00 |
We investigate unsupervised techniques for acquiring monolingual sentence-level paraphrases from a corpus of temporally and topically clustered news articles collected from thousands of web-based news sources. Two techniques are employed: (1) simple string edit distance, and (2) a heuristic strategy that pairs initial (presumably summary) sentences from different news stories in the same cluster. We evaluate both datasets using a word alignment algorithm and a metric borrowed from machine translation. Results show that edit distance data is cleaner and more easily-aligned than the heuristic data, with an overall alignment error rate (AER) of 11.58% on a similarly-extracted test set. On test data extracted by the heuristic strategy, however, performance of the two training sets is similar, with AERs of 13.2% and 14.7% respectively. Analysis of 100 pairs of sentences from each set reveals that the edit distance data lacks many of the complex lexical and syntactic alternations that characterize monolingual paraphrase. The summary sentences, while less readily alignable, retain more of the non-trivial alternations that are of greatest interest learning paraphrase relationships.