SyMSS: A syntax-based measure for short-text semantic similarity

Authors:
Jesús Oliva;José Ignacio Serrano;María Dolores del Castillo;Ángel Iglesias
Affiliations:
-;-;-;-
Venue:
Data & Knowledge Engineering
Year:
2011

Citing 27
Cited 9

Natural language understanding (2nd ed.)

Natural language understanding (2nd ed.)
Contextual correlates of synonymy

Communications of the ACM
Using syntactic dependency as local context to resolve word sense ambiguity

ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Literature Extraction of Protein Functions Using Sentence Pattern Mining

IEEE Transactions on Knowledge and Data Engineering
Sentence Similarity Based on Semantic Nets and Corpus Statistics

IEEE Transactions on Knowledge and Data Engineering
Evaluating WordNet-based Measures of Lexical Semantic Relatedness

Computational Linguistics
Unsupervised construction of large paraphrase corpora: exploiting massively parallel news sources

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Semantic text similarity using corpus-based word similarity and string similarity

ACM Transactions on Knowledge Discovery from Data (TKDD)
The Evaluation of Sentence Similarity Measures

DaWaK '08 Proceedings of the 10th international conference on Data Warehousing and Knowledge Discovery
A new sentence similarity measure and sentence based extractive technique for automatic text summarization

Expert Systems with Applications: An International Journal
Psychiatric document retrieval using a discourse-aware model

Artificial Intelligence
Dependency-based syntactic-semantic analysis with PropBank and NomBank

CoNLL '08 Proceedings of the Twelfth Conference on Computational Natural Language Learning
Corpus-based and knowledge-based measures of text semantic similarity

AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Using information content to evaluate semantic similarity in a taxonomy

IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 1
A semantic similarity metric combining features and intrinsic information content

Data & Knowledge Engineering
A machine learning approach to textual entailment recognition

Natural Language Engineering
Extended gloss overlaps as a measure of semantic relatedness

IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Measuring the semantic similarity of texts

EMSEE '05 Proceedings of the ACL Workshop on Empirical Modeling of Semantic Equivalence and Entailment
Definition and analysis of intermediate entailment levels

EMSEE '05 Proceedings of the ACL Workshop on Empirical Modeling of Semantic Equivalence and Entailment
Experiments of UNED at the third recognising textual entailment challenge

RTE '07 Proceedings of the ACL-PASCAL Workshop on Textual Entailment and Paraphrasing
Techniques for improving web retrieval effectiveness

Information Processing and Management: an International Journal
Noun retrieval effect on text summarization and delivery of personalized news articles to the user's desktop

Data & Knowledge Engineering
A comparative study of two short text semantic similarity measures

KES-AMSTA'08 Proceedings of the 2nd KES International conference on Agent and multi-agent systems: technologies and applications
Editorial: An integration of WordNet and fuzzy association rule mining for multi-label document clustering

Data & Knowledge Engineering
The PASCAL recognising textual entailment challenge

MLCW'05 Proceedings of the First international conference on Machine Learning Challenges: evaluating Predictive Uncertainty Visual Object Classification, and Recognizing Textual Entailment
What syntax can contribute in the entailment task

MLCW'05 Proceedings of the First international conference on Machine Learning Challenges: evaluating Predictive Uncertainty Visual Object Classification, and Recognizing Textual Entailment
Textual entailment recognition based on dependency analysis and wordnet

MLCW'05 Proceedings of the First international conference on Machine Learning Challenges: evaluating Predictive Uncertainty Visual Object Classification, and Recognizing Textual Entailment

Editorial: Classifying text streams by keywords using classifier ensemble

Data & Knowledge Engineering
A semantically enhanced service repository for user-centric service discovery and management

Data & Knowledge Engineering
Online named entity recognition method for microtexts in social networking services: A case study of twitter

Expert Systems with Applications: An International Journal
From humor recognition to irony detection: The figurative language of social media

Data & Knowledge Engineering
Recognising sentence similarity using similitude and dissimilarity features

International Journal of Advanced Intelligence Paradigms
TakeLab: systems for measuring semantic text similarity

SemEval '12 Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation
A rule-based human interpretation system for semantic textual similarity task

SemEval '12 Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation
DERI&UPM: pushing corpus based relatedness to similarity: shared task system description

SemEval '12 Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation
A new case-based classification using incremental concept lattice knowledge

Data & Knowledge Engineering

Quantified Score

Hi-index	0.00

Visualization

Abstract

Sentence and short-text semantic similarity measures are becoming an important part of many natural language processing tasks, such as text summarization and conversational agents. This paper presents SyMSS, a new method for computing short-text and sentence semantic similarity. The method is based on the notion that the meaning of a sentence is made up of not only the meanings of its individual words, but also the structural way the words are combined. Thus, SyMSS captures and combines syntactic and semantic information to compute the semantic similarity of two sentences. Semantic information is obtained from a lexical database. Syntactic information is obtained through a deep parsing process that finds the phrases in each sentence. With this information, the proposed method measures the semantic similarity between concepts that play the same syntactic role. Psychological plausibility is added to the method by using previous findings about how humans weight different syntactic roles when computing semantic similarity. The results show that SyMSS outperforms state-of-the-art methods in terms of rank correlation with human intuition, thus proving the importance of syntactic information in sentence semantic similarity computation.