Communications of the ACM - Special issue on parallelism
Pictures of relevance: a geometric analysis of similarity measures
Journal of the American Society for Information Science
Word association norms, mutual information, and lexicography
Computational Linguistics
Elements of information theory
Elements of information theory
Translating collocations for bilingual lexicons: a statistical approach
Computational Linguistics
Similarity-based approaches to natural language processing
Similarity-based approaches to natural language processing
Similarity-Based Models of Word Cooccurrence Probabilities
Machine Learning - Special issue on natural language learning
Information Retrieval
Introduction to Modern Information Retrieval
Introduction to Modern Information Retrieval
An Information-Theoretic Definition of Similarity
ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Similarity-based word sense disambiguation
Computational Linguistics - Special issue on word sense disambiguation
A stochastic parts program and noun phrase parser for unrestricted text
ANLC '88 Proceedings of the second conference on Applied natural language processing
Using syntactic dependency as local context to resolve word sense ambiguity
ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Memory-based learning: using similarity for smoothing
ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Automatic retrieval and clustering of similar words
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 2
ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
Statistical sense disambiguation with relatively small corpora using dictionary definitions
ACL '95 Proceedings of the 33rd annual meeting on Association for Computational Linguistics
Noun classification from predicate-argument structures
ACL '90 Proceedings of the 28th annual meeting on Association for Computational Linguistics
Integrating multiple knowledge sources to disambiguate word sense: an exemplar-based approach
ACL '96 Proceedings of the 34th annual meeting on Association for Computational Linguistics
Smoothing of automatically generated selectional constraints
HLT '93 Proceedings of the workshop on Human Language Technology
Using information content to evaluate semantic similarity in a taxonomy
IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 1
Evaluating strategies for similarity search on the web
Proceedings of the 11th international conference on World Wide Web
Novelty and redundancy detection in adaptive filtering
SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Topic-based document segmentation with probabilistic latent semantic analysis
Proceedings of the eleventh international conference on Information and knowledge management
The disambiguation of nominalizations
Computational Linguistics
The link prediction problem for social networks
CIKM '03 Proceedings of the twelfth international conference on Information and knowledge management
Using the web to obtain frequencies for unseen bigrams
Computational Linguistics - Special issue on web as corpus
A classification approach to word prediction
NAACL 2000 Proceedings of the 1st North American chapter of the Association for Computational Linguistics conference
Using semantic preferences to identify verbal participation in role switching alternations
NAACL 2000 Proceedings of the 1st North American chapter of the Association for Computational Linguistics conference
Newsjunkie: providing personalized newsfeeds via analysis of information novelty
Proceedings of the 13th international conference on World Wide Web
A study of topic similarity measures
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Learning random walk models for inducing word dependency distributions
ICML '04 Proceedings of the twenty-first international conference on Machine learning
Distributional similarity models: clustering vs. nearest neighbors
ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
LSH forest: self-tuning indexes for similarity search
WWW '05 Proceedings of the 14th international conference on World Wide Web
Taxonomy learning: factoring the structure of a taxonomy into a semantic classification decision
COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Evaluating smoothing algorithms against plausibility judgements
ACL '01 Proceedings of the 39th Annual Meeting on Association for Computational Linguistics
Building semantic perceptron net for topic spotting
ACL '01 Proceedings of the 39th Annual Meeting on Association for Computational Linguistics
Constructing semantic space models from parsed corpora
ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
Dimension-reduced estimation of word co-occurrence probability
ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
Computational Linguistics
Co-occurrence Retrieval: A Flexible Framework for Lexical Distributional Similarity
Computational Linguistics
Using the web to overcome data sparseness
EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
COLING-02 proceedings of the 6th conference on Natural language learning - Volume 20
A general framework for distributional similarity
EMNLP '03 Proceedings of the 2003 conference on Empirical methods in natural language processing
Aligning word senses using bilingual corpora
ACM Transactions on Asian Language Information Processing (TALIP)
Evaluating WordNet-based Measures of Lexical Semantic Relatedness
Computational Linguistics
Automated extraction of Tree-Adjoining Grammars from treebanks
Natural Language Engineering
Learning question classifiers: the role of semantic information
Natural Language Engineering
An empirical study on language model adaptation
ACM Transactions on Asian Language Information Processing (TALIP)
Modeling spatially correlated data in sensor networks
ACM Transactions on Sensor Networks (TOSN)
Modelling the substitutability of discourse connectives
ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
A bootstrapping approach to unsupervised detection of cue phrase variants
ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Exploring distributional similarity based models for query spelling correction
ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Characterising measures of lexical distributional similarity
COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Domain-specific sense distributions and predominant sense acquisition
HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Online audio background determination for complex audio environments
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
The link-prediction problem for social networks
Journal of the American Society for Information Science and Technology
Proceedings of the 16th international conference on World Wide Web
On anonymizing query logs via token-based hashing
Proceedings of the 16th international conference on World Wide Web
Dependency-Based Construction of Semantic Space Models
Computational Linguistics
N semantic classes are harder than two
COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
Use of ranked cross document evidence trails for hypothesis generation
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Relational visual cluster validity (RVCV)
Pattern Recognition Letters
Weakly-supervised discovery of named entities using web search queries
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Finding translations for low-frequency words in comparable corpora
Machine Translation
A context-sensitive framework for lexical ontologies
The Knowledge Engineering Review
Applications of corpus-based semantic similarity and word segmentation to database schema matching
The VLDB Journal — The International Journal on Very Large Data Bases
Methods for extracting and classifying pairs of cognates and false friends
Machine Translation
PDE4Java: Plagiarism Detection Engine for Java source code: a clustering approach
International Journal of Business Intelligence and Data Mining
Bootstrapping Information Extraction from Semi-structured Web Pages
ECML PKDD '08 Proceedings of the 2008 European Conference on Machine Learning and Knowledge Discovery in Databases - Part I
Word sense disambiguation: A survey
ACM Computing Surveys (CSUR)
Learning semantic relatedness from term discrimination information
Expert Systems with Applications: An International Journal
Combining named entities and tags for novel sentence detection
Proceedings of the WSDM '09 Workshop on Exploiting Semantic Annotations in Information Retrieval
Low-Cost Supervision for Multiple-Source Attribute Extraction
CICLing '09 Proceedings of the 10th International Conference on Computational Linguistics and Intelligent Text Processing
Adaptive language modeling for word prediction
HLT-SRWS '08 Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Student Research Workshop
Learning Process Behavior with EDY: an Experimental Analysis
Proceedings of the 2008 conference on STAIRS 2008: Proceedings of the Fourth Starting AI Researchers' Symposium
Word Sense Induction Using Graphs of Collocations
Proceedings of the 2008 conference on ECAI 2008: 18th European Conference on Artificial Intelligence
Named entity recognition in biomedical texts using an HMM model
JNLPBA '04 Proceedings of the International Joint Workshop on Natural Language Processing in Biomedicine and its Applications
COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Using hidden Markov random fields to combine distributional and pattern-based word clustering
COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
A local alignment kernel in the context of NLP
COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Semantic classification with distributional kernels
COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Metric learning for synonym acquisition
COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics
A discriminative candidate generator for string transformations
EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Bootstrapping distributional feature vector quality
Computational Linguistics
Deriving a large scale taxonomy from Wikipedia
AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 2
SemEval-2007 task 10: English lexical substitution task
SemEval '07 Proceedings of the 4th International Workshop on Semantic Evaluations
Learning concept hierarchies from text corpora using formal concept analysis
Journal of Artificial Intelligence Research
Knowledge derived from wikipedia for computing semantic relatedness
Journal of Artificial Intelligence Research
Unsupervised methods for determining object and relation synonyms on the web
Journal of Artificial Intelligence Research
Wikipedia-based semantic interpretation for natural language processing
Journal of Artificial Intelligence Research
Computing semantic relatedness using Wikipedia-based explicit semantic analysis
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Graph-based clustering for semantic classification of onomatopoetic words
TextGraphs-3 Proceedings of the 3rd Textgraphs Workshop on Graph-Based Algorithms for Natural Language Processing
The distributional similarity of sub-parses
EMSEE '05 Proceedings of the ACL Workshop on Empirical Modeling of Semantic Equivalence and Entailment
UMSLLS '09 Proceedings of the Workshop on Unsupervised and Minimally Supervised Learning of Lexical Semantics
Comparison of similarity models for the relation discovery task
LD '06 Proceedings of the Workshop on Linguistic Distances
Detecting compositionality in multi-word expressions
ACLShort '09 Proceedings of the ACL-IJCNLP 2009 Conference Short Papers
SMS-Watchdog: Profiling Social Behaviors of SMS Users for Anomaly Detection
RAID '09 Proceedings of the 12th International Symposium on Recent Advances in Intrusion Detection
Dependency Language Modeling Using KNN and PLSI
MICAI '09 Proceedings of the 8th Mexican International Conference on Artificial Intelligence
Learning Co-relations of Plausible Verb Arguments with a WSM and a Distributional Thesaurus
CIARP '09 Proceedings of the 14th Iberoamerican Conference on Pattern Recognition: Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications
Hypernym discovery based on distributional similarity and hierarchical structures
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
Web-scale distributional similarity and entity set expansion
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
A study of convolution tree kernel with local alignment
GEMS '09 Proceedings of the Workshop on Geometrical Models of Natural Language Semantics
Using distributional similarity to identify individual verb choice
INLG '06 Proceedings of the Fourth International Natural Language Generation Conference
New experiments in distributional representations of synonymy
CONLL '05 Proceedings of the Ninth Conference on Computational Natural Language Learning
Classifying Japanese polysemous verbs based on fuzzy C-means clustering
TextGraphs-4 Proceedings of the 2009 Workshop on Graph-based Methods for Natural Language Processing
A comparison of co-occurrence and similarity measures as simulations of context
CICLing'08 Proceedings of the 9th international conference on Computational linguistics and intelligent text processing
Experiments on extracting semantic relations from syntactic relations
CICLing'03 Proceedings of the 4th international conference on Computational linguistics and intelligent text processing
Creating ontologies for content representation: the OntoSeed suite
Journal on data semantics IX
ERACER: a database approach for statistical inference and data cleaning
Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
Master defect record retrieval using network-based feature association
IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
Acquisition of instance attributes via labeled and related instances
Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Estimating interference in the QPRP for subtopic retrieval
Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
PCIR: Combining DHTs and peer clusters for efficient full-text P2P indexing
Computer Networks: The International Journal of Computer and Telecommunications Networking
Multi-prototype vector-space models of word meaning
HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Cross-lingual induction of selectional preferences with bilingual vector spaces
HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Distributional similarity vs. PU learning for entity set expansion
ACLShort '10 Proceedings of the ACL 2010 Conference Short Papers
From frequency to meaning: vector space models of semantics
Journal of Artificial Intelligence Research
Language pyramid and multi-scale text analysis
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Automatic creation of a technical trend map from research papers and patents
PaIR '10 Proceedings of the 3rd international workshop on Patent information retrieval
Grouping product features using semi-supervised learning with soft-constraints
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Using local alignments for relation recognition
Journal of Artificial Intelligence Research
Directional distributional similarity for lexical inference
Natural Language Engineering
Estimation of quality of service in spelling correction using Kullback-Leibler divergence
Expert Systems with Applications: An International Journal
Clustering product features for opinion mining
Proceedings of the fourth ACM international conference on Web search and data mining
Automatic image semantic interpretation using social action and tagging data
Multimedia Tools and Applications
A flexible, corpus-driven model of regular and inverse selectional preferences
Computational Linguistics
Web Semantics: Science, Services and Agents on the World Wide Web
A word at a time: computing word relatedness using temporal semantic analysis
Proceedings of the 20th international conference on World wide web
A hybrid approach for learning concept hierarchy from Malay text using artificial immune network
Natural Computing: an international journal
Polysemous verb classification using subcategorization acquisition and graph-based clustering
LTC'09 Proceedings of the 4th conference on Human language technology: challenges for computer science and linguistics
Automatic evaluation of texts by using paraphrases
LTC'09 Proceedings of the 4th conference on Human language technology: challenges for computer science and linguistics
Synthesizing products for online catalogs
Proceedings of the VLDB Endowment
Taxonomy induction based on a collaboratively built knowledge repository
Artificial Intelligence
Entity set expansion in opinion documents
Proceedings of the 22nd ACM conference on Hypertext and hypermedia
Learning from collective human behavior to introduce diversity in lexical choice
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Semantic relations in bilingual lexicons
ACM Transactions on Speech and Language Processing (TSLP)
Proceedings of the 20th ACM international conference on Information and knowledge management
Expertise ranking using activity and contextual link measures
Data & Knowledge Engineering
Automatic construction of a bilingual thesaurus using citation analysis
Proceedings of the 4th workshop on Patent information retrieval
Stochastic modelling of scientific terms distribution in publications
MKM'06 Proceedings of the 5th international conference on Mathematical Knowledge Management
Matching peptide sequences with mass spectra
IDEAL'05 Proceedings of the 6th international conference on Intelligent Data Engineering and Automated Learning
Making senses: bootstrapping sense-tagged lists of semantically-related words
CICLing'06 Proceedings of the 7th international conference on Computational Linguistics and Intelligent Text Processing
A context-theoretic framework for compositionality in distributional semantics
Computational Linguistics
Text2Onto: a framework for ontology learning and data-driven change discovery
NLDB'05 Proceedings of the 10th international conference on Natural Language Processing and Information Systems
Sentence role identification in medline abstracts: training classifier with structured abstracts
AM'03 Proceedings of the Second international conference on Active Mining
An empirical study on language model adaptation using a metric of domain similarity
IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
NLDB'09 Proceedings of the 14th international conference on Applications of Natural Language to Information Systems
Exploiting Wikipedia for cross-lingual and multilingual information retrieval
Data & Knowledge Engineering
Mining the web for the "voice of the herd" to track stock market bubbles
IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
Large-scale learning of word relatedness with constraints
Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
Hybrid Matching Algorithm for Personal Names
Journal of Data and Information Quality (JDIQ)
Fundamenta Informaticae - Emergent Computing
Regular polysemy: a distributional model
SemEval '12 Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation
SemEval-2012 task 4: evaluating Chinese word similarity
SemEval '12 Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation
Soft cardinality + ML: learning adaptive similarity functions for cross-lingual textual entailment
SemEval '12 Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation
A generalised hybrid architecture for NLP
HYBRID '12 Proceedings of the Workshop on Innovative Hybrid Approaches to the Processing of Textual Data
Modeling covert event retrieval in logical metonymy: probabilistic and distributional accounts
CMCL '12 Proceedings of the 3rd Workshop on Cognitive Modeling and Computational Linguistics
Extracting signed social networks from text
TextGraphs-7 '12 Workshop Proceedings of TextGraphs-7 on Graph-based Methods for Natural Language Processing
Transforming graph data for statistical relational learning
Journal of Artificial Intelligence Research
Semantically enhanced text stemmer (SETS) for cross-domain document clustering
KES'12 Proceedings of the 16th international conference on Knowledge Engineering, Machine Learning and Lattice Computing with Applications
Diffusion of innovations revisited: from social network to innovation network
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Enhanced cross-domain document clustering with a semantically enhanced text stemmer SETS
International Journal of Knowledge-based and Intelligent Engineering Systems - Selected papers of KES2012-Part 2 of 2
Hi-index | 0.00 |
We study distributional similarity measures for the purpose of improving probability estimation for unseen cooccurrences. Our contributions are three-fold: an empirical comparison of a broad range of measures; a classification of similarity functions based on the information that they incorporate; and the introduction of a novel function that is superior at evaluating potential proxy distributions.