A model of web search with double checking is proposed to explore the web as a live corpus. Five association measures are presented: variants of Dice, Overlap Ratio, Jaccard, and Cosine, plus Co-Occurrence Double Check (CODC). In experiments on the Rubenstein-Goodenough benchmark data set, the CODC measure achieves a correlation coefficient of 0.8492, which is competitive with the 0.8914 achieved by the model using WordNet. Experiments on link detection of named entities, using the strategies of direct association, association matrix, and scalar association matrix, verify that the double-check frequencies are reliable. A further study of named entity clustering shows that all five measures are useful. In particular, the CODC measure is very stable across the word-word and name-name experiments. Applying the CODC measure to expand community chains for personal name disambiguation achieves increases of 9.65% and 14.22% over the system without community expansion. Taken together, the experiments show that the model of web search with double checking is feasible for mining associations from the web.
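For readers unfamiliar with the measures named above, the following is a minimal sketch of the textbook count-based forms of Dice, Overlap, Jaccard, and Cosine. It is not the paper's implementation: the abstract does not define the double-checking variants or CODC, so the function name `association_measures`, its parameters, and the hit counts are hypothetical and used only for illustration.

```python
import math


def association_measures(f_x, f_y, f_xy):
    """Textbook association scores from occurrence counts.

    f_x  -- number of pages (or snippets) mentioning term X
    f_y  -- number of pages (or snippets) mentioning term Y
    f_xy -- number mentioning both X and Y

    Assumption: these are the standard set-overlap definitions, not the
    double-check variants described in the abstract.
    """
    # With no evidence for either term or for the pair, report no association.
    if f_x <= 0 or f_y <= 0 or f_xy <= 0:
        return {"dice": 0.0, "overlap": 0.0, "jaccard": 0.0, "cosine": 0.0}
    return {
        "dice": 2 * f_xy / (f_x + f_y),
        "overlap": f_xy / min(f_x, f_y),
        "jaccard": f_xy / (f_x + f_y - f_xy),
        "cosine": f_xy / math.sqrt(f_x * f_y),
    }


# Made-up counts for illustration only; real counts would come from a search engine.
scores = association_measures(f_x=1000, f_y=400, f_xy=100)
for name, value in scores.items():
    print(f"{name}: {value:.4f}")
```

All four scores lie in [0, 1] when the joint count does not exceed either individual count, which makes them easy to compare against human similarity ratings via a correlation coefficient, as in the benchmark evaluation described above.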