A web-based novel term similarity framework for ontology learning

Authors:
Seokkyung Chung;Jongeun Jun;Dennis McLeod
Affiliations:
Yahoo! Inc., Santa Clara, CA;Department of Computer Science, University of Southern California, Los Angeles, CA;Department of Computer Science, University of Southern California, Los Angeles, CA
Venue:
ODBASE'06/OTM'06 Proceedings of the 2006 Confederated international conference on On the Move to Meaningful Internet Systems: CoopIS, DOA, GADA, and ODBASE - Volume Part I
Year:
2006

Citing 20
Cited 2

Cyc: toward programs with common sense

Communications of the ACM
The anatomy of a large-scale hypertextual Web search engine

WWW7 Proceedings of the seventh international conference on World Wide Web 7
Deriving concept hierarchies from text

Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Authoritative sources in a hyperlinked environment

Proceedings of the ninth annual ACM-SIAM symposium on Discrete algorithms
Introduction to Modern Information Retrieval

Introduction to Modern Information Retrieval
Inferring hierarchical descriptions

Proceedings of the eleventh international conference on Information and knowledge management
Ontology Learning for the Semantic Web

IEEE Intelligent Systems
Creating Semantic Web Contents with Protégé-2000

IEEE Intelligent Systems
An Information-Theoretic Definition of Similarity

ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
OntoEdit: Collaborative Ontology Development for the Semantic Web

ISWC '02 Proceedings of the First International Semantic Web Conference on The Semantic Web
Text Mining Techniques to Automatically Enrich a Domain Ontology

Applied Intelligence
Retrieval effectiveness of an ontology-based model for information selection

The VLDB Journal — The International Journal on Very Large Data Bases
Distributional clustering of English words

ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
Similarity-based estimation of word cooccurrence probabilities

ACL '94 Proceedings of the 32nd annual meeting on Association for Computational Linguistics
Taxonomy-driven computation of product recommendations

Proceedings of the thirteenth ACM international conference on Information and knowledge management
Corpus-Based Schema Matching

ICDE '05 Proceedings of the 21st International Conference on Data Engineering
WordNet::Similarity: measuring the relatedness of concepts

HLT-NAACL--Demonstrations '04 Demonstration Papers at HLT-NAACL 2004
Subspace clustering of microarray data based on domain transformation

VDMB'06 Proceedings of the First international conference on Data Mining and Bioinformatics
Lexically evaluating ontology triples generated automatically from texts

ESWC'05 Proceedings of the Second European conference on The Semantic Web: research and Applications
Dynamic pattern mining: an incremental data clustering approach

Journal on Data Semantics II

Efficient concept clustering for ontology learning using an event life cycle on the web

Proceedings of the 2008 ACM symposium on Applied computing
An efficient ontology-based expert peering system

GbRPR'07 Proceedings of the 6th IAPR-TC-15 international conference on Graph-based representations in pattern recognition

Quantified Score

Hi-index	0.00

Visualization

Abstract

Given that pairwise similarity computations are essential in ontology learning and data mining, we propose a similarity framework that is based on a conventional Web search engine There are two main aspects that we can benefit from utilizing a Web search engine First, we can obtain the freshest content for each term that represents the up-to-date knowledge on the term This is particularly useful for dynamic ontology management in that ontologies must evolve with time as new concepts or terms appear Second, in comparison with the approaches that use the certain amount of crawled Web documents as corpus, our method is less sensitive to the problem of data sparseness because we access as much content as possible using a search engine At the core of our proposed methodology, we present two different measures for similarity computation, a mutual information based and a feature-based metric Moreover, we show how the proposed metrics can be utilized for modifying existing ontologies Finally, we compare the extracted similarity relations with semantic similarity using WordNet Experimental results show that our method can extract topical relations between terms that are not present in conventional concept-based ontologies.