Integrating multiple internet directories by instance-based learning

Authors:
Ryutaro Ichise;Hiedeaki Takeda;Shinichi Honiden
Affiliations:
National Institute of Informatics, Chiyoda-ku, Tokyo, Japan;National Institute of Informatics, Chiyoda-ku, Tokyo, Japan;National Institute of Informatics, Chiyoda-ku, Tokyo, Japan
Venue:
IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Year:
2003

Citing 10
Cited 17

Elements of machine learning

Elements of machine learning
Siteseer: personalized navigation for the Web

Communications of the ACM
On integrating catalogs

Proceedings of the 10th international conference on World Wide Web
Learning to map between ontologies on the semantic web

Proceedings of the 11th international conference on World Wide Web
Machine Learning

Machine Learning
Hierarchically Classifying Documents Using Very Few Words

ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Building Hierarchical Classifiers Using Class Proximity

VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on Innovative Applications of Artificial Intelligence
FCA-MERGE: bottom-up merging of ontologies

IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 1
Discovery of shared topics networks among people: a simple approach to find community knowledge from WWW bookmarks

PRICAI'00 Proceedings of the 6th Pacific Rim international conference on Artificial intelligence

A semantic-based approach to interoperability of classification hierarchies: evaluation of linguistic techniques

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Using Google distance to weight approximate ontology matches

Proceedings of the 16th international conference on World Wide Web
Integrating Cross-Language Hierarchies and Its Application to Retrieving Relevant Documents

ACM Transactions on Asian Language Information Processing (TALIP)
Web taxonomy integration with hierarchical shrinkage algorithm and fine-grained relations

Expert Systems with Applications: An International Journal
Learning Concept Mappings from Instance Similarity

ISWC '08 Proceedings of the 7th International Conference on The Semantic Web
Deriving Concept Mappings through Instance Mappings

ASWC '08 Proceedings of the 3rd Asian Semantic Web Conference on The Semantic Web
Evaluation of Similarity Measures for Ontology Mapping

New Frontiers in Artificial Intelligence
A large dataset for the evaluation of ontology matching

The Knowledge Engineering Review
An empirical comparison of ontology matching techniques

Journal of Information Science
A maximum likelihood framework for integrating taxonomies

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
Instance-based matching of large life science ontologies

DILS'07 Proceedings of the 4th international conference on Data integration in the life sciences
Combining logic and probabilities for discovering mappings between taxonomies

KSEM'10 Proceedings of the 4th international conference on Knowledge science, engineering and management
Discovery of probabilistic mappings between taxonomies: principles and experiments

Journal on data semantics XV
A large scale taxonomy mapping evaluation

ISWC'05 Proceedings of the 4th international conference on The Semantic Web
Soundness of schema matching methods

ESWC'05 Proceedings of the Second European conference on The Semantic Web: research and Applications
Matching unstructured vocabularies using a background ontology

EKAW'06 Proceedings of the 15th international conference on Managing Knowledge in a World of Networks
Instance-based semantic interoperability in the cultural heritage

Semantic Web

Quantified Score

Hi-index	0.00

Visualization

Abstract

Finding desired information on the Internet is becoming increasingly difficult. Internet directories such as Yahoo!, which organize web pages into hierarchical categories, provide one solution to this problem; however, such directories are of limited use because some bias is applied both in the collection and categorization of pages. We propose a method for integrating multiple Internet directories by instance-based learning. Our method provides the mapping of categories in order to transfer documents from one directory to another, instead of simply merging two directories into one. We present herein an effective algorithm for determining similar categories between two directories via a statistical method called the k-statistic. In order to evaluate the proposed method, we conducted experiments using two actual Internet directories, Yahoo! and Google. The results show that the proposed method achieves extensive improvements relative to both the Naive Bayes and Enhanced Naive Bayes approaches, without any text analysis on documents.