Acquisition of Soft Taxonomies for Intelligent Personal Hierarchies and the Soft Semantic Web

Authors:
T. P. Martin;B. Azvine
Affiliations:
-;-
Venue:
BT Technology Journal
Year:
2003

Citing 20
Cited 6

Principles of database and knowledge-base systems, Vol. I

Principles of database and knowledge-base systems, Vol. I
Fuzzy information and database systems

Fuzzy Sets and Systems - On fuzzy information and database systems
Mass assignments and fuzzy sets for fuzzy databases

Advances in the Dempster-Shafer theory of evidence
CYC: a large-scale investment in knowledge infrastructure

Communications of the ACM
WordNet: a lexical database for English

Communications of the ACM
Fuzzy queries in multimedia database systems

PODS '98 Proceedings of the seventeenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
Combining fuzzy information from multiple systems

Journal of Computer and System Sciences
SEMINT: a tool for identifying attribute correspondences in heterogeneous databases using neural networks

Data & Knowledge Engineering
Reconciling schemas of disparate data sources: a machine-learning approach

SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Information Retrieval

Information Retrieval
Modern Information Retrieval

Modern Information Retrieval
Combining fuzzy information: an overview

ACM SIGMOD Record
Introduction to Modern Information Retrieval

Introduction to Modern Information Retrieval
Enhancing E-Communities with Agent-Based Systems

Computer
Boolean Query Mapping Across Heterogeneous Information Sources

IEEE Transactions on Knowledge and Data Engineering
Generic Schema Matching with Cupid

Proceedings of the 27th International Conference on Very Large Data Bases
Approximate query mapping: Accounting for translation closeness

The VLDB Journal — The International Journal on Very Large Data Bases
A survey of approaches to automatic schema matching

The VLDB Journal — The International Journal on Very Large Data Bases
Representing and reasoning about mappings between domain models

Eighteenth national conference on Artificial intelligence
Theory of Relational Databases

Theory of Relational Databases

Granular Association Rules for Multiple Taxonomies: A Mass Assignment Approach

Uncertainty Reasoning for the Semantic Web I
Internet search: Subdivision-based interactive query expansion and the soft semantic web

Applied Soft Computing
mTRACK: monitoring time-varying relations in approximately

CISDA'09 Proceedings of the Second IEEE international conference on Computational intelligence for security and defense applications
A framework for use of imprecise categorization in developing intelligent systems

IEEE Transactions on Fuzzy Systems
Adding a peer-to-peer trust layer to metadata generators

OTM'05 Proceedings of the 2005 OTM Confederated international conference on On the Move to Meaningful Internet Systems
Viewpoints on emergent semantics

Journal on Data Semantics VI

Quantified Score

Hi-index	0.00

Visualization

Abstract

Information overload is a problem at an individual and a corporate level. Many solutions have been proposed, including knowledge management, data warehouses, service directories and digital libraries. The semantic Web aims to unify many of these approaches by appropriate markup and agreement on the meaning of the markup. At the individual's level, these techniques partially solve the problem by classifying documents within hierarchical structures and enabling searching and browsing of the documents. However, they also contribute to the problem as there is no unique categorisation and access structure that suits every individual. Finding the right document becomes a two-stage process — first find the right place in the categorisation scheme, then find the document within that class.In addition to enterprise-wide sources, individual information sources include e-mails, electronic documents in many formats, personal and group filespaces, notes, diary entries, etc. These are unlikely to conform to the enterprise categorisation but form useful resources nevertheless.The idea of an intelligent personal hierarchy for information (iPHI) is to auto-configure access to multiple sources of information based on personal categories. This entails fuzzy matching of meta-data structure as well as content. Metadata is a powerful tool in intelligent information management; however, it is not necessarily uniform, either in label or in content. One document's ‘author’ is another's ‘creator’; ‘John Smith’, ‘Smith, John’ and ‘J.Smith’ all refer to the same individual but are syntactically different.Fusion (or intelligent integration) of information takes place in an environment where the data may be of varying quality, and some may be incomplete or inconsistent. Combining metadata (and the associated data) is not possible without knowing (or learning) the mappings between their ontologies. Such mappings are likely to be soft, i.e. approximate — different sources arise from different designers with different world views. Soft computing is vital to tackle these problems. Frequently, data sources are organised implicitly, according to an internal ontology or taxonomy. Knowing this ontology or taxonomy is a necessary first step to using it in the fusion process. The work described in this paper extracts the implicit taxonomy and enables a user's interaction with the data (e.g. searching) to be expressed in their preferred terms rather than those used by the system.