Mining Context-Specific Web Knowledge: An Experimental Dictionary-Based Approach

Authors:
Vincenzo Lecce;Marco Calabrese;Domenico Soldo
Affiliations:
Polytechnic of Bari --- II Faculty of Engineering - DIASS Polytechnic of Bari,;Polytechnic of Bari --- II Faculty of Engineering - DIASS Polytechnic of Bari,;Polytechnic of Bari --- II Faculty of Engineering - DIASS Polytechnic of Bari,
Venue:
ICIC '08 Proceedings of the 4th international conference on Intelligent Computing: Advanced Intelligent Computing Theories and Applications - with Aspects of Artificial Intelligence
Year:
2008

Citing 6
Cited 0

The anatomy of a large-scale hypertextual Web search engine

WWW7 Proceedings of the seventh international conference on World Wide Web 7
Authoritative sources in a hyperlinked environment

Proceedings of the ninth annual ACM-SIAM symposium on Discrete algorithms
Web mining research: a survey

ACM SIGKDD Explorations Newsletter
Mining the Web's Link Structure

Computer
Learning concept hierarchies from text corpora using formal concept analysis

Journal of Artificial Intelligence Research
On how to perform a gold standard based evaluation of ontology learning

ISWC'06 Proceedings of the 5th international conference on The Semantic Web

Quantified Score

Hi-index	0.00

Visualization

Abstract

This work presents an experimental semantic approach for mining knowledge from the World Wide Web (WWW). The main goal is to build a context-specific knowledge base from web documents. The basic idea is to use a reference knowledge provided by a dictionary as the indexing structure of domain-specific computed knowledge instances organised in the form of interlinked text words. The WordNet lexical database has been used as reference knowledge for the English web documents. Both the reference and the computed knowledge are actually conceived as word graphs. Graph is considered here as a powerful way to represent structured knowledge. This assumption has many consequences on the way knowledge can be explored and similar knowledge patterns can be identified. In order to identify context-specific elements in knowledge graphs, the novel semantic concept of "minutia" has been introduced. A preliminary evaluation of the efficacy of the proposed approach has been carried out. A fair comparison strategy with other non-semantic competing approaches is currently under investigation.