Migrating data-intensive web sites into the Semantic Web
Proceedings of the 2002 ACM symposium on Applied computing
Exploiting Structure for Intelligent Web Search
HICSS '01 Proceedings of the 34th Annual Hawaii International Conference on System Sciences ( HICSS-34)-Volume 4 - Volume 4
Term Weighting Approaches in Automatic Text Retrieval
Term Weighting Approaches in Automatic Text Retrieval
Ontology-focused crawling of Web documents
Proceedings of the 2003 ACM symposium on Applied computing
Web-scale information extraction in knowitall: (preliminary results)
Proceedings of the 13th international conference on World Wide Web
Automatic acquisition of hyponyms from large text corpora
COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 2
The state of the art in ontology learning: a framework for comparison
The Knowledge Engineering Review
Automatic construction of a hypernym-labeled noun hierarchy from text
ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
ACM SIGKDD Explorations Newsletter
A clustering method based on path similarities of XML data
Data & Knowledge Engineering
A methodology for clustering XML documents by structure
Information Systems
Empirical merging of ontologies: a proposal of universal uncertainty representation framework
ESWC'06 Proceedings of the 3rd European conference on The Semantic Web: research and applications
Discovering semantic sibling associations from web documents with XTREEM-SP
DaWaK'06 Proceedings of the 8th international conference on Data Warehousing and Knowledge Discovery
Finding instance names and alternative glosses on the web: wordnet reloaded
CICLing'05 Proceedings of the 6th international conference on Computational Linguistics and Intelligent Text Processing
Discovering semantic sibling groups from web documents with XTREEM-SG
EKAW'06 Proceedings of the 15th international conference on Managing Knowledge in a World of Networks
Designing and evaluating patterns for ontology enrichment from texts
EKAW'06 Proceedings of the 15th international conference on Managing Knowledge in a World of Networks
Discovering multi terms and co-hyponymy from XHTML documents with XTREEM
KDXD'06 Proceedings of the First international conference on Knowledge Discovery from XML Documents
Learning of semantic sibling group hierarchies - K-means vs. bi-secting-K-means
DaWaK'07 Proceedings of the 9th international conference on Data Warehousing and Knowledge Discovery
Domain relevance on term weighting
NLDB'07 Proceedings of the 12th international conference on Applications of Natural Language to Information Systems
Knowledge engineering within the application-independent architecture SEASALT
International Journal of Knowledge Engineering and Data Mining
Hi-index | 0.00 |
The acquisition of explicit semantics is still a research challenge. Approaches for the extraction of semantics focus mostly on learning subordination relations. The extraction of coordination relations, also called "sibling relations" is studied much less, though they are not less important in ontology engineering. We describe and evaluate the XTREEM-SG approach on finding sibling semantics from semi-structured Web documents. XTREEM-SG stands for "Xhtml TREE Mining - for Sibling Groups". It uses the XHTML-markup that is available in Web content to group together terms that are in a sibling relation to each other. Our approach has the advantage that it is domain and language independent; it does not rely on background knowledge, NLP software nor training. We evaluate XTREEM-SG towards two gold standard ontologies. We investigate how variations on input, parameters and gold standard influence the obtained results on structuring a closed vocabulary into semantic sibling groups. Earlier methods that evaluate sibling relations against a gold standard report a 14.18% F-measure on average sibling overlap. Our method improves this number into 22.93%.