Computing Aggregations from Linguistic Web Resources: A Case Study in Czech Republic Sector/Traffic Accidents

Authors:
Jan Dedek;Peter Vojtá
Affiliations:
-;-
Venue:
ADVCOMP '08 Proceedings of the 2008 The Second International Conference on Advanced Engineering Computing and Applications in Sciences
Year:
2008

Citing 0
Cited 2

Towards semantic annotation supported by dependency linguistics and ILP

ISWC'10 Proceedings of the 9th international semantic web conference on The semantic web - Volume Part II
Fuzzy ILP Classification of web reports after linguistic text mining

Information Processing and Management: an International Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

Semantic computing aims to connect the intention of humans with computational content. We present a study of a problem of this type: extract information from large number of similar linguistic web resources to compute various aggregations (sum, average,...). In our motivating example we calculate the sum of injured people in traffic accidents in a certain period in a certain region. We restrict ourselves to pages written in Czech language. Our solution exploits existing linguistic tools created originally for a syntactically annotated corpus, Prague Dependency Treebank (PDT 2.0). We propose a solutions which learns tree queries to extract data from PDT2.0 annotations and transforms the data in an ontology. This method is not limited to Czech language and can be used with any structured linguistic representation. We present a proof of concept of our method. This enables to compute various aggregations over linguistic web resources.