Databases on the web: national web domain survey

Authors: Denis Shestakov
Affiliations: Aalto University, Konemiehentie, Espoo, Finland
Venue: Proceedings of the 15th Symposium on International Database Engineering & Applications
Year: 2011

Abstract

The deep Web, the part of the Web made up of pages populated with information from a myriad of online databases, remains relatively unexplored. Even its basic characteristics, such as the number of searchable databases on the Web, are disputed. In this paper, we address the problem of accurately estimating the size of the deep Web by sampling one national web domain. We report some of the results obtained from surveying the Russian Web. The survey findings, namely the size estimates of the deep Web, could be useful for further studies on handling deep Web data.
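
As a rough illustration of how a sample-based size estimate of this kind can work, the sketch below scales the number of databases found in a simple random sample of hosts up to a whole domain. The abstract does not specify the estimator used in the paper, so this is a textbook simple-random-sampling estimator offered only as an assumption; the function name `estimate_total` and the numbers in the usage example are hypothetical and are not survey data.

```python
import math


def estimate_total(population_size, sample_counts, z=1.96):
    """Estimate the total number of databases in a domain.

    Assumes (hypothetically) that `sample_counts` holds the number of
    searchable databases found on each host of a simple random sample
    drawn without replacement from `population_size` hosts.
    Returns (estimate, lower bound, upper bound) for a normal-approximation
    confidence interval with critical value `z`.
    """
    n = len(sample_counts)
    if n < 2 or n > population_size:
        raise ValueError("need 2 <= sample size <= population size")

    mean = sum(sample_counts) / n
    # Unbiased sample variance of the per-host counts.
    variance = sum((c - mean) ** 2 for c in sample_counts) / (n - 1)
    # Finite population correction for sampling without replacement.
    fpc = 1 - n / population_size
    std_error = population_size * math.sqrt(fpc * variance / n)

    total = population_size * mean
    # A count cannot be negative, so clamp the lower bound at zero.
    return total, max(0.0, total - z * std_error), total + z * std_error


# Hypothetical usage: 10 sampled hosts out of 1000, made-up counts.
total, low, high = estimate_total(1000, [0, 0, 1, 0, 2, 0, 0, 1, 0, 0])
print(f"estimated databases: {total:.0f} (interval {low:.0f} to {high:.0f})")
```

With so few sampled hosts the interval is very wide, which is one reason a survey of this kind needs a large sample before its size estimates become informative.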