The deep Web, the part of the Web consisting of pages populated with information from myriads of online databases, remains relatively unexplored. Even basic characteristics, such as the number of searchable databases on the Web, are disputed. In this paper, we address the problem of accurately estimating the size of the deep Web by sampling one national web domain. We report some of the results obtained from surveying the Russian Web. The survey findings, namely the size estimates of the deep Web, could be useful for further studies on handling deep Web data.