Analysis of web page image tag distribution characteristics

Authors:
Isola Ajiferuke;Dietmar Wolfram
Affiliations:
Faculty of Information and Media Studies, University of Western Ontario, Middlesex College, London, Ont., Canada N6A 5B7;School of Information Studies, University of Wisconsin-Milwaukee, P.O. Box 413, Milwaukee, WI
Venue:
Information Processing and Management: an International Journal
Year:
2005

Abstract

The authors investigate the frequency distribution of image tag use in Web pages. Using data sampled from top-level Web pages across five top-level domains and from sample pages within individual websites, the authors model observed patterns in image tag usage by fitting the collected data distributions to theoretical models used in informetrics. The models tested are the modified power law (MPL), Mandelbrot (MDB), generalized Waring (GW), generalized inverse Gaussian-Poisson (GIGP), and generalized negative binomial (GNB) distributions. The GIGP provided the best fit for the data sets of top-level pages across the top-level domains tested. The models fit the observed data distributions from specific websites poorly because those data sets were multimodal; mixtures of the tested models provided better fits for them. The ability to model Web page attributes effectively, such as the distribution of the number of image tags used per page, is needed for accurate simulation models of Web page content, and makes it possible to estimate the number of requests needed to display the complete content of Web pages.
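The observed quantity described in the abstract, the number of pages containing a given number of image tags, can be tabulated with a short script. The following is a minimal sketch and not the authors' procedure: the class `ImgTagCounter`, the functions `count_img_tags` and `frequency_distribution`, and the sample pages are all hypothetical names and data introduced for illustration. It uses only the Python standard library and stops at the observed frequency table; fitting the MPL, MDB, GW, GIGP, or GNB models is out of scope here.

```python
from collections import Counter
from html.parser import HTMLParser


class ImgTagCounter(HTMLParser):
    """Counts <img> start tags in one HTML document (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.count = 0

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag names, so <IMG> is counted as well.
        # Self-closing <img .../> also reaches this method by default.
        if tag == "img":
            self.count += 1


def count_img_tags(html):
    """Return the number of image tags found in one page's HTML."""
    parser = ImgTagCounter()
    parser.feed(html)
    parser.close()
    return parser.count


def frequency_distribution(pages):
    """Map 'image tags per page' to 'number of pages with that count'."""
    return Counter(count_img_tags(html) for html in pages)


# Invented sample pages, for illustration only (not data from the study).
sample_pages = [
    "<html><body><p>No images</p></body></html>",
    '<html><body><img src="a.gif"><IMG SRC="b.gif"></body></html>',
    '<html><body><img src="logo.png"/></body></html>',
    '<html><body><img src="x.jpg"></body></html>',
]

dist = frequency_distribution(sample_pages)
for n_tags, n_pages in sorted(dist.items()):
    print(n_tags, n_pages)
```

A table of this shape (tags per page against page count) is the kind of observed distribution to which a candidate theoretical model would then be fitted.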