Conceptualizing documentation on the web: an evaluation of different heuristic-based models for counting links between university web sites

Authors:
Mike Thelwall
Affiliations:
School of Computing and Information Technology, University of Wolverhampton, Wulfruna Street, Wolverhampton WV1 1SB, UK
Venue:
Journal of the American Society for Information Science and Technology
Year:
2002

Abstract

All known previous Web link studies have used the Web page as the primary indivisible source document for counting purposes. Arguments are presented to explain why this is not necessarily optimal and why alternatives have the potential to produce better results, even though individual Web files are the easiest basic Web unit to identify and are often the only choice when search engines supply the raw data. The central issue is that of defining the Web "document": the single indissoluble unit of coherent material. Three alternative heuristics are defined for the educational arena, based upon the directory, the domain, and the whole university site. These are compared by implementing them on a set of 108 UK university institutional Web sites, under the assumption that a more effective heuristic will tend to produce results that correlate more highly with institutional research productivity. The domain and directory models successfully reduced the impact of anomalous linking behavior between pairs of Web sites, with the directory model being the method of choice. Reasons are then given as to why a document model on its own cannot eliminate all anomalies in Web linking behavior. Finally, the results from all models clearly confirm the very strong association between the research productivity of a UK university and the number of incoming links from its peers' Web sites.
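
The difference between the page, directory, domain, and site heuristics can be illustrated with a minimal Python sketch. It is not the paper's implementation: the abstract does not give the exact counting rules, so the sketch assumes that a link is counted once per distinct (source document, target document) pair, that links within one university are ignored, and that a university site is identified by the last three labels of its host name. The function names, `EXAMPLE_LINKS`, and the `alpha.ac.uk` and `beta.ac.uk` hosts are hypothetical and exist only for illustration.

```python
from urllib.parse import urlsplit


def document_unit(url, model):
    """Map a URL to the 'document' it belongs to under a given model."""
    parts = urlsplit(url)
    host = parts.netloc.lower()
    if model == "page":
        # Each individual Web file is its own document.
        return host + parts.path
    if model == "directory":
        # All files in the same directory form one document.
        return host + parts.path.rsplit("/", 1)[0] + "/"
    if model == "domain":
        # All files under the same host name form one document.
        return host
    if model == "site":
        # Assumption: the whole university is the last three host labels,
        # e.g. "www.scit.alpha.ac.uk" -> "alpha.ac.uk".
        return ".".join(host.split(".")[-3:])
    raise ValueError(f"unknown model: {model}")


def count_links(links, model):
    """Count distinct (source document, target document) pairs.

    Links whose source and target fall in the same university site are
    skipped, since only inter-site links are of interest here.
    """
    pairs = set()
    for source, target in links:
        if document_unit(source, "site") == document_unit(target, "site"):
            continue
        pairs.add((document_unit(source, model), document_unit(target, model)))
    return len(pairs)


# Hypothetical data: three links from one personal directory, one from elsewhere.
EXAMPLE_LINKS = [
    ("http://www.scit.alpha.ac.uk/~u/a.html", "http://www.beta.ac.uk/research/x.html"),
    ("http://www.scit.alpha.ac.uk/~u/b.html", "http://www.beta.ac.uk/research/x.html"),
    ("http://www.scit.alpha.ac.uk/~u/b.html", "http://www.beta.ac.uk/research/y.html"),
    ("http://www.alpha.ac.uk/lib/index.html", "http://www.beta.ac.uk/"),
]

if __name__ == "__main__":
    for model in ("page", "directory", "domain", "site"):
        print(model, count_links(EXAMPLE_LINKS, model))
```

In this made-up data, three of the four page-level links come from a single personal directory. Under the directory model they collapse into one link, which is the kind of damping of anomalous linking between a pair of sites that the abstract describes.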