In search of the ur-Wikipedia: universality, similarity, and translation in the Wikipedia inter-language link network

  • Authors:
  • Morten Warncke-Wang;Anuradha Uduwage;Zhenhua Dong;John Riedl

  • Affiliations:
  • University of Minnesota, Minneapolis, Minnesota;University of Minnesota, Minneapolis, Minnesota;Nankai University, Tianjin, China;University of Minnesota, Minneapolis, Minnesota

  • Venue:
  • Proceedings of the Eighth Annual International Symposium on Wikis and Open Collaboration
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Wikipedia has become one of the primary encyclopaedic information repositories on the World Wide Web. It started in 2001 with a single edition in the English language and has since expanded to more than 20 million articles in 283 languages. Criss-crossing between the Wikipedias is an inter-language link network, connecting the articles of one edition of Wikipedia to another. We describe characteristics of articles covered by nearly all Wikipedias and those covered by only a single language edition, we use the network to understand how we can judge the similarity between Wikipedias based on concept coverage, and we investigate the flow of translation between a selection of the larger Wikipedias. Our findings indicate that the relationships between Wikipedia editions follow Tobler's first law of geography: similarity decreases with increasing distance. The number of articles in a Wikipedia edition is found to be the strongest predictor of similarity, while language similarity also appears to have an influence. The English Wikipedia edition is by far the primary source of translations. We discuss the impact of these results for Wikipedia as well as user-generated content communities in general.