Geography of social ontologies: Testing a variant of the Sapir-Whorf Hypothesis in the context of Wikipedia

  • Authors:
  • Alexander Mehler;Olga Pustylnikov;Nils Diewald

  • Affiliations:
  • Text Technology, Faculty of Technology, Bielefeld University, Universitätsstraíe 25, D-33615 Bielefeld, Germany;Text Technology, Faculty of Technology, Bielefeld University, Universitätsstraíe 25, D-33615 Bielefeld, Germany;Text Technology, Faculty of Technology, Bielefeld University, Universitätsstraíe 25, D-33615 Bielefeld, Germany

  • Venue:
  • Computer Speech and Language
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Abstract

In this article, we test a variant of the Sapir-Whorf Hypothesis in the area of complex network theory. We do so by analyzing social ontologies as a new resource for automatic language classification. Our method explores only structural features of social ontologies in order to predict family resemblances among the languages that the corresponding communities use to build these ontologies. This approach is based on a reformulation of the Sapir-Whorf Hypothesis in terms of distributed cognition. Starting from a corpus of 160 Wikipedia-based social ontologies, we test our variant of the Sapir-Whorf Hypothesis in several experiments and find that our approach outperforms the corresponding baselines. All in all, the article develops an approach to classifying linguistic networks of tens of thousands of vertices by exploring a small range of mathematically well-established topological indices.
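
The general idea of classifying networks by a handful of topological indices can be illustrated with a minimal sketch. The abstract names neither the indices nor the classifier, so everything below is an assumption chosen for illustration: the three indices (density, mean clustering coefficient, mean shortest-path length), the nearest-neighbour rule, the function names `topological_indices` and `nearest_label`, and the toy graphs with their labels. It is not the authors' method.

```python
from collections import deque


def topological_indices(adj):
    """Return (density, mean clustering coefficient, mean shortest-path length)
    for an undirected graph given as {vertex: set_of_neighbours}.
    The choice of these three indices is an illustrative assumption."""
    n = len(adj)
    m = sum(len(nb) for nb in adj.values()) / 2  # each edge is counted twice
    density = 2 * m / (n * (n - 1)) if n > 1 else 0.0

    # Local clustering: fraction of neighbour pairs that are themselves linked.
    local = []
    for nb in adj.values():
        k = len(nb)
        if k < 2:
            local.append(0.0)
            continue
        links = sum(1 for u in nb for w in nb if u < w and w in adj[u])
        local.append(2 * links / (k * (k - 1)))
    clustering = sum(local) / n if n else 0.0

    # Mean shortest-path length over all reachable ordered pairs (BFS per vertex).
    total, pairs = 0, 0
    for s in adj:
        dist = {s: 0}
        queue = deque([s])
        while queue:
            v = queue.popleft()
            for u in adj[v]:
                if u not in dist:
                    dist[u] = dist[v] + 1
                    queue.append(u)
        total += sum(dist.values())
        pairs += len(dist) - 1
    mean_path = total / pairs if pairs else 0.0
    return (density, clustering, mean_path)


def nearest_label(features, labelled):
    """Return the label of the closest labelled feature vector
    (squared Euclidean distance); a stand-in for any real classifier."""
    def sqdist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(labelled, key=lambda item: sqdist(features, item[0]))[1]


# Hypothetical toy networks; the labels are placeholders, not real language families.
triangle = {0: {1, 2}, 1: {0, 2}, 2: {0, 1}}
path = {0: {1}, 1: {0, 2}, 2: {1}}
star = {0: {1, 2, 3}, 1: {0}, 2: {0}, 3: {0}}

training = [
    (topological_indices(triangle), "family A"),
    (topological_indices(path), "family B"),
]
print(nearest_label(topological_indices(star), training))
```

Each network is reduced to a short feature vector, so the classification step never touches the vertices directly; this is what makes such an approach feasible for graphs with tens of thousands of vertices, although the per-vertex BFS used here for path lengths would need sampling or a faster method at that scale.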