Capturing the structures in association knowledge: application of network analyses to large-scale databases of Japanese word associations

  • Authors:
  • Terry Joyce;Maki Miyake

  • Affiliations:
  • School of Global Studies, Tama University, Fujisawa, Kanagawa, Japan;Graduate School of Language and Culture, Osaka University, Toyonaka-shi, Osaka, Japan

  • Venue:
  • LKR'08 Proceedings of the 3rd international conference on Large-scale knowledge resources: construction and application
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Within the general enterprise of probing into the complexities of lexical knowledge, one particularly promising research focus is on word association knowledge. Given Deese's [1] and Cramer's [2] convictions that word association closely mirror the structured patterns of relations that exist among concepts, as largely echoed Hirst's [3] more recent comments about the close relationships between lexicons and ontologies, as well as Firth's [4] remarks about finding a word's meaning in the company it keeps, efforts to capture and unravel the rich networks of associations that connect words together are likely to yield interesting insights into the nature of lexical knowledge. Adopting such an approach, this paper applies a range of network analysis techniques in order to investigate the characteristics of network representations of word association knowledge in Japanese. Specifically, two separate association networks are constructed from two different large-scale databases of Japanese word associations: the Associative Concept Dictionary (ACD) by Okamoto and Ishizaki [5] and the Japanese Word Association Database (JWAD) by Joyce [6] [7] [8]. Results of basic statistical analyses of the association networks indicate that both are scale-free with smallworld properties and that both exhibit hierarchical organization. As effective methods of discerning associative structures with networks, some graph clustering algorithms are also applied. In addition to the basic Markov Clustering algorithm proposed by van Dongen [9], the present study also employs a recently proposed combination of the enhanced Recurrent Markov Cluster algorithm (RMCL) [10] with an index of modularity [11]. Clustering results show that the RMCL and modularity combination provides effective control over cluster sizes. The results also demonstrate the effectiveness of graph clustering approaches to capturing the structures within large-scale association knowledge resources, such as the two constructed networks of Japanese word associations.