Information flow analysis with chinese text

  • Authors:
  • Paulo Cheong;Dawei Song;Peter Bruza;Kam-Fai Wong

  • Affiliations:
  • CRC for Enterprise Distributed Systems Technology (DSTC), The University of Queensland, Brisbane, QLD;CRC for Enterprise Distributed Systems Technology (DSTC), The University of Queensland, Brisbane, QLD;CRC for Enterprise Distributed Systems Technology (DSTC), The University of Queensland, Brisbane, QLD;Department of Systems Engineering and Engineering Management, The Chinese University of Hong Kong, Shatin, N.T, Hong Kong

  • Venue:
  • IJCNLP'04 Proceedings of the First international joint conference on Natural Language Processing
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

This article investigates the effectiveness of an information inference mechanism on Chinese text. The information inference derives implicit associations via computation of information flow on a high dimensional conceptual space, which is approximated by a cognitively motivated lexical semantic space model, namely Hyperspace Analogue to Language (HAL). A dictionary-based Chinese word segmentation system was used to segment words. To evaluate the Chinese-based information flow model, it is applied to query expansion, in which a set of test queries are expanded automatically via information flow computations and documents are retrieved. Standard recall-precision measures are used to measure performance. Experimental results for TREC-5 Chinese queries and People Daily’s corpus suggest that the Chinese information flow model significantly increases average precision, though the increase is not as high as those achieved using English corpus. Nevertheless, there is justification to believe that the HAL-based information flow model, and in turn our psychologistic stance on the next generation of information processing systems, have a promising degree of language independence.