Development of a multilingual text mining approach for knowledge discovery in patents

  • Authors:
  • Chung-Hong Lee;Hsin-Chang Yang;Yi-Ju Li

  • Affiliations:
  • Department of Electrical Engineering, National Kaohsiung University of Applied Sciences, Kaohsiung, Taiwan;Department of Information Management, National University of Kaohsiung, Kaohsiung, Taiwan;Department of Electrical Engineering, National Kaohsiung University of Applied Sciences, Kaohsiung, Taiwan

  • Venue:
  • SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we describe our work on developing a novel technique for discovery of implicit knowledge about patents from multilingual patent information sources. In this work we developed a system platform to support locating similar and relevant multilingual patent documents. The platform was implemented using a multilingual vector space based on the latent semantic indexing (LSI) model, and utilizing collected professional Chinese-English parallel corpora for training the system model. These multilingual patent documents could then be mapped into the semantic vector space for evaluating their similarity by means of text clustering techniques. The preliminary results show that our platform framework has potential for retrieval and relatedness evaluation of multilingual patent documents.