Similarity computing model of high dimension data for symptom classification of Chinese traditional medicine

  • Authors:
  • Jing Peng;Chang-jie Tang;Dong-qing Yang;Jing Zhang;Jian-jun Hu

  • Affiliations:
  • School of Computer Science and Engineering, Sichuan University, Chengdu 610065, China and School of Electronics Engineering and Computer Science, Peking University, Beijing 100871, China;School of Computer Science and Engineering, Sichuan University, Chengdu 610065, China;School of Electronics Engineering and Computer Science, Peking University, Beijing 100871, China;Chengdu Jiuheyuan Industry Company, Chengdu 610015, China;School of Computer Science and Engineering, Sichuan University, Chengdu 610065, China

  • Venue:
  • Applied Soft Computing
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

In recent years, researchers have paid more and more attention on data mining of practical applications. Aimed to the problem of symptom classification of Chinese traditional medicine, this paper proposes a novel computing model based on the similarities among attributes of high dimension data to compute the similarity between any tuples. This model assumes data attributes as basic vectors of m dimensions and each tuple as a sum vector of all the attribute-vectors. Based on the transcendental concept similarity information among attributes, it suggests a novel distance algorithm to compute the similarity distance of any pair of attribute-vectors. In this method, the computing of similarity between any tuples are turned to the formulas of attribute-vectors and their projections of each other, and the similarity between any pair of tuples can be worked out by computing these vectors and formulas. This paper also presents a novel classification algorithm based on the similarity computing model and successfully applies the algorithm into the symptom classification of Chinese traditional medicine. The efficiency of the algorithm is proved by extensive experiments.