A framework and its empirical study of automatic diagnosis of traditional Chinese medicine utilizing raw free-text clinical records

  • Authors:
  • Yaqiang Wang;Zhonghua Yu;Yongguang Jiang;Yongchao Liu;Li Chen;Yiguang Liu

  • Affiliations:
  • Department of Computer Science, Sichuan University, Chengdu, Sichuan 610064, PR China;Department of Computer Science, Sichuan University, Chengdu, Sichuan 610064, PR China;Department of Preclinical Medicine, Chengdu University of Traditional Chinese Medicine, Chengdu, Sichuan 610075, PR China;Medical College, Beihua University, Jilin, Jilin 132013, PR China;Department of Computer Science, Sichuan University, Chengdu, Sichuan 610064, PR China;Department of Computer Science, Sichuan University, Chengdu, Sichuan 610064, PR China

  • Venue:
  • Journal of Biomedical Informatics
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Automatic diagnosis is one of the most important parts in the expert system of traditional Chinese medicine (TCM), and in recent years, it has been studied widely. Most of the previous researches are based on well-structured datasets which are manually collected, structured and normalized by TCM experts. However, the obtained results of the former work could not be directly and effectively applied to clinical practice, because the raw free-text clinical records differ a lot from the well-structured datasets. They are unstructured and are denoted by TCM doctors without the support of authoritative editorial board in their routine diagnostic work. Therefore, in this paper, a novel framework of automatic diagnosis of TCM utilizing raw free-text clinical records for clinical practice is proposed and investigated for the first time. A series of appropriate methods are attempted to tackle several challenges in the framework, and the Naive Bayes classifier and the Support Vector Machine classifier are employed for TCM automatic diagnosis. The framework is analyzed carefully. Its feasibility is validated through evaluating the performance of each module of the framework and its effectiveness is demonstrated based on the precision, recall and F-Measure of automatic diagnosis results.