A framework and its empirical study of automatic diagnosis of traditional Chinese medicine utilizing raw free-text clinical records

Authors:
Yaqiang Wang;Zhonghua Yu;Yongguang Jiang;Yongchao Liu;Li Chen;Yiguang Liu
Affiliations:
Department of Computer Science, Sichuan University, Chengdu, Sichuan 610064, PR China;Department of Computer Science, Sichuan University, Chengdu, Sichuan 610064, PR China;Department of Preclinical Medicine, Chengdu University of Traditional Chinese Medicine, Chengdu, Sichuan 610075, PR China;Medical College, Beihua University, Jilin, Jilin 132013, PR China;Department of Computer Science, Sichuan University, Chengdu, Sichuan 610064, PR China;Department of Computer Science, Sichuan University, Chengdu, Sichuan 610064, PR China
Venue:
Journal of Biomedical Informatics
Year:
2012

Citing 13
Cited 2

Foundations of statistical natural language processing

Foundations of statistical natural language processing
On the use of words and n-grams for Chinese information retrieval

IRAL '00 Proceedings of the fifth international workshop on on Information retrieval with Asian languages
A Comparative Study on Feature Selection in Text Categorization

ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
A comparative study of feature selection and multiclass classification methods for tissue classification based on gene expression

Bioinformatics
Chinese Word Segmentation and Named Entity Recognition: A Pragmatic Approach

Computational Linguistics
Bidirectional inference with the easiest-first strategy for tagging sequence data

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Methodological review: Knowledge discovery in traditional Chinese medicine: State of the art and perspectives

Artificial Intelligence in Medicine
Computational methods for Traditional Chinese Medicine: A survey

Computer Methods and Programs in Biomedicine
Latent tree models and diagnosis in traditional Chinese medicine

Artificial Intelligence in Medicine
A self-learning expert system for diagnosis in traditional Chinese medicine

Expert Systems with Applications: An International Journal
Methodological Review: Text mining for traditional Chinese medical knowledge discovery: A survey

Journal of Biomedical Informatics
LIBSVM: A library for support vector machines

ACM Transactions on Intelligent Systems and Technology (TIST)
Developing a robust part-of-speech tagger for biomedical text

PCI'05 Proceedings of the 10th Panhellenic conference on Advances in Informatics

A preliminary work on symptom name recognition from free-text clinical records of traditional chinese medicine using conditional random fields and reasonable features

BioNLP '12 Proceedings of the 2012 Workshop on Biomedical Natural Language Processing
Supervised methods for symptom name recognition in free-text clinical records of traditional Chinese medicine: An empirical study

Journal of Biomedical Informatics

Quantified Score

Hi-index	0.00

Visualization

Abstract

Automatic diagnosis is one of the most important parts in the expert system of traditional Chinese medicine (TCM), and in recent years, it has been studied widely. Most of the previous researches are based on well-structured datasets which are manually collected, structured and normalized by TCM experts. However, the obtained results of the former work could not be directly and effectively applied to clinical practice, because the raw free-text clinical records differ a lot from the well-structured datasets. They are unstructured and are denoted by TCM doctors without the support of authoritative editorial board in their routine diagnostic work. Therefore, in this paper, a novel framework of automatic diagnosis of TCM utilizing raw free-text clinical records for clinical practice is proposed and investigated for the first time. A series of appropriate methods are attempted to tackle several challenges in the framework, and the Naive Bayes classifier and the Support Vector Machine classifier are employed for TCM automatic diagnosis. The framework is analyzed carefully. Its feasibility is validated through evaluating the performance of each module of the framework and its effectiveness is demonstrated based on the precision, recall and F-Measure of automatic diagnosis results.