On the use of words and n-grams for Chinese information retrieval
IRAL '00 Proceedings of the fifth international workshop on on Information retrieval with Asian languages
Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data
ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
Named entity recognition using an HMM-based chunk tagger
ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Shallow parsing with conditional random fields
NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
Chinese named entity recognition using lexicalized HMMs
ACM SIGKDD Explorations Newsletter - Natural language processing and text mining
Chinese Word Segmentation and Named Entity Recognition: A Pragmatic Approach
Computational Linguistics
CONLL '03 Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003 - Volume 4
Chinese Named Entity Recognition combining a statistical model with human knowledge
MultiNER '03 Proceedings of the ACL 2003 workshop on Multilingual and mixed-language named entity recognition - Volume 15
Latent tree models and diagnosis in traditional Chinese medicine
Artificial Intelligence in Medicine
BioNLP '08 Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing
Reranking for biomedical named-entity recognition
BioNLP '07 Proceedings of the Workshop on BioNLP 2007: Biological, Translational, and Clinical Language Processing
A self-learning expert system for diagnosis in traditional Chinese medicine
Expert Systems with Applications: An International Journal
Methodological Review: Text mining for traditional Chinese medical knowledge discovery: A survey
Journal of Biomedical Informatics
Journal of Biomedical Informatics
Journal of Biomedical Informatics
Hi-index | 0.00 |
A preliminary work on symptom name recognition from free-text clinical records (FCRs) of traditional Chinese medicine (TCM) is depicted in this paper. This problem is viewed as labeling each character in FCRs of TCM with a pre-defined tag ("B-SYC", "I-SYC" or "O-SYC") to indicate the character's role (a beginning, inside or outside part of a symptom name). The task is handled by Conditional Random Fields (CRFs) based on two types of features. The symptom name recognition F-Measure can reach up to 62.829% with recognition rate 93.403% and recognition error rate 52.665% under our experiment settings. The feasibility and effectiveness of the methods and reasonable features are verified, and several interesting and helpful results are shown. A detailed analysis for recognizing symptom names from FCRs of TCM is presented through analyzing labeling results of CRFs.