Much epidemiologic information resides in the literature, which is not in a computable format. To extract this information and build knowledge bases of epidemiologic studies, we developed a system that extracts noun phrases describing epidemiologic exposures and outcomes. The system consists of two components: a natural language processing (NLP) engine and a machine learning (ML) based classifier. Four ML algorithms were applied and compared over different feature sets. To evaluate the system's performance, we manually constructed an annotated dataset. The system achieved a highest F-measure of 82.0% for extracting exposure terms and 70% for extracting outcome terms.
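
For readers unfamiliar with the reported metric, the following is a minimal sketch of how an F-measure can be computed from extraction counts. It assumes the balanced variant (F1, the harmonic mean of precision and recall); the abstract does not state which variant or matching criterion was used. The function name and the counts in the usage line are hypothetical and are not taken from the study.

```python
def f_measure(true_positives: int, false_positives: int, false_negatives: int) -> float:
    """Balanced F-measure (F1) from extraction counts.

    Assumption: F1 is the variant meant by "F-measure"; the source does not say.
    """
    if true_positives == 0:
        # No correct extractions: precision and recall are both zero (or undefined).
        return 0.0
    precision = true_positives / (true_positives + false_positives)
    recall = true_positives / (true_positives + false_negatives)
    # Harmonic mean of precision and recall.
    return 2 * precision * recall / (precision + recall)


# Hypothetical counts, for illustration only.
score = f_measure(true_positives=8, false_positives=2, false_negatives=2)
print(f"F-measure: {score:.3f}")
```

Because the harmonic mean is pulled toward the smaller of its two inputs, a classifier cannot reach a high F-measure by maximizing precision at the expense of recall, or the reverse.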