A training algorithm for optimal margin classifiers
COLT '92 Proceedings of the fifth annual workshop on Computational learning theory
A brief survey of web data extraction tools
ACM SIGMOD Record
Mining the peanut gallery: opinion extraction and semantic classification of product reviews
WWW '03 Proceedings of the 12th international conference on World Wide Web
A Fully Automated Object Extraction System for the World Wide Web
ICDCS '01 Proceedings of the The 21st International Conference on Distributed Computing Systems
REES: a large-scale relation and event extraction system
ANLC '00 Proceedings of the sixth conference on Applied natural language processing
An intelligent discussion-bot for answering student queries in threaded discussions
Proceedings of the 11th international conference on Intelligent user interfaces
Thumbs up?: sentiment classification using machine learning techniques
EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
Medical applications in case-based reasoning
The Knowledge Engineering Review
Approaches to text mining for clinical medical records
Proceedings of the 2006 ACM symposium on Applied computing
Methods for using textual entailment in open-domain question answering
ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Finding question-answer pairs from online forums
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Foundations and Trends in Databases
Comparative experiments on sentiment classification for online product reviews
AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
Generating comparative summaries of contradictory opinions in text
Proceedings of the 18th ACM conference on Information and knowledge management
Uniqueness of medical data mining
Artificial Intelligence in Medicine
Predicting thread discourse structure over technical web forums
EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
One size does not fit all: multi-granularity search of web forums
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Patient-centric, multi-role, and multi-dimension information exploration on online healthcare forums
Proceedings of the sixth workshop on Ph.D. students in information and knowledge management
Hi-index | 0.00 |
We study a novel shallow information extraction problem that involves extracting sentences of a given set of topic categories from medical forum data. Given a corpus of medical forum documents, our goal is to extract two related types of sentences that describe a biomedical case (i.e., medical problem descriptions and medical treatment descriptions). Such an extraction task directly generates medical case descriptions that can be useful in many applications. We solve the problem using two popular machine learning methods Support Vector Machines (SVM) and Conditional Random Fields (CRF). We propose novel features to improve the accuracy of extraction. Experiment results show that we can obtain an accuracy of up to 75%.