Making large-scale support vector machine learning practical
Advances in kernel methods
Reducing the run-time complexity in support vector machines
Advances in kernel methods
Automatic categorization of web sites based on source types
Proceedings of the fifteenth ACM conference on Hypertext and hypermedia
User Centred Quality Health Information Provision: Benefits and Challenges
HICSS '05 Proceedings of the Proceedings of the 38th Annual Hawaii International Conference on System Sciences (HICSS'05) - Track 6 - Volume 06
Evaluating the Quality of Health Web Sites: Developing a Validation Method and Rating Instrument
HICSS '05 Proceedings of the Proceedings of the 38th Annual Hawaii International Conference on System Sciences (HICSS'05) - Track 6 - Volume 06
An analysis of the relative hardness of Reuters-21578 subsets: Research Articles
Journal of the American Society for Information Science and Technology
Projecting Computational Sense of Self: A Study of Transition in a Chronic Illness Online Community
HICSS '06 Proceedings of the 39th Annual Hawaii International Conference on System Sciences - Volume 05
Using bag-of-concepts to improve the performance of support vector machines in text categorization
COLING '04 Proceedings of the 20th international conference on Computational Linguistics
CBMS '07 Proceedings of the Twentieth IEEE International Symposium on Computer-Based Medical Systems
Automatic Web Page Categorization using Principal Component Analysis
HICSS '07 Proceedings of the 40th Annual Hawaii International Conference on System Sciences
Automatically generated consumer health metadata using semantic spaces
HDKM '08 Proceedings of the second Australasian workshop on Health data and knowledge management - Volume 80
Methodological Review: Empirical distributional semantics: Methods and biomedical applications
Journal of Biomedical Informatics
Guest Editorial: Current issues in biomedical text mining and natural language processing
Journal of Biomedical Informatics
A decision-tree-based symbolic rule induction system for text categorization
IBM Systems Journal
Journal of Biomedical Informatics
Naive bayes for text classification with unbalanced classes
PKDD'06 Proceedings of the 10th European conference on Principle and Practice of Knowledge Discovery in Databases
Categorization of computing education resources with utilization of crowdsourcing
Proceedings of the 12th ACM/IEEE-CS joint conference on Digital Libraries
Hi-index | 0.00 |
To deal with the quantity and quality issues with online healthcare resources, creating web portals centred on particular health topics and/or communities of users is a strategy to provide access to a reduced corpus of information resources that meet quality and relevance criteria. In this paper we use hyperspace analogue to language (HAL) to model the language use patterns of webpages as Semantic Spaces. We have applied machine learning methods, including support vector machine (SVM), decision forest, and a novel summed similarity measure (SSM) to automatically classify online webpages on their Semantic Space models. We find classification accuracy on metadata attributes to be over 93% for 'medical' versus 'supportive' perspective, over 92% for disease stage of 'early' versus 'advanced', and over 90% for author credentials of 'lay' versus 'clinician' based on webpages of the Breast Cancer Knowledge Online portal. These results indicate that language use patterns can be used to automate such classification with useful levels of accuracy.