A maximum entropy approach to natural language processing
Computational Linguistics
Classifying racist texts using a support vector machine
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Unifying collaborative and content-based filtering
ICML '04 Proceedings of the twenty-first international conference on Machine learning
US Domestic Extremist Groups on the Web: Link and Content Analysis
IEEE Intelligent Systems
Guest Editors' Introduction: Social Computing
IEEE Intelligent Systems
Detecting cyber security threats in weblogs using probabilistic models
PAISI'07 Proceedings of the 2007 Pacific Asia conference on Intelligence and security informatics
A data-centric approach to feed search in blogs
International Journal of Web Engineering and Technology
Hi-index | 0.00 |
This paper introduces a new approach for topic-oriented information detection and scoring (TOIDS) based on a hybrid design: integrating characteristic word combination and self learning. Using the characteristic word combination approach, both related and unrelated words are involved to judge a webpage's relevance. To address the domain adaptation problem, our self learning technique utilizes historical information from characteristic word lexicon to facilitate detection. Empirical results indicate that the proposed approach outperforms benchmark systems, achieving higher precision. We also demonstrate that our approach can be easily adapted in different domains.