Term-weighting approaches in automatic text retrieval
Information Processing and Management: an International Journal
Inductive learning algorithms and representations for text categorization
Proceedings of the seventh international conference on Information and knowledge management
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features
ECML '98 Proceedings of the 10th European Conference on Machine Learning
Large-scale text categorization by batch mode active learning
Proceedings of the 15th international conference on World Wide Web
Very sparse random projections
Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Very sparse stable random projections for dimension reduction in lα (0
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Topic and keyword re-ranking for LDA-based topic modeling
Proceedings of the 18th ACM conference on Information and knowledge management
An extensive study on automated Dewey Decimal Classification
Journal of the American Society for Information Science and Technology
A negative category based approach for Wikipedia document classification
International Journal of Knowledge Engineering and Data Mining
ADMI'10 Proceedings of the 6th international conference on Agents and data mining interaction
TIARA: Interactive, Topic-Based Visual Text Summarization and Analysis
ACM Transactions on Intelligent Systems and Technology (TIST)
Intelligent search on the internet
Reasoning, Action and Interaction in AI Theories and Systems
An ontology-based mechanism for automatic categorization of web services
Concurrency and Computation: Practice & Experience
Hierarchical classification of web documents by stratified discriminant analysis
IRFC'12 Proceedings of the 5th conference on Multidisciplinary Information Retrieval
Soft cardinality + ML: learning adaptive similarity functions for cross-lingual textual entailment
SemEval '12 Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation
Assessing the quality of textual features in social media
Information Processing and Management: an International Journal
Parallel rare term vector replacement: Fast and effective dimensionality reduction for text
Journal of Parallel and Distributed Computing
International Journal of Multimedia Data Engineering & Management
Hi-index | 0.00 |
Term weighting scheme, which has been used to convert the documents as vectors in the term space, is a vital step in automatic text categorization. In this paper, we conducted comprehensive experiments to compare various term weighting schemes with SVM on two widely-used benchmark data sets. We also presented a new term weighting scheme tf-rf to improve the term's discriminating power. The controlled experimental results showed that this newly proposed tf-rf scheme is significantly better than other widely-used term weighting schemes. Compared with schemes related with tf factor alone, the idf factor does not improve or even decrease the term's discriminating power for text categorization.