Term-weighting approaches in automatic text retrieval
Information Processing and Management: an International Journal
A probabilistic learning approach for document indexing
ACM Transactions on Information Systems (TOIS) - Special issue on research and development in information retrieval
Text representation for intelligent text retrieval: a classification-oriented view
Text-based intelligent systems
An example-based mapping method for text categorization and retrieval
ACM Transactions on Information Systems (TOIS)
Combining multiple evidence from different properties of weighting schemes
SIGIR '95 Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval
A comparison of classifiers and document representations for the routing problem
SIGIR '95 Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval
Training algorithms for linear text classifiers
SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
An Evaluation of Statistical Approaches to Text Categorization
Information Retrieval
A vector space model for automatic indexing
Communications of the ACM
A statistical learning learning model of text classification for support vector machines
Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Machine learning in automated text categorization
ACM Computing Surveys (CSUR)
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features
ECML '98 Proceedings of the 10th European Conference on Machine Learning
A Comparative Study on Feature Selection in Text Categorization
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Centroid-Based Document Classification: Analysis and Experimental Results
PKDD '00 Proceedings of the 4th European Conference on Principles of Data Mining and Knowledge Discovery
Scaling multi-class support vector machines using inter-class confusion
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Supervised term weighting for automated text categorization
Proceedings of the 2003 ACM symposium on Applied computing
Effect of term distributions on centroid-based text categorization
Information Sciences—Informatics and Computer Science: An International Journal - Special issue: Informatics and computer science intelligent systems applications
RCV1: A New Benchmark Collection for Text Categorization Research
The Journal of Machine Learning Research
Information extraction from research papers using conditional random fields
Information Processing and Management: an International Journal
A novel feature selection algorithm for text categorization
Expert Systems with Applications: An International Journal
Top 10 algorithms in data mining
Knowledge and Information Systems
An improved centroid classifier for text categorization
Expert Systems with Applications: An International Journal
Imbalanced text classification: A term weighting approach
Expert Systems with Applications: An International Journal
Text classification from unlabeled documents with bootstrapping and feature projection techniques
Information Processing and Management: an International Journal
Feature selection for text classification with Naïve Bayes
Expert Systems with Applications: An International Journal
Supervised and Traditional Term Weighting Methods for Automatic Text Categorization
IEEE Transactions on Pattern Analysis and Machine Intelligence
Automatic text categorization based on content analysis with cognitive situation models
Information Sciences: an International Journal
Document indexing: a concept-based approach to term weight estimation
Information Processing and Management: an International Journal
Information Processing and Management: an International Journal - Special issue: Formal methods for information retrieval
Simultaneous feature selection and classification using kernel-penalized support vector machines
Information Sciences: an International Journal
Two-level hierarchical combination method for text classification
Expert Systems with Applications: An International Journal
Ensemble of feature sets and classification algorithms for sentiment classification
Information Sciences: an International Journal
Comparison of metrics for feature selection in imbalanced text classification
Expert Systems with Applications: An International Journal
Word co-occurrence features for text classification
Information Systems
A semantic term weighting scheme for text categorization
Expert Systems with Applications: An International Journal
Expert Systems with Applications: An International Journal
Exploring dictionary-based semantic relatedness in labeled tree data
Information Sciences: an International Journal
A study of supervised term weighting scheme for sentiment analysis
Expert Systems with Applications: An International Journal
Hi-index | 0.07 |
Most of the previous studies related on different term weighting emphasize on the document-indexing-based and four fundamental information elements-based approaches to address automatic text classification (ATC). In this study, we introduce class-indexing-based term-weighting approaches and judge their effects in high-dimensional and comparatively low-dimensional vector space over the TF.IDF and five other different term weighting approaches that are considered as the baseline approaches. First, we implement a class-indexing-based TF.IDF.ICF observational term weighting approach in which the inverse class frequency (ICF) is incorporated. In the experiment, we investigate the effects of TF.IDF.ICF over the Reuters-21578, 20 Newsgroups, and RCV1-v2 datasets as benchmark collections, which provide positive discrimination on rare terms in the vector space and biased against frequent terms in the text classification (TC) task. Therefore, we revised the ICF function and implemented a new inverse class space density frequency (ICS"@dF), and generated the TF.IDF.ICS"@dF method that provides a positive discrimination on infrequent and frequent terms. We present detailed evaluation of each category for the three datasets with term weighting approaches. The experimental results show that the proposed class-indexing-based TF.IDF.ICS"@dF term weighting approach is promising over the compared well-known baseline term weighting approaches.