The nature of statistical learning theory
The nature of statistical learning theory
A re-examination of text categorization methods
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
An Evaluation of Statistical Approaches to Text Categorization
Information Retrieval
Feature selection on hierarchy of web documents
Decision Support Systems - Web retrieval and mining
Naive (Bayes) at Forty: The Independence Assumption in Information Retrieval
ECML '98 Proceedings of the 10th European Conference on Machine Learning
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features
ECML '98 Proceedings of the 10th European Conference on Machine Learning
A Comparative Study on Feature Selection in Text Categorization
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Feature Selection for Unbalanced Class Distribution and Naive Bayes
ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
Scalable Classifiers with Dynamic Pruning
DEXA '98 Proceedings of the 9th International Workshop on Database and Expert Systems Applications
Neighbor-weighted K-nearest neighbor for unbalanced text corpus
Expert Systems with Applications: An International Journal
An improved kNN algorithm – fuzzy kNN
CIS'05 Proceedings of the 2005 international conference on Computational Intelligence and Security - Volume Part I
Nearest neighbor pattern classification
IEEE Transactions on Information Theory
Using ambiguity measure feature selection algorithm for support vector machine classifier
Proceedings of the 2008 ACM symposium on Applied computing
Making CN2-SD subgroup discovery algorithm scalable to large size data sets using instance selection
Expert Systems with Applications: An International Journal
Feature selection with a measure of deviations from Poisson in text categorization
Expert Systems with Applications: An International Journal
Text feature selection using ant colony optimization
Expert Systems with Applications: An International Journal
Feature selection for text classification with Naïve Bayes
Expert Systems with Applications: An International Journal
PicAChoo: a tool for customizable feature extraction utilizing characteristics of textual data
Proceedings of the 3rd International Conference on Ubiquitous Information Management and Communication
One-against-one fuzzy support vector machine classifier: An approach to text categorization
Expert Systems with Applications: An International Journal
Accessing Positive and Negative Online Opinions
UAHCI '09 Proceedings of the 5th International Conference on Universal Access in Human-Computer Interaction. Part III: Applications and Services
Feature reduction techniques for Arabic text categorization
Journal of the American Society for Information Science and Technology
Distinctive characteristics of a metric using deviations from Poisson for feature selection
Expert Systems with Applications: An International Journal
A novel hybrid ACO-GA algorithm for text feature selection
CEC'09 Proceedings of the Eleventh conference on Congress on Evolutionary Computation
A framework of feature selection methods for text categorization
ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2 - Volume 2
An Improved Feature Selection for Categorization Based on Mutual Information
WISM '09 Proceedings of the International Conference on Web Information Systems and Mining
Improving annotation categorization performance through integrated social annotation computation
Expert Systems with Applications: An International Journal
Comparison of metrics for feature selection in imbalanced text classification
Expert Systems with Applications: An International Journal
A new feature selection algorithm based on binomial hypothesis testing for spam filtering
Knowledge-Based Systems
Incorporating game theory in feature selection for text categorization
RSFDGrC'11 Proceedings of the 13th international conference on Rough sets, fuzzy sets, data mining and granular computing
Feature sub-set selection metrics for Arabic text classification
Pattern Recognition Letters
Proceedings of the Third Symposium on Information and Communication Technology
An adaption of relief for redundant feature elimination
ISNN'12 Proceedings of the 9th international conference on Advances in Neural Networks - Volume Part II
Persian text classification based on K-NN using wordnet
IEA/AIE'12 Proceedings of the 25th international conference on Industrial Engineering and Other Applications of Applied Intelligent Systems: advanced research in applied artificial intelligence
A novel probabilistic feature selection method for text classification
Knowledge-Based Systems
The Effect of Stemming on Arabic Text Classification: An Empirical Study
International Journal of Information Retrieval Research
Class-indexing-based term weighting for automatic text classification
Information Sciences: an International Journal
Hi-index | 12.06 |
With the development of the web, large numbers of documents are available on the Internet. Digital libraries, news sources and inner data of companies surge more and more. Automatic text categorization becomes more and more important for dealing with massive data. However the major problem of text categorization is the high dimensionality of the feature space. At present there are many methods to deal with text feature selection. To improve the performance of text categorization, we present another method of dealing with text feature selection. Our study is based on Gini index theory and we design a novel Gini index algorithm to reduce the high dimensionality of the feature space. A new measure function of Gini index is constructed and made to fit text categorization. The results of experiments show that our improvements of Gini index behave better than other methods of feature selection.