Term-weighting approaches in automatic text retrieval
Information Processing and Management: an International Journal
Training algorithms for linear text classifiers
SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
Learning routing queries in a query zone
Proceedings of the 20th annual international ACM SIGIR conference on Research and development in information retrieval
A re-examination of text categorization methods
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
An Evaluation of Statistical Approaches to Text Categorization
Information Retrieval
Naive (Bayes) at Forty: The Independence Assumption in Information Retrieval
ECML '98 Proceedings of the 10th European Conference on Machine Learning
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features
ECML '98 Proceedings of the 10th European Conference on Machine Learning
A novel feature selection algorithm for text categorization
Expert Systems with Applications: An International Journal
A fuzzy clustering approach for finding similar documents using a novel similarity measure
Expert Systems with Applications: An International Journal
Document Classification Based on Support Vector Machine Using a Concept Vector Model
WI '06 Proceedings of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence
Expert Systems with Applications: An International Journal
Expert Systems with Applications: An International Journal
A new approach on search for similar documents with multiple categories using fuzzy clustering
Expert Systems with Applications: An International Journal
Locally linear reconstruction for instance-based learning
Pattern Recognition
Feature selection for text classification with Naïve Bayes
Expert Systems with Applications: An International Journal
On the use of surrounding neighbors for synthetic over-sampling of the minority class
SMO'08 Proceedings of the 8th conference on Simulation, modelling and optimization
International Journal of Approximate Reasoning
Expert Systems with Applications: An International Journal
Automatic classification of Tamil documents using vector space model and artificial neural network
Expert Systems with Applications: An International Journal
Use of Ensemble Based on GA for Imbalance Problem
ISNN 2009 Proceedings of the 6th International Symposium on Neural Networks: Advances in Neural Networks - Part II
Enhancing the Performance of Centroid Classifier by ECOC and Model Refinement
ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part II
Proceedings of the 18th ACM conference on Information and knowledge management
Chinese weblog pages classification based on folksonomy and support vector machines
AIS-ADM'07 Proceedings of the 2nd international conference on Autonomous intelligent systems: agents and data mining
Analytical evaluation of term weighting schemes for text categorization
Pattern Recognition Letters
Local class boundaries for support vector machine
LSMS/ICSEE'10 Proceedings of the 2010 international conference on Life system modeling and simulation and intelligent computing, and 2010 international conference on Intelligent computing for sustainable energy and environment: Part II
Exploiting probabilistic topic models to improve text categorization under class imbalance
Information Processing and Management: an International Journal
Exploring the performance of resampling strategies for the class imbalance problem
IEA/AIE'10 Proceedings of the 23rd international conference on Industrial engineering and other applications of applied intelligent systems - Volume Part I
Surrounding influenced K-nearest neighbors: a new distance based classifier
ADMA'10 Proceedings of the 6th international conference on Advanced data mining and applications: Part I
Adapting centroid classifier for document categorization
Expert Systems with Applications: An International Journal
An improved K-nearest-neighbor algorithm for text categorization
Expert Systems with Applications: An International Journal
Investigation of supervised dimensionality reduction methods for phonetic classification
Proceedings of the Third International Conference on Internet Multimedia Computing and Service
FSKNN: Multi-label text categorization based on fuzzy similarity and k nearest neighbors
Expert Systems with Applications: An International Journal
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
An adaptive fuzzy kNN text classifier
ICCS'06 Proceedings of the 6th international conference on Computational Science - Volume Part III
Combined effects of class imbalance and class overlap on instance-based classification
IDEAL'06 Proceedings of the 7th international conference on Intelligent Data Engineering and Automated Learning
A proposal of evolutionary prototype selection for class imbalance problems
IDEAL'06 Proceedings of the 7th international conference on Intelligent Data Engineering and Automated Learning
Text categorization methods for automatic estimation of verbal intelligence
Expert Systems with Applications: An International Journal
A text classification algorithm based on rocchio and hierarchical clustering
ICIC'11 Proceedings of the 7th international conference on Advanced Intelligent Computing
K Nearest Neighbor Equality: Giving equal chance to all existing classes
Information Sciences: an International Journal
Hybrid random forests: advantages of mixed trees in classifying text data
PAKDD'12 Proceedings of the 16th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part I
GA based optimal keyword extraction in an automatic chinese web document classification system
ISPA'07 Proceedings of the 2007 international conference on Frontiers of High Performance Computing and Networking
The decomposed k-nearest neighbor algorithm for imbalanced text classification
FGIT'12 Proceedings of the 4th international conference on Future Generation Information Technology
A document is known by the company it keeps: neighborhood consensus for short text categorization
Language Resources and Evaluation
Pattern Recognition Letters
Computer-aided diagnosis system: A Bayesian hybrid classification method
Computer Methods and Programs in Biomedicine
Class imbalance and the curse of minority hubs
Knowledge-Based Systems
Expert Systems with Applications: An International Journal
Hi-index | 12.07 |
Text categorization or classification is the automated assigning of text documents to pre-defined classes based on their contents. Many of classification algorithms usually assume that the training examples are evenly distributed among different classes. However, unbalanced data sets often appear in many practical applications. In order to deal with uneven text sets, we propose the neighbor-weighted K-nearest neighbor algorithm, i.e. NWKNN. The experimental results indicate that our algorithm NWKNN achieves significant classification performance improvement on imbalanced corpora.