Wrappers for feature subset selection
Artificial Intelligence - Special issue on relevance
Inductive learning algorithms and representations for text categorization
Proceedings of the seventh international conference on Information and knowledge management
A re-examination of text categorization methods
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Data mining: practical machine learning tools and techniques with Java implementations
Text Categorization with Support Vector Machines: Learning with Many Relevant Features
ECML '98 Proceedings of the 10th European Conference on Machine Learning
A Comparative Study on Feature Selection in Text Categorization
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Feature Selection for Unbalanced Class Distribution and Naive Bayes
ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
Choose Your Words Carefully: An Empirical Study of Feature Selection Metrics for Text Classification
PKDD '02 Proceedings of the 6th European Conference on Principles of Data Mining and Knowledge Discovery
Feature engineering for a gene regulation prediction task
ACM SIGKDD Explorations Newsletter
An introduction to variable and feature selection
The Journal of Machine Learning Research
Editorial: special issue on learning from imbalanced data sets
ACM SIGKDD Explorations Newsletter - Special issue on learning from imbalanced datasets
Feature selection for text categorization on imbalanced data
ACM SIGKDD Explorations Newsletter - Special issue on learning from imbalanced datasets
A pitfall and solution in multi-class feature selection for text classification
ICML '04 Proceedings of the twenty-first international conference on Machine learning
ICML '04 Proceedings of the twenty-first international conference on Machine learning
An Empirical Study of Feature Selection for Text Categorization based on Term Weightage
WI '04 Proceedings of the 2004 IEEE/WIC/ACM International Conference on Web Intelligence
Classification and knowledge discovery in protein databases
Journal of Biomedical Informatics - Special issue: Biomedical machine learning
Efficient Feature Selection via Analysis of Relevance and Redundancy
The Journal of Machine Learning Research
An experimental study on large-scale web categorization
WWW '05 Special interest tracks and posters of the 14th international conference on World Wide Web
Variable selection and ranking for analyzing automobile traffic accident data
Proceedings of the 2005 ACM symposium on Applied computing
Local sparsity control for naive Bayes with extreme misclassification costs
Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Support vector machines classification with a very large-scale taxonomy
ACM SIGKDD Explorations Newsletter - Natural language processing and text mining
Generalized LARS as an effective feature selection tool for text classification with SVMs
ICML '05 Proceedings of the 22nd international conference on Machine learning
Bias Analysis in Text Classification for Highly Skewed Data
ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
Angular measures for feature selection in text categorization
Proceedings of the 2006 ACM symposium on Applied computing
Exploiting partial decision trees for feature subset selection in e-mail categorization
Proceedings of the 2006 ACM symposium on Applied computing
Feature subset selection bias for classification learning
ICML '06 Proceedings of the 23rd international conference on Machine learning
Blocking objectionable web content by leveraging multiple information sources
ACM SIGKDD Explorations Newsletter
Tackling concept drift by temporal inductive transfer
SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Quantifying trends accurately despite classifier error and class imbalance
Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Acclimatizing Taxonomic Semantics for Hierarchical Content Classification
Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Single-pass online learning: performance, voting schemes and online feature selection
Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Pragmatic text mining: minimizing human effort to quantify many issues in call logs
Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
A New Text Categorization Technique Using Distributional Clustering and Learning Logic
IEEE Transactions on Knowledge and Data Engineering
Higher order feature selection for text classification
Knowledge and Information Systems
NEWPAR: an automatic feature selection and weighting schema for category ranking
Proceedings of the 2006 ACM symposium on Document engineering
Combining feature selectors for text classification
CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Multi-class feature selection for texture classification
Pattern Recognition Letters
A quantitative analysis of lexical differences between genders in telephone conversations
ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
A study on automatically extracted keywords in text categorization
ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
A semi-supervised feature clustering algorithm with application to word sense disambiguation
HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Comparison of feature selection and classification algorithms in identifying malicious executables
Computational Statistics & Data Analysis
Process-Specific Information for Learning Electronic Negotiation Outcomes
Fundamenta Informaticae
Learning rules with negation for text categorization
Proceedings of the 2007 ACM symposium on Applied computing
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
On the strength of hyperclique patterns for text categorization
Information Sciences: an International Journal
Feature selection methods for text classification
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Information-theoretic semantic multimedia indexing
Proceedings of the 6th ACM international conference on Image and video retrieval
Language morphology offset: Text classification on a Croatian-English parallel corpus
Information Processing and Management: an International Journal
Topic taxonomy adaptation for group profiling
ACM Transactions on Knowledge Discovery from Data (TKDD)
Understanding temporal aspects in document classification
WSDM '08 Proceedings of the 2008 International Conference on Web Search and Data Mining
Learning video preferences from video content
Proceedings of the 8th international workshop on Multimedia data mining: (associated with the ACM SIGKDD 2007)
ACM Transactions on Information Systems (TOIS)
Pairwise vs global multi-class wrapper feature selection
AIKED'07 Proceedings of the 6th WSEAS International Conference on Artificial Intelligence, Knowledge Engineering and Data Bases - Volume 6
How quickly should communication robots respond?
Proceedings of the 3rd ACM/IEEE international conference on Human robot interaction
Sentiment analysis in multiple languages: Feature selection for opinion classification in Web forums
ACM Transactions on Information Systems (TOIS)
Exploring the characteristics of opinion expressions for political opinion classification
dg.o '08 Proceedings of the 2008 international conference on Digital government research
Text classification: a recent overview
ICCOMP'05 Proceedings of the 9th WSEAS International Conference on Computers
MATH'07 Proceedings of the 12th WSEAS International Conference on Applied Mathematics
Anomaly-based fault detection in pervasive computing system
Proceedings of the 5th international conference on Pervasive services
Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Scaling up text classification for large file systems
Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Stable feature selection via dense feature groups
Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Customer targeting models using actively-selected web content
Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Heterogeneous data fusion for Alzheimer's disease study
Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Quantifying counts and costs via classification
Data Mining and Knowledge Discovery
Feature selection strategies for poorly correlated data: correlation coefficient considered harmful
AIKED'08 Proceedings of the 7th WSEAS International Conference on Artificial intelligence, knowledge engineering and data bases
CWC: A Clustering-Based Feature Weighting Approach for Text Classification
MDAI '07 Proceedings of the 4th international conference on Modeling Decisions for Artificial Intelligence
Text Categorization in Non-linear Semantic Space
AI*IA '07 Proceedings of the 10th Congress of the Italian Association for Artificial Intelligence on AI*IA 2007: Artificial Intelligence and Human-Oriented Computing
Using Intuitionistic Fuzzy Sets in Text Categorization
ICAISC '08 Proceedings of the 9th international conference on Artificial Intelligence and Soft Computing
Automated Classification and Categorization of Mathematical Knowledge
Proceedings of the 9th AISC international conference, the 15th Calculemus symposium, and the 7th international MKM conference on Intelligent Computer Mathematics
A Genetic Algorithm for Text Classification Rule Induction
ECML PKDD '08 Proceedings of the European conference on Machine Learning and Knowledge Discovery in Databases - Part II
Client-Friendly Classification over Random Hyperplane Hashes
ECML PKDD '08 Proceedings of the European conference on Machine Learning and Knowledge Discovery in Databases - Part II
A Survey on Statistical Pattern Feature Extraction
ICIC '08 Proceedings of the 4th international conference on Intelligent Computing: Advanced Intelligent Computing Theories and Applications - with Aspects of Artificial Intelligence
Imbalanced text classification: A term weighting approach
Expert Systems with Applications: An International Journal
Opinion Mining and Sentiment Analysis
Foundations and Trends in Information Retrieval
Exploiting temporal contexts in text classification
Proceedings of the 17th ACM conference on Information and knowledge management
BNS feature scaling: an improved representation over tf-idf for svm text classification
Proceedings of the 17th ACM conference on Information and knowledge management
Predicting web spam with HTTP session information
Proceedings of the 17th ACM conference on Information and knowledge management
Extremely fast text feature extraction for classification and indexing
Proceedings of the 17th ACM conference on Information and knowledge management
Iterative feature construction for improving inductive learning algorithms
Expert Systems with Applications: An International Journal
Exploring the boundary region of tolerance rough sets for feature selection
Pattern Recognition
Text feature selection using ant colony optimization
Expert Systems with Applications: An International Journal
Feature selection for text classification with Naïve Bayes
Expert Systems with Applications: An International Journal
Evaluation of a pervasive game for domestic energy engagement among teenagers
ACE '08 Proceedings of the 2008 International Conference on Advances in Computer Entertainment Technology
Class dependent feature scaling method using naive Bayes classifier for text datamining
Pattern Recognition Letters
Kinesthetic interaction: revealing the bodily potential in interaction design
Proceedings of the 20th Australasian Conference on Computer-Human Interaction: Designing for Habitus and Habitat
Feature selection with dynamic mutual information
Pattern Recognition
Seller's credibility in electronic markets: a complex network based approach
Proceedings of the 3rd workshop on Information credibility on the web
A survey of modern authorship attribution methods
Journal of the American Society for Information Science and Technology
Service Selection in Business Service Ecosystem
Service-Oriented Computing - ICSOC 2008 Workshops
Using pre & post-processing methods to improve binding site predictions
Pattern Recognition
Feature shaping for linear SVM classifiers
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Consensus group stable feature selection
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Multi-domain sentiment classification
HLT-Short '08 Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers
An Iterative Hybrid Filter-Wrapper Approach to Feature Selection for Document Clustering
Canadian AI '09 Proceedings of the 22nd Canadian Conference on Artificial Intelligence: Advances in Artificial Intelligence
Improving "email speech acts" analysis via n-gram selection
ACTS '09 Proceedings of the HLT-NAACL 2006 Workshop on Analyzing Conversations in Text and Speech
Proceedings of the 2006 conference on Advances in Intelligent IT: Active Media Technology 2006
Learning to recognize webpage genres
Information Processing and Management: an International Journal
A Wrapper Method for Feature Selection in Multiple Classes Datasets
IWANN '09 Proceedings of the 10th International Work-Conference on Artificial Neural Networks: Part I: Bio-Inspired Systems: Computational and Ambient Intelligence
A General Framework of Feature Selection for Text Categorization
MLDM '09 Proceedings of the 6th International Conference on Machine Learning and Data Mining in Pattern Recognition
Feature subsumption for opinion analysis
EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Preferential text classification: learning algorithms and evaluation measures
Information Retrieval
Latent Dirichlet Allocation for Automatic Document Categorization
ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part II
Parameter-Free Hierarchical Co-clustering by n-Ary Splits
ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part I
Disambiguation of preposition sense using linguistically motivated features
SRWS '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Student Research Workshop and Doctoral Consortium
PNNL: a supervised maximum entropy approach to word sense disambiguation
SemEval '07 Proceedings of the 4th International Workshop on Semantic Evaluations
Domain adaptation for statistical classifiers
Journal of Artificial Intelligence Research
Avoidance of model re-induction in SVM-based feature selection for text categorization
IJCAI'07 Proceedings of the 20th international joint conference on Artificial intelligence
Feature selection techniques for maximum entropy based biomedical named entity recognition
Journal of Biomedical Informatics
Rank Aggregation Based Text Feature Selection
WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
Stock Price Forecasting by Combining News Mining and Time Series Analysis
WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
Kernel Methods in Computer Vision
Foundations and Trends® in Computer Graphics and Vision
Efficient feature weighting methods for ranking
Proceedings of the 18th ACM conference on Information and knowledge management
Boosting KNN text classification accuracy by using supervised term weighting schemes
Proceedings of the 18th ACM conference on Information and knowledge management
A target-oriented phonotactic front-end for spoken language recognition
IEEE Transactions on Audio, Speech, and Language Processing
Topic-dependent sentiment analysis of financial blogs
Proceedings of the 1st international CIKM workshop on Topic-sentiment analysis for mass opinion
An extensive study on automated Dewey Decimal Classification
Journal of the American Society for Information Science and Technology
Evaluation of a pervasive game for domestic energy engagement among teenagers
Computers in Entertainment (CIE) - SPECIAL ISSUE: Games
Incorporating user behaviors in new word detection
IJCAI'09 Proceedings of the 21st international joint conference on Artificial intelligence
Using some web content mining techniques for Arabic text classification
DNCOCO'09 Proceedings of the 8th WSEAS international conference on Data networks, communications, computers
Identifying fall-related injuries: Text mining the electronic medical record
Information Technology and Management
Discovering the discriminative views: measuring term weights for sentiment analysis
ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1
A novel hybrid ACO-GA algorithm for text feature selection
CEC'09 Proceedings of the Eleventh conference on Congress on Evolutionary Computation
A framework of feature selection methods for text categorization
ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2
Efficient Text Classification Using Term Projection
AIRS '09 Proceedings of the 5th Asia Information Retrieval Symposium on Information Retrieval Technology
Classifying Documents According to Locational Relevance
EPIA '09 Proceedings of the 14th Portuguese Conference on Artificial Intelligence: Progress in Artificial Intelligence
Classifying relations for biomedical named entity disambiguation
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3
Selective enhancement learning in competitive learning
IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Automatically classifying documents by ideological and organizational affiliation
ISI'09 Proceedings of the 2009 IEEE international conference on Intelligence and security informatics
Handling class imbalance problem in cultural modeling
ISI'09 Proceedings of the 2009 IEEE international conference on Intelligence and security informatics
Improved variable and value ranking techniques for mining categorical traffic accident data
Expert Systems with Applications: An International Journal
Ensemble gene selection by grouping for microarray data classification
Journal of Biomedical Informatics
Simulated evaluation of faceted browsing based on feature selection
Multimedia Tools and Applications
Leveraging web streams for contractual situational awareness in operational BI
Proceedings of the 2010 EDBT/ICDT Workshops
Feature selection & dominant feature selection for product reviews using meta-heuristic algorithms
Proceedings of the Third Annual ACM Bangalore Conference
Classification of skewed and homogenous document corpora with class-based and corpus-based keywords
KI'06 Proceedings of the 29th annual German conference on Artificial intelligence
A simple probability based term weighting scheme for automated text classification
IEA/AIE'07 Proceedings of the 20th international conference on Industrial, engineering, and other applications of applied intelligent systems
Does SVM really scale up to large bag of words feature spaces?
IDA'07 Proceedings of the 7th international conference on Intelligent data analysis
Feature selection for ordinal regression
Proceedings of the 2010 ACM Symposium on Applied Computing
Capturing heuristics and intelligent methods for improving micro-array data classification
IDEAL'07 Proceedings of the 8th international conference on Intelligent data engineering and automated learning
Conditional mutual information based feature selection for classification task
CIARP'07 Proceedings of the 12th Iberoamerican conference on Progress in pattern recognition, image analysis and applications
Using typical testors for feature selection in text categorization
CIARP'07 Proceedings of the 12th Iberoamerican conference on Progress in pattern recognition, image analysis and applications
A novel metric for redundant gene elimination based on discriminative contribution
ISBRA'08 Proceedings of the 4th international conference on Bioinformatics research and applications
EvoBIO'08 Proceedings of the 6th European conference on Evolutionary computation, machine learning and data mining in bioinformatics
A wrapper-based feature selection method for ADMET prediction using evolutionary computing
EvoBIO'08 Proceedings of the 6th European conference on Evolutionary computation, machine learning and data mining in bioinformatics
Automatic extraction of domain-specific stopwords from labeled documents
ECIR'08 Proceedings of the 30th European conference on Advances in information retrieval
Multi-labeled Chinese text categorization based on the boosting algorithms
ICNC'09 Proceedings of the 5th international conference on Natural computation
A comparison of data preparation approaches for e-mail categorisation
International Journal of Intelligent Information and Database Systems
Analytical evaluation of term weighting schemes for text categorization
Pattern Recognition Letters
Guest Editorial: Global modeling using local patterns
Data Mining and Knowledge Discovery
Text classification with the support of pruned dependency patterns
Pattern Recognition Letters
Expert Systems with Applications: An International Journal
An information-theoretic framework for semantic-multimedia retrieval
ACM Transactions on Information Systems (TOIS)
Formal and functional assessment of the pyramid method for summary content evaluation
Natural Language Engineering
Quadratic Programming Feature Selection
The Journal of Machine Learning Research
From frequency to meaning: vector space models of semantics
Journal of Artificial Intelligence Research
Cuisine: Classification using stylistic feature sets and/or name-based feature sets
Journal of the American Society for Information Science and Technology
Evidentiality for text trustworthiness detection
NLPLING '10 Proceedings of the 2010 Workshop on NLP and Linguistics: Finding the Common Ground
Improving gender classification of blog authors
EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Selecting keywords for content based recommendation
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Hierarchical auto-tagging: organizing Q&A knowledge for everyone
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
A multi-functional architecture addressing workflow and service challenges using provenance data
PIKM '10 Proceedings of the 3rd workshop on Ph.D. students in information and knowledge management
A novel image retrieval model based on the most relevant features
Knowledge-Based Systems
Sentiment classification and polarity shifting
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Expert Systems with Applications: An International Journal
Launching: university partnership for health informatics
Proceedings of the 1st ACM International Health Informatics Symposium
Mining hot clusters of similar anomalies for system management
PRICAI'10 Proceedings of the 11th Pacific Rim international conference on Trends in artificial intelligence
IEEE Transactions on Information Technology in Biomedicine - Special section on affective and pervasive computing for healthcare
Application classification through monitoring and learning of resource consumption patterns
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
Wireless network deployment configurations: Dwesa marginalized area as a case study
SAICSIT '10 Proceedings of the 2010 Annual Research Conference of the South African Institute of Computer Scientists and Information Technologists
A class-specific ensemble feature selection approach for classification problems
Proceedings of the 48th Annual Southeast Regional Conference
Expert Systems with Applications: An International Journal
Developing an information security program for HIPAA compliance
2009 Information Security Curriculum Development Conference
Toward predicting popularity of social marketing messages
SBP'11 Proceedings of the 4th international conference on Social computing, behavioral-cultural modeling and prediction
Word co-occurrence features for text classification
Information Systems
DEM registration using watershed algorithm and chain coding
COMPUTE '11 Proceedings of the Fourth Annual ACM Bangalore Conference
Entropy based feature selection for text categorization
Proceedings of the 2011 ACM Symposium on Applied Computing
A quantitative diagnostic method based on Bayesian networks in traditional Chinese medicine
ICONIP'06 Proceedings of the 13th international conference on Neural information processing - Volume Part III
A new feature selection algorithm based on binomial hypothesis testing for spam filtering
Knowledge-Based Systems
Multi-domain sentiment classification with classifier combination
Journal of Computer Science and Technology - Special issue on natural language processing
Combination of feature selection methods for text categorisation
ECIR'11 Proceedings of the 33rd European conference on Advances in information retrieval
World vs. method: educational standard formulation impacts document retrieval
Proceedings of the 11th annual international ACM/IEEE joint conference on Digital libraries
Semi-supervised SimHash for efficient document similarity search
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Developing Position Structure-Based Framework for Chinese Entity Relation Extraction
ACM Transactions on Asian Language Information Processing (TALIP)
Automatically tagging email by leveraging other users' folders
Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Feature selection strategies for automated classification of digital media content
Journal of Information Science
ACM SIGSOFT Software Engineering Notes
Unsupervised joint feature discretization and selection
IbPRIA'11 Proceedings of the 5th Iberian conference on Pattern recognition and image analysis
User Behaviors in Related Word Retrieval and New Word Detection: A Collaborative Perspective
ACM Transactions on Asian Language Information Processing (TALIP)
Incorporating game theory in feature selection for text categorization
RSFDGrC'11 Proceedings of the 13th international conference on Rough sets, fuzzy sets, data mining and granular computing
Adaptive machine learning approach for emotional email classification
HCII'11 Proceedings of the 14th international conference on Human-computer interaction: towards mobile and intelligent interaction environments - Volume Part III
Evaluation of feature combination approaches for text categorisation
ISMIS'11 Proceedings of the 19th international conference on Foundations of intelligent systems
Improvements over adaptive local hyperplane to achieve better classification
ICDM'11 Proceedings of the 11th international conference on Advances in data mining: applications and theoretical aspects
Spam detection on Twitter using traditional classifiers
ATC'11 Proceedings of the 8th international conference on Autonomic and trusted computing
Group Profiling for Understanding Social Structures
ACM Transactions on Intelligent Systems and Technology (TIST)
Feature sub-set selection metrics for Arabic text classification
Pattern Recognition Letters
Multiple instance learning for classification of human behavior observations
ACII'11 Proceedings of the 4th international conference on Affective computing and intelligent interaction - Volume Part I
A pairwise ranking based approach to learning with positive and unlabeled examples
Proceedings of the 20th ACM international conference on Information and knowledge management
Online evaluation of email streaming classifiers using GNUsmail
IDA'11 Proceedings of the 10th international conference on Advances in intelligent data analysis X
Oscillating feature subset search algorithm for text categorization
CIARP'06 Proceedings of the 11th Iberoamerican conference on Progress in Pattern Recognition, Image Analysis and Applications
FISA: feature-based instance selection for imbalanced text classification
PAKDD'06 Proceedings of the 10th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining
Feature selection, rule extraction, and score model: making ATC competitive with SVM
RSKT'06 Proceedings of the First international conference on Rough Sets and Knowledge Technology
On the utility of incremental feature selection for the classification of textual data streams
PCI'05 Proceedings of the 10th Panhellenic conference on Advances in Informatics
Weighted average pointwise mutual information for feature selection in text categorization
PKDD'05 Proceedings of the 9th European conference on Principles and Practice of Knowledge Discovery in Databases
The performance analysis of ARM NEON technology for mobile platforms
Proceedings of the 2011 ACM Symposium on Research in Applied Computation
Text categorization with class-based and corpus-based keyword selection
ISCIS'05 Proceedings of the 20th international conference on Computer and Information Sciences
An examination of feature selection frameworks in text categorization
AIRS'05 Proceedings of the Second Asia conference on Asia Information Retrieval Technology
Developing robust models for favourability analysis
WASSA '11 Proceedings of the 2nd Workshop on Computational Approaches to Subjectivity and Sentiment Analysis
Expert Systems with Applications: An International Journal
Feature selection for image categorization
ACCV'06 Proceedings of the 7th Asian conference on Computer Vision - Volume Part II
Predicting high-risk program modules by selecting the right software measurements
Software Quality Control
Counting positives accurately despite inaccurate classification
ECML'05 Proceedings of the 16th European conference on Machine Learning
Techniques for improving the performance of naive Bayes for text classification
CICLing'05 Proceedings of the 6th international conference on Computational Linguistics and Intelligent Text Processing
Expert Systems with Applications: An International Journal
Exploiting parse structures for native language identification
EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
A comparative study of language models for book and author recognition
IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
A comparison of text-categorization methods applied to n-gram frequency statistics
AI'04 Proceedings of the 17th Australian joint conference on Advances in Artificial Intelligence
Data mining techniques for the screening of age-related macular degeneration
Knowledge-Based Systems
A platform for situational awareness in operational BI
Decision Support Systems
A survey on feature extraction for pattern recognition
Artificial Intelligence Review
Sentence-Level attachment prediction
IRFC'10 Proceedings of the First international Information Retrieval Facility conference on Adbances in Multidisciplinary Retrieval
An empirical study on the feature's type effect on the automatic classification of arabic documents
CICLing'10 Proceedings of the 11th international conference on Computational Linguistics and Intelligent Text Processing
Evolutionary search of optimal features
IDEAL'06 Proceedings of the 7th international conference on Intelligent Data Engineering and Automated Learning
Lexical entailment for information retrieval
ECIR'06 Proceedings of the 28th European conference on Advances in Information Retrieval
Feature selection for dimensionality reduction
SLSFS'05 Proceedings of the 2005 international conference on Subspace, Latent Structure and Feature Selection
Feature selection in text categorization based on cloud model
Proceedings of the 6th International Conference on Ubiquitous Information Management and Communication
Feature selection for MAUC-oriented classification systems
Neurocomputing
Malware characteristics and threats on the internet ecosystem
Journal of Systems and Software
Analyzing Online Review Helpfulness Using a Regressional ReliefF-Enhanced Text Mining Method
ACM Transactions on Management Information Systems (TMIS)
An unsupervised approach to feature discretization and selection
Pattern Recognition
Representation models for text classification: a comparative analysis over three web document types
Proceedings of the 2nd International Conference on Web Intelligence, Mining and Semantics
A new search engine integrating hierarchical browsing and keyword search
IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
A two-stage feature selection method for text categorization
Computers & Mathematics with Applications
Feature selection for optimizing traffic classification
Computer Communications
Computer Methods and Programs in Biomedicine
A novel feature selection method based on normalized mutual information
Applied Intelligence
The Journal of Supercomputing
Fog computing and its role in the internet of things
Proceedings of the first edition of the MCC workshop on Mobile cloud computing
Efficient feature selection filters for high-dimensional data
Pattern Recognition Letters
CybercrimeIR --- a technological perspective to fight cybercrime
PAISI'12 Proceedings of the 2012 Pacific Asia conference on Intelligence and Security Informatics
Text categorization based on fuzzy soft set theory
ICCSA'12 Proceedings of the 12th international conference on Computational Science and Its Applications - Volume Part IV
Integrating end-users to the design process through design competitions
DPPI '11 Proceedings of the 2011 Conference on Designing Pleasurable Products and Interfaces
A global-ranking local feature selection method for text categorization
Expert Systems with Applications: An International Journal
Classifying Vietnamese disease outbreak reports with important sentences and rich features
Proceedings of the Third Symposium on Information and Communication Technology
Features' weight learning towards improved query classification
AIS'12 Proceedings of the Third international conference on Autonomous and Intelligent Systems
Measuring stability of feature ranking techniques: a noise-based approach
International Journal of Business Intelligence and Data Mining
Persian text classification based on K-NN using wordnet
IEA/AIE'12 Proceedings of the 25th international conference on Industrial Engineering and Other Applications of Applied Intelligent Systems: advanced research in applied artificial intelligence
Process-Specific Information for Learning Electronic Negotiation Outcomes
Fundamenta Informaticae
Identifying the semantic orientation of terms using S-HAL for sentiment analysis
Knowledge-Based Systems
An optimal approach of load balancing for grid computing
Proceedings of the CUBE International Information Technology Conference
Document-level sentiment classification: An empirical comparison between SVM and ANN
Expert Systems with Applications: An International Journal
Lip peripheral motion for visual surveillance
Proceedings of the Fifth International Conference on Security of Information and Networks
A variance reduction framework for stable feature selection
Statistical Analysis and Data Mining
langid.py: an off-the-shelf language identification tool
ACL '12 Proceedings of the ACL 2012 System Demonstrations
A novel probabilistic feature selection method for text classification
Knowledge-Based Systems
Feature selection based on term frequency and T-test for text categorization
Proceedings of the 21st ACM international conference on Information and knowledge management
A new histogram-based breast cancer image classifier using Gaussian mixture model
Proceedings of the 2012 ACM Research in Applied Computation Symposium
Designing playful interactive installations for urban environments --- the swingscape experience
ACE'12 Proceedings of the 9th international conference on Advances in Computer Entertainment
Discriminative feature analysis and selection for document classification
ICONIP'12 Proceedings of the 19th international conference on Neural Information Processing - Volume Part I
A pragmatic approach for sustainable development based on semantic web services
Proceedings of the 14th International Conference on Information Integration and Web-based Applications & Services
Parameter-less co-clustering for star-structured heterogeneous data
Data Mining and Knowledge Discovery
Toward the scalability of neural networks through feature selection
Expert Systems with Applications: An International Journal
Blog topic analysis using TF smoothing and LDA
Proceedings of the 7th International Conference on Ubiquitous Information Management and Communication
Categorical proportional difference: a feature selection method for text categorization
AusDM '08 Proceedings of the 7th Australasian Data Mining Conference - Volume 87
Using micro-documents for feature selection: The case of ordinal text classification
Expert Systems with Applications: An International Journal
Comparison of text feature selection policies and using an adaptive framework
Expert Systems with Applications: An International Journal
International Journal of Advanced Media and Communication
Understanding latency variations of black box services
Proceedings of the 22nd international conference on World Wide Web
Questions about questions: an empirical analysis of information needs on Twitter
Proceedings of the 22nd international conference on World Wide Web
Improving the real-time performance of heterogeneous extremely large datasets
Proceedings of the 17th Panhellenic Conference on Informatics
Semantic dispatching of multimedia news with MEWS
Proceedings of the 21st ACM international conference on Multimedia
Insights in global public spending
Proceedings of the 9th International Conference on Semantic Systems
Modificatory provisions detection: a hybrid NLP approach
Proceedings of the Fourteenth International Conference on Artificial Intelligence and Law
Enhancing financial performance with social media: An impression management perspective
Decision Support Systems
Sentiment classification of web review using association rules
OCSC'13 Proceedings of the 5th international conference on Online Communities and Social Computing
SVOIS: Support Vector Oriented Instance Selection for text classification
Information Systems
The impact of preprocessing on text classification
Information Processing and Management: an International Journal
Robust feature selection based on regularized brownboost loss
Knowledge-Based Systems
Hierarchical co-clustering: off-line and incremental approaches
Data Mining and Knowledge Discovery
Analyzing uncertainties of probabilistic rough set regions with game-theoretic rough sets
International Journal of Approximate Reasoning
An improved boosting based on feature selection for corporate bankruptcy prediction
Expert Systems with Applications: An International Journal
Sentiment classification: The contribution of ensemble learning
Decision Support Systems
A survey on feature selection methods
Computers and Electrical Engineering
Feature selection for ordinal text classification
Neural Computation
Evolutionary instance selection for text classification
Journal of Systems and Software
A scatter method for data and variable importance evaluation
Integrated Computer-Aided Engineering
A model for mining material properties for radiation shielding
Integrated Computer-Aided Engineering
Feature ranking fusion for text classifier
Intelligent Data Analysis
A novel feature subset selection algorithm based on association rule mining
Intelligent Data Analysis
BizPro: Extracting and categorizing business intelligence factors from textual news articles
International Journal of Information Management: The Journal for Information Professionals
Machine learning for text classification is the cornerstone of document categorization, news filtering, document routing, and personalization. In text domains, effective feature selection is essential to make the learning task efficient and more accurate. This paper presents an empirical comparison of twelve feature selection methods (e.g., Information Gain) evaluated on a benchmark of 229 text classification problem instances gathered from Reuters, TREC, OHSUMED, etc. The results are analyzed from multiple goal perspectives (accuracy, F-measure, precision, and recall), since each is appropriate in different situations. The results reveal that a new feature selection metric, which we call 'Bi-Normal Separation' (BNS), outperformed the others by a substantial margin in most situations. This margin widened in tasks with high class skew, which is rampant in text classification problems and is particularly challenging for induction algorithms. A new evaluation methodology is offered that focuses on the needs of the data mining practitioner faced with a single dataset who seeks to choose one metric (or a pair of metrics) most likely to yield the best performance. From this perspective, BNS was the top single choice for all goals except precision, for which Information Gain yielded the best result most often. This analysis also revealed, for example, that Information Gain and Chi-Squared have correlated failures, and so they work poorly together. When choosing optimal pairs of metrics for each of the four performance goals, BNS is consistently a member of the pair; e.g., for greatest recall, the pair BNS + F1-measure yielded the best performance on the greatest number of tasks by a considerable margin.
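The abstract names Bi-Normal Separation but does not define it. As a rough illustration only, the sketch below uses the commonly cited form of the metric: the absolute difference between the inverse standard normal CDF of a feature's true positive rate and that of its false positive rate. The function name `bns_score`, its argument names, and the clipping constant `eps` are assumptions made for this example and are not taken from the text above.

```python
from statistics import NormalDist

# Standard normal distribution (mean 0, standard deviation 1).
_STD_NORMAL = NormalDist()


def bns_score(tp, fp, pos, neg, eps=0.0005):
    """Score one feature with a Bi-Normal Separation style metric.

    tp  -- number of positive documents that contain the feature
    fp  -- number of negative documents that contain the feature
    pos -- total number of positive documents
    neg -- total number of negative documents
    eps -- assumed clipping bound that keeps both rates away from 0 and 1,
           where the inverse normal CDF is undefined
    """
    if pos <= 0 or neg <= 0:
        raise ValueError("pos and neg must both be positive")
    # Clip the rates into [eps, 1 - eps] before applying the inverse CDF.
    tpr = min(max(tp / pos, eps), 1 - eps)
    fpr = min(max(fp / neg, eps), 1 - eps)
    return abs(_STD_NORMAL.inv_cdf(tpr) - _STD_NORMAL.inv_cdf(fpr))


if __name__ == "__main__":
    # A feature seen in 90% of positives and 10% of negatives should score
    # higher than one seen in 60% of positives and 40% of negatives.
    print(bns_score(90, 10, 100, 100))
    print(bns_score(60, 40, 100, 100))
```

Under this form, a feature that occurs equally often in both classes scores zero, and the score is symmetric, so a feature that strongly indicates the negative class ranks as highly as one that strongly indicates the positive class.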