Machine Learning
Efficient noise-tolerant learning from statistical queries
Journal of the ACM (JACM)
Fast training of support vector machines using sequential minimal optimization
Advances in kernel methods
Least Squares Support Vector Machine Classifiers
Neural Processing Letters
An introduction to support Vector Machines: and other kernel-based learning methods
An introduction to support Vector Machines: and other kernel-based learning methods
A streaming ensemble algorithm (SEA) for large-scale classification
Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond
Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond
Phoenix: a parallel programming model for accommodating dynamically joining/leaving resources
Proceedings of the ninth ACM SIGPLAN symposium on Principles and practice of parallel programming
A parallel solver for large quadratic programs in training support vector machines
Parallel Computing - Special issue: Parallel computing in numerical optimization
The Journal of Machine Learning Research
Classifying large data sets using SVMs with hierarchical clusters
Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
An evaluation of statistical spam filtering techniques
ACM Transactions on Asian Language Information Processing (TALIP)
Predictive low-rank decomposition for kernel methods
ICML '05 Proceedings of the 22nd international conference on Machine learning
IMSCCS '06 Proceedings of the First International Multi-Symposiums on Computer and Computational Sciences - Volume 1 (IMSCCS'06) - Volume 01
Google news personalization: scalable online collaborative filtering
Proceedings of the 16th international conference on World Wide Web
Proceedings of the nineteenth annual ACM symposium on Parallel algorithms and architectures
Efficient Spam Email Filtering using Adaptive Ontology
ITNG '07 Proceedings of the International Conference on Information Technology
Fast support vector machine training and classification on graphics processors
Proceedings of the 25th international conference on Machine learning
Class Noise Mitigation Through Instance Weighting
ECML '07 Proceedings of the 18th European conference on Machine Learning
Semantic Analysis of User Behaviors for Detecting Spam Mail
IWSCA '08 Proceedings of the 2008 IEEE International Workshop on Semantic Computing and Applications
Mars: a MapReduce framework on graphics processors
Proceedings of the 17th international conference on Parallel architectures and compilation techniques
Building a Scalable Collaborative Web Filter with Free and Open Source Software
SITIS '08 Proceedings of the 2008 IEEE International Conference on Signal Image Technology and Internet Based Systems
Empirical analysis of support vector machine ensemble classifiers
Expert Systems with Applications: An International Journal
Improved spam filtering by extraction of information from text embedded image e-mail
Proceedings of the 2009 ACM symposium on Applied Computing
A comparison of approaches to large-scale data analysis
Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
A survey of learning-based techniques of email spam filtering
Artificial Intelligence Review
SMOTE: synthetic minority over-sampling technique
Journal of Artificial Intelligence Research
The WEKA data mining software: an update
ACM SIGKDD Explorations Newsletter
Language-model-based detection cascade for efficient classification of image-based spam e-mail
ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
Data Mining: Practical Machine Learning Tools and Techniques
Data Mining: Practical Machine Learning Tools and Techniques
A survey of emerging approaches to spam filtering
ACM Computing Surveys (CSUR)
Fast Modular network implementation for support vector machines
IEEE Transactions on Neural Networks
Parallel sequential minimal optimization for the training of support vector machines
IEEE Transactions on Neural Networks
Hi-index | 0.01 |
Spam, under a variety of shapes and forms, continues to inflict increased damage. Varying approaches including Support Vector Machine (SVM) techniques have been proposed for spam filter training and classification. However, SVM training is a computationally intensive process. This paper presents a MapReduce based parallel SVM algorithm for scalable spam filter training. By distributing, processing and optimizing the subsets of the training data across multiple participating computer nodes, the parallel SVM reduces the training time significantly. Ontology semantics are employed to minimize the impact of accuracy degradation when distributing the training data among a number of SVM classifiers. Experimental results show that ontology based augmentation improves the accuracy level of the parallel SVM beyond the original sequential counterpart.