Organizations are becoming increasingly aware of Internet abuse in the workplace. Such abuse results in lost worker productivity, network congestion, security risks, and legal liabilities. To address this problem, organizations have started to adopt Internet usage policies, management training, and filtering software. Several commercial Internet filters are seeing growing organizational adoption. These products rely mainly on blacklists, whitelists, and keyword/profile matching to filter out undesired web pages. In this paper, we describe three top-ranked commercial Internet filters (CYBERSitter, Net Nanny, and CyberPatrol) and evaluate their performance in the context of an Internet abuse problem. We then propose a text mining approach to address the problem and evaluate its performance using six different classification algorithms: naive Bayes, multinomial naive Bayes, support vector machine, decision tree, k-nearest neighbor, and neural network. The evaluation results point to the perils of using commercial Internet filters on the one hand and to the prospects of using text mining on the other: the proposed text mining approach outperforms the commercial filters. We discuss possible reasons for the relatively poor performance of the filters and steps that could be taken to improve their performance.
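To make the contrast between keyword matching and a learned text classifier concrete, the following is a minimal sketch in pure Python. It is not the paper's implementation: the abstract does not specify features, preprocessing, or training data, so the toy pages, the labels `abuse` and `work`, the blocked-term list, and all function and class names here are hypothetical and chosen only for illustration. The classifier is a plain multinomial naive Bayes with add-one (Laplace) smoothing, one of the six algorithm families named above.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def keyword_filter(text, blocked_terms):
    """Keyword matching: flag the page if any blocked term appears in it."""
    return any(token in blocked_terms for token in tokenize(text))


class MultinomialNaiveBayes:
    """Multinomial naive Bayes over word counts with add-one smoothing."""

    def fit(self, docs, labels):
        self.classes = sorted(set(labels))
        self.doc_counts = Counter(labels)  # documents per class
        self.term_counts = {c: Counter() for c in self.classes}
        for doc, label in zip(docs, labels):
            self.term_counts[label].update(tokenize(doc))
        # Vocabulary is the union of terms seen in any class.
        self.vocab = set().union(*self.term_counts.values())
        return self

    def log_score(self, text, label):
        """Log prior plus summed log likelihoods of in-vocabulary tokens."""
        counts = self.term_counts[label]
        denom = sum(counts.values()) + len(self.vocab)
        score = math.log(self.doc_counts[label] / sum(self.doc_counts.values()))
        for token in tokenize(text):
            if token in self.vocab:  # unseen words are ignored
                score += math.log((counts[token] + 1) / denom)
        return score

    def predict(self, text):
        return max(self.classes, key=lambda c: self.log_score(text, c))


# Hypothetical toy training pages (invented for illustration only).
train_docs = [
    "online poker casino betting bonus",
    "live casino jackpot slots betting",
    "quarterly sales report and budget forecast",
    "project meeting agenda and budget review",
]
train_labels = ["abuse", "abuse", "work", "work"]

blocked_terms = {"casino", "poker"}  # assumed keyword list
model = MultinomialNaiveBayes().fit(train_docs, train_labels)

work_page = "budget review of casino licensing report"
abuse_page = "free poker bonus and live betting"

for page in (work_page, abuse_page):
    print(page, keyword_filter(page, blocked_terms), model.predict(page))
```

On this toy data, the keyword filter flags the work-related page because it contains the word "casino", whereas the classifier weighs all the words on the page and labels it `work`. This illustrates one plausible way keyword matching can overblock; it says nothing about how the commercial products evaluated in the paper are actually implemented.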