Algorithms for clustering data
Algorithms for clustering data
A guided tour of Chernoff bounds
Information Processing Letters
C4.5: programs for machine learning
C4.5: programs for machine learning
Collaborative interface agents
AAAI '94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 1)
Information filtering based on user behavior analysis and best match text retrieval
SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
PICS: Internet access controls without censorship
Communications of the ACM
Fab: content-based, collaborative recommendation
Communications of the ACM
Bayesian classification (AutoClass): theory and results
Advances in knowledge discovery and data mining
Concept features in Re:Agent, an intelligent Email agent
AGENTS '98 Proceedings of the second international conference on Autonomous agents
Finding salient features for personal Web page categories
Selected papers from the sixth international conference on World Wide Web
Decision Support Systems - From information retrieval to knowledge management: enabling technologies and best practices
Partitioning-based clustering for Web document categorization
Decision Support Systems - Special issue on WITS '97
Filtering objectionable internet content
ICIS '99 Proceedings of the 20th international conference on Information Systems
Explaining collaborative filtering recommendations
CSCW '00 Proceedings of the 2000 ACM conference on Computer supported cooperative work
Getting to know you: learning new user preferences in recommender systems
Proceedings of the 7th international conference on Intelligent user interfaces
Introduction to Modern Information Retrieval
Introduction to Modern Information Retrieval
Neural Networks for Web Content Filtering
IEEE Intelligent Systems
Classifying Objectionable Websites Based on Image Content
IDMS '98 Proceedings of the 5th International Workshop on Interactive Distributed Multimedia Systems and Telecommunication Services
Bayesian Models for Early Warning of Bank Failures
Management Science
Letizia: an agent that assists web browsing
IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 1
Syskill & webert: Identifying interesting web sites
AAAI'96 Proceedings of the thirteenth national conference on Artificial intelligence - Volume 1
Artificial Intelligence in Medicine
Internet content filtering using isotonic separation on content category ratings
ACM Transactions on Internet Technology (TOIT)
Information Processing and Management: an International Journal
Combining Classifiers for Web Violent Content Detection and Filtering
ICCS '07 Proceedings of the 7th international conference on Computational Science, Part III: ICCS 2007
WebAngels Filter: A Violent Web Filtering Engine Using Textual and Structural Content-Based Analysis
ICDM '08 Proceedings of the 8th industrial conference on Advances in Data Mining: Medical Applications, E-Commerce, Marketing, and Theoretical Aspects
A two-stage decision model for information filtering
Decision Support Systems
Mining search intents for collaborative cyberporn filtering
Journal of the American Society for Information Science and Technology
Web objectionable text content detection using topic modeling technique
Expert Systems with Applications: An International Journal
A social network-empowered research analytics framework for project selection
Decision Support Systems
Hi-index | 0.00 |
Various entities (e.g., parents, employers) that provide users (e.g., children, employees) access to web content wish to limit the content accessed through those computers. Available filtering methods are crude in that they too often block "acceptable" content while failing to block "unacceptable" content. This paper presents a general and flexible classification method based on statistical techniques applied to text material, that we call, Filtering by Statistical Classification (FSC). According to each individual entity's expressed opinions about what content in a training data set is or is not acceptable, FSC constructs a customized model to represent each individual entity's preferences. FSC then uses this customized model to examine new web content and to block unwanted content. The empirical results suggest that our method has greater predictive power than do a variety of existing approaches.