Along with the ever-growing Web comes a proliferation of objectionable content involving sex, violence, racism, and the like, so efficient tools for classifying and filtering undesirable Web content are needed. In this paper, we investigate this problem and describe WebGuard, an automatic machine learning-based system for classifying and filtering pornographic Web sites. Most commercial filtering products rely mainly on textual content-based analysis, such as detecting indicative keywords or checking manually collected blacklists. WebGuard instead applies several major data mining techniques to textual and structural content-based analysis, together with skin color-based visual content analysis. Experiments on a testbed of 400 Web sites, comprising 200 adult sites and 200 nonpornographic ones, demonstrated WebGuard's filtering effectiveness: it reached a classification accuracy of 97.4 percent when textual and structural content-based analysis was combined with visual content-based analysis. In further experiments on a blacklist of 12,311 adult Web sites manually collected and classified by the French Ministry of Education, WebGuard achieved a classification accuracy of 95.62 percent. The basic framework of WebGuard can be applied to other Web site categorization problems that combine textual and visual content, as most do today.
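To make the idea of combining textual and skin color-based visual evidence concrete, the following is a minimal sketch under stated assumptions, not WebGuard's actual method. The skin test is a commonly used explicit RGB rule swapped in for illustration, and the function names, the keyword set, the weight, and the threshold are all hypothetical; the abstract does not specify any of them.

```python
from typing import Iterable, Set, Tuple

Pixel = Tuple[int, int, int]  # (R, G, B), each channel in 0..255


def is_skin(pixel: Pixel) -> bool:
    """Explicit RGB skin rule (an illustrative heuristic, not WebGuard's model)."""
    r, g, b = pixel
    return (
        r > 95 and g > 40 and b > 20
        and max(r, g, b) - min(r, g, b) > 15
        and abs(r - g) > 15
        and r > g and r > b
    )


def skin_ratio(pixels: Iterable[Pixel]) -> float:
    """Fraction of pixels that the rule above labels as skin."""
    pixel_list = list(pixels)
    if not pixel_list:
        return 0.0
    return sum(is_skin(p) for p in pixel_list) / len(pixel_list)


def keyword_ratio(words: Iterable[str], indicative: Set[str]) -> float:
    """Fraction of words found in a (hypothetical) indicative keyword set."""
    word_list = [w.lower() for w in words]
    if not word_list:
        return 0.0
    return sum(w in indicative for w in word_list) / len(word_list)


def combined_score(text_score: float, visual_score: float,
                   text_weight: float = 0.6) -> float:
    """Weighted average of the two scores; the weight is an assumption."""
    return text_weight * text_score + (1.0 - text_weight) * visual_score


def is_objectionable(text_score: float, visual_score: float,
                     threshold: float = 0.3) -> bool:
    """Flag a page when the combined score reaches a hypothetical threshold."""
    return combined_score(text_score, visual_score) >= threshold
```

A real system would learn the decision boundary from labeled sites rather than fix a weight and threshold by hand; the sketch only shows how the two kinds of evidence might feed a single decision.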