WebGuard: A Web Filtering Engine Combining Textual, Structural, and Visual Content-Based Analysis
IEEE Transactions on Knowledge and Data Engineering
The Role of URLs in Objectionable Web Content Categorization
WI '06 Proceedings of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence
Behavioral classification on the click graph
Proceedings of the 17th international conference on World Wide Web
Information Processing and Management: an International Journal
Collaborative blacklist generation via searches-and-clicks
Proceedings of the 20th ACM international conference on Information and knowledge management
Mining search intents for collaborative cyberporn filtering
Journal of the American Society for Information Science and Technology
Web objectionable text content detection using topic modeling technique
Expert Systems with Applications: An International Journal
Objectionable content filtering by click-through data
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Hi-index | 0.00 |
This paper presents a user intent method to generate blacklists for collaborative cyberporn filtering. A novel porn detection framework that finds new pornographic web pages by mining user search behaviors is proposed. It employs users' clicks in search query logs to select the suspected web pages without extra human efforts to label data for training, and determines their categories with the help of URL host name and path information, but without web page content. We adopt an MSN porn data set to explore the effectiveness of our method. This user intent approach achieves high precision, while maintaining favorably low false positive rate. In addition, real-life filtering simulation reveals that our user intent method with its accumulative update strategy achieves 43.36% of blocking rate, while maintaining a steadily less than 7% of over-blocking rate.