Real life, real users, and real needs: a study and analysis of user queries on the web
Information Processing and Management: an International Journal
Web searching for sexual information: an exploratory study
Information Processing and Management: an International Journal
Hourly analysis of a very large topically categorized web query log
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
WebGuard: A Web Filtering Engine Combining Textual, Structural, and Visual Content-Based Analysis
IEEE Transactions on Knowledge and Data Engineering
Naked image detection based on adaptive and extensible skin color model
Pattern Recognition
The Role of URLs in Objectionable Web Content Categorization
WI '06 Proceedings of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence
Free Speech and Child Protection on the Web
IEEE Internet Computing
Behavioral classification on the click graph
Proceedings of the 17th international conference on World Wide Web
Information Processing and Management: an International Journal
Detecting pornographic video content by combining image features with motion information
MM '09 Proceedings of the 17th ACM international conference on Multimedia
Patch-based skin color detection and its application to pornography image filtering
Proceedings of the 19th international conference on World wide web
Collaborative cyberporn filtering with collective intelligence
Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
An intelligent categorization engine for bilingual web content filtering
IEEE Transactions on Multimedia
Objectionable content filtering by click-through data
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Hi-index | 0.00 |
This paper presents an intent conformity model to collaboratively generate blacklists for cyberporn filtering. A novel porn detection framework via searches-and-clicks is proposed to explore collective intelligence embedded in query logs. Firstly, the clicked pages are represented in terms of the weighted queries to reflect the degrees related to pornography. Consequently, these weighted queries are regarded as discriminative features to calculate the pornography indicator by an inverse chi-square method for candidate determination. Finally, a candidate whose URL contains at least one pornographic keyword is included in our collaborative blacklists. The experiments on a MSN porn data set indicate that the generated blacklist achieves a high precision, while maintaining a favorably low false positive rate. In addition, real-life filtering simulations reveal that our blacklist is more effective than some publicly released blacklists.