Sensitive webpage classification for content advertising

  • Authors:
  • Xin Jin;Ying Li;Teresa Mah;Jie Tong

  • Affiliations:
  • Microsoft Corporation, Redmond, WA;Microsoft Corporation, Redmond, WA;Microsoft Corporation, Redmond, WA;Microsoft Corporation, Redmond, WA

  • Venue:
  • Proceedings of the 1st international workshop on Data mining and audience intelligence for advertising
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Online advertising has been a popular topic in recent years. In this paper, we address one of the important problems in online advertising, i.e., how to detect whether a publisher webpage contains sensitive content and is appropriate for showing advertisement(s) on it. We take a webpage classification approach to solve this problem. First we design a unique sensitive content taxonomy. Then we adopt an iterative training data collection and classifier building approach, to build a hierarchical classifier which can classify webpages into one of the nodes in the sensitive content taxonomy. The experimental result show that using this approach, we are able to build a unique sensitive content classifier with decent accuracy while only requiring limited amount of human labeling effort.