Internet-scale collection of human-reviewed data

  • Authors: Qi Su; Dmitry Pavlov; Jyh-Herng Chow; Wendell C. Baker

  • Affiliations: Yahoo! Inc, Sunnyvale, CA (all four authors)

  • Venue: Proceedings of the 16th international conference on World Wide Web
  • Year: 2007

Abstract

Enterprise and web data processing and content aggregation systems often require extensive use of human-reviewed data (e.g., for training and monitoring machine-learning-based applications). Today these needs are often met by in-house efforts or outsourced offshore contracting. Emerging applications attempt to provide automated collection of human-reviewed data at Internet scale. We conduct extensive experiments to study the effectiveness of one such application. We also study the feasibility of using Yahoo! Answers, a general question-answering forum, for human-reviewed data collection.