Acquisition of a classification model for a risk search system from unbalanced textual examples

  • Authors:
  • Shigeaki Sakurai; Ryohei Orihara

  • Affiliations:
  • Corporate Research and Development Center, Toshiba Corporation, 1, Komukai-Toshiba-cho, Saiwai-ku, Kawasaki 212-8582, Japan; Corporate Research and Development Center, Toshiba Corporation, 1, Komukai-Toshiba-cho, Saiwai-ku, Kawasaki 212-8582, Japan

  • Venue:
  • International Journal of Business Intelligence and Data Mining
  • Year:
  • 2009

Abstract

This paper proposes a method that acquires a more appropriate classification model for a risk search system analysing corporate reputation information included in bulletin board sites. The method inductively acquires the model from textual examples composed of many negative examples and a few positive examples. It selects two kinds of important negative examples by referring to expressions related to a specific label. Here, the label represents the contents of the papers. Finally, the method uses the selected negative examples and all the positive examples to acquire the model. The paper verifies the effectiveness of the method through comparative experiments.
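The selection step described in the abstract can be illustrated with a small sketch. The abstract does not say what the two kinds of important negative examples are, so the sketch below makes its own assumption purely for illustration: kind 1 is negatives that contain a label-related expression, and kind 2 is a small random sample of the remaining negatives. The function name `select_training_examples`, the parameter `n_background`, and the toy data are all hypothetical and are not taken from the paper.

```python
import random


def select_training_examples(examples, label_expressions, n_background=2, seed=0):
    """Reduce an unbalanced training set.

    examples: list of (text, is_positive) pairs.
    label_expressions: lowercase expressions assumed to relate to the label.
    Returns all positives plus a reduced set of negatives.
    """
    rng = random.Random(seed)
    positives = [ex for ex in examples if ex[1]]
    negatives = [ex for ex in examples if not ex[1]]

    def mentions_label(text):
        lowered = text.lower()
        return any(expr in lowered for expr in label_expressions)

    # Assumed kind 1: negatives that contain a label-related expression.
    related = [ex for ex in negatives if mentions_label(ex[0])]
    # Assumed kind 2: a small random sample of the remaining negatives.
    unrelated = [ex for ex in negatives if not mentions_label(ex[0])]
    background = rng.sample(unrelated, min(n_background, len(unrelated)))

    # Every positive example is kept, as the abstract states.
    return positives + related + background


# Toy bulletin-board posts (invented for this sketch).
examples = [
    ("The product recall was handled badly", True),
    ("No recall announced this quarter", False),
    ("Nice weather today", False),
    ("Lunch was good", False),
    ("Stock price is flat", False),
    ("Anyone watching the game", False),
]
selected = select_training_examples(examples, ["recall"], n_background=2)
```

The reduced set `selected` would then be passed to whatever inductive learner is used to acquire the classification model; the choice of learner is outside the scope of this sketch.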