Analysis of textual data with multiple classes

  • Authors:
  • Shigeaki Sakurai;Chong Goh;Ryohei Orihara

  • Affiliations:
  • Corporate Research & Development Center, Toshiba Corporation;Corporate Research & Development Center, Toshiba Corporation;Corporate Research & Development Center, Toshiba Corporation

  • Venue:
  • ISMIS'05 Proceedings of the 15th international conference on Foundations of Intelligent Systems
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper proposes a new method for analyzing textual data. The method deals with items of textual data, where each item includes various viewpoints and each viewpoint is regarded as a class. The method inductively acquires classification models for 2-class classification tasks from items labeled by multiple classes. The method infers classes of new items by using these models. Lastly, the method extracts important expressions from new items in each class and extracts characteristic expressions by comparing the frequency of expressions. This paper applies the method to questionnaire data described by guests at a hotel and verifies its effect through numerical experiments.