A Semantic Triplet Based Story Classifier

  • Authors:
  • Betul Ceran;Ravi Karad;Ajay Mandvekar;Steven R. Corman;Hasan Davulcu

  • Affiliations:
  • -;-;-;-;-

  • Venue:
  • ASONAM '12 Proceedings of the 2012 International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2012)
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

A story is defined as "an actor(s) taking action(s) that culminates in a resolution(s)." In this paper, we investigate the utility of standard keyword based features, statistical features based on shallow-parsing (such as density of POS tags and named entities), and a new set of semantic features to develop a story classifier. This classifier is trained to identify a paragraph as a "story," if the paragraph contains mostly story(ies). Training data is a collection of expert-coded story and non-story paragraphs from RSS feeds from a list of extremist web sites. Our proposed semantic features are based on suitable aggregation and generalization of $$ triplets that can be extracted using a parser. Experimental results show that a model of statistical features alongside memory-based semantic linguistic features achieves the best accuracy with a Support Vector Machine (SVM) based classifier.