Measuring peculiarity of text using relation between words on the web

  • Authors:
  • Takeru Nakabayashi;Takayuki Yumoto;Manabu Nii;Yutaka Takahashi;Kazutoshi Sumiya

  • Affiliations:
  • School of Engineering, University of Hyogo, Himeji, Hyogo, Japan;Graduate School of Engineering, University of Hyogo, Himeji, Hyogo, Japan;Graduate School of Engineering, University of Hyogo, Himeji, Hyogo, Japan;Graduate School of Engineering, University of Hyogo, Himeji, Hyogo, Japan;School of Human Science and Environment, University of Hyogo, Himeji, Hyogo, Japan

  • Venue:
  • ICADL'10 Proceedings of the role of digital libraries in a time of global change, and 12th international conference on Asia-Pacific digital libraries
  • Year:
  • 2010

Abstract

We define the peculiarity of a text as a metric of information credibility: the higher the peculiarity, the lower the credibility. We extract a theme word and characteristic words from the text and check whether a subject-description relation holds between the theme word and each characteristic word. Peculiarity is defined using the ratio of subject-description relations between the theme word and the characteristic words. We evaluate the extent to which peculiarity can be used to judge credibility by classifying texts from Wikipedia and Uncyclopedia according to their peculiarity.
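
The abstract does not give the exact formula, so the following is only a minimal sketch of one plausible reading: peculiarity is taken as the fraction of characteristic words that do not stand in a subject-description relation with the theme word, so that a higher score corresponds to lower credibility. The function name `peculiarity`, the `has_relation` callback, and the toy relation table are all assumptions made for illustration. In the paper the relation check is based on the web, which is replaced here by a hard-coded lookup.

```python
from typing import Callable, Iterable


def peculiarity(
    theme: str,
    characteristic_words: Iterable[str],
    has_relation: Callable[[str, str], bool],
) -> float:
    """Fraction of characteristic words lacking a relation to the theme.

    Assumed reading of the abstract, not the authors' exact definition.
    """
    words = list(characteristic_words)
    if not words:
        # No characteristic words: nothing to judge, treat as not peculiar.
        return 0.0
    related = sum(1 for w in words if has_relation(theme, w))
    return 1.0 - related / len(words)


# Hypothetical stand-in for the web-based relation check.
KNOWN_PAIRS = {("cat", "animal"), ("cat", "pet")}


def toy_relation(theme: str, word: str) -> bool:
    return (theme, word) in KNOWN_PAIRS


score = peculiarity("cat", ["animal", "pet", "spaceship", "senator"], toy_relation)
print(score)
```

Under this reading, a text would be classified as low-credibility when its score exceeds some threshold. The abstract does not state such a threshold, so it would have to be chosen empirically.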