Semi-supervised learning for relation extraction in Vietnamese text

  • Authors:
  • Rathany Chan Sam;Huong Thanh Le;Thuy Thanh Nguyen;Dung Anh Le;Ngoc Minh Thi Nguyen

  • Affiliations:
  • Hanoi University of Science and Technology;Hanoi University of Science and Technology;University of Engineering and Technology, Hanoi, VietNam;Hanoi University of Science and Technology;Hanoi University of Science and Technology

  • Venue:
  • Proceedings of the Second Symposium on Information and Communication Technology
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Relation extraction (RE) is the task of finding semantic relations between entities from text. As the supervised learning method requires a large amount of labeled training data, the semi-supervised learning method is the topics of interest. This paper presents a semi-supervised learning approach to relation extraction for Vietnamese text using bootstrapping. As the accuracy of syntactic parsing in Vietnamese text is still not high, we used Shallow Linguistic Kernel (SLK) which combines global kernel and local kernel to present sentences. The differences between our SLK and Giuliano et al.'s SLK [5] are: our global kernel not only use bags of words but also use part of speech, another entities type, a dictionary of compound verbs; The window size of right kernel of our local context starts from the beginning of the sentence to the word immediately before the second entity, the window size of left kernel start from the word immediately after the first entity to the end of the sentence. Our experimental results show that the supervised method using our SKL can achieve higher accuracy than the one used by Giuliano et al. [5]. And the system's accuracy when applying the bootstrapping method is higher than when applying the supervised one.