A simple rule-based approach to organization name recognition in chinese text

  • Authors:
  • Wang Houfeng;Shi Wuguang

  • Affiliations:
  • Institute of Computational Linguistics, School of Electronic Engineering and Computer Science, Peking University, Beijing, China;Institute of Computational Linguistics, School of Electronic Engineering and Computer Science, Peking University, Beijing, China

  • Venue:
  • CICLing'05 Proceedings of the 6th international conference on Computational Linguistics and Intelligent Text Processing
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents a simple rule based approach to organization name recognition in Chinese text. Based on Chinese knowledge sources, our approach detects potential left and right boundaries in a text, and then determines whether a left-right boundary pair encloses an organization name by using a length constraint and non-organization name words/POS-tag constraints. Organization names with nested structure are also processed. This approach is easy to implement and the evaluation results are satisfactory.