Chinese named entity recognition based on hierarchical hybrid model

  • Authors:
  • Zhihua Liao;Zili Zhang;Yang Liu

  • Affiliations:
  • Hunan Normal University, Changsha, Hunan, China;Southwest University, Chongqing, China and Deakin University, Geelong, VIC, Australia;Jilin University, Changchun, Jilin, China

  • Venue:
  • PRICAI'10 Proceedings of the 11th Pacific Rim international conference on Trends in artificial intelligence
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Chinese named entity recognition is a challenging, difficult, yet important task in natural language processing. This paper presents a novel approach based on a hierarchical hybrid model to recognize Chinese named entities. Three mutually dependent stages- boosting, Markov Logic Networks (MLNs) based recognition, and abbreviation detection - are integrated in the model. AdaBoost algorithm is utilized for fast recognition of simple named entities first. More complex named entities are then piped into MLNs for accurate recognition. In particular, the left boundary recognition of named entities is considered. Lastly, special care is taken for classifying the abbreviated named entities by using the global context information in the same document. Experiments were conducted on People's Daily corpus. The results show that our approach can improve the performance significantly with precision of 94.38%, recall of 93.89%, and Fβ=1 value of 93.97%.