Identification of maximal-length noun phrases based on expanded chunks and classified punctuations in chinese

  • Authors:
  • Xue-Mei Bai;Jin-Ji Li;Dong-Il Kim;Jong-Hyeok Lee

  • Affiliations:
  • Department of Computer Science and Engineering, Electrical and Computer Engineering Division and Advanced Information Technology Research Center (AITrc), Pohang University of Science and Technolog ...;Department of Computer Science and Engineering, Electrical and Computer Engineering Division and Advanced Information Technology Research Center (AITrc), Pohang University of Science and Technolog ...;Language Engineering Institute, Department of Computer, Electron and Telecommunication Engineering, Yanbian University of Science and Technology (YUST), Yanji, Jilin, P.R. China;Department of Computer Science and Engineering, Electrical and Computer Engineering Division and Advanced Information Technology Research Center (AITrc), Pohang University of Science and Technolog ...

  • Venue:
  • ICCPOL'06 Proceedings of the 21st international conference on Computer Processing of Oriental Languages: beyond the orient: the research challenges ahead
  • Year:
  • 2006

Quantified Score

Hi-index 0.01

Visualization

Abstract

In general, there are two types of noun phrases (NP): Base Noun Phrase (BNP), and Maximal-Length Noun Phrase (MNP). MNP identification can largely reduce the complexity of full parsing, help analyze the general structure of complex sentences, and provide important clues for detecting main predicates in Chinese sentences. In this paper, we propose a 2-phase hybrid approach for MNP identification which adopts salient features such as expanded chunks and classified punctuations to improve performance. Experimental result shows a high quality performance of 89.66% in F1-measure.