The Research on the Application of Text Clustering and Natural Language Understanding in Automatic Abstracting

  • Authors:
  • Qinglin Guo;Cunbin Li

  • Affiliations:
  • North China Electric Power University, Beijing, 102206, China;North China Electric Power University, Beijing, 102206, China

  • Venue:
  • FSKD '07 Proceedings of the Fourth International Conference on Fuzzy Systems and Knowledge Discovery - Volume 04
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Qinglin Guo1, Cunbin Li2 1. School of Computer Science and Technology, North China Electric Power University, Beijing, 102206, China 2. School of Business Administration, North China Electric Power University, Beijing, 102206, China qlguo88@sohu.com Abstract A method of realization of Automatic Abstracting based on Text Clustering and Natural Language Understanding is brought forward, aimed at overcoming shortages of some current methods. The method makes use of text Clustering and can realize Automatic Abstracting of multi- documents. The algorithm of twice Word Segmentation based on the Title and First- Sentences in Paragraphs is brought forward. Its precision and recall is above 95%. For a specific domain on plastics, an Automatic Abstracting system named TCAAS is implemented. The precision and recall of multi- document's Automatic Abstracting is above 75%. And experiments do prove that it is feasible to use the method to develop a domain Automatic Abstracting System, which is valuable for further study in more depth.