Summary generation centered on important words

  • Authors:
  • Dongli Han;Takashi Noguchi;Tomokazu Yago;Minoru Harada

  • Affiliations:
  • Department of Integrated Information Technology, Aoyama Gakuin University, Sagamihara-shi, Kanagawa, Japan;Department of Industrial and Systems Engineering, Aoyama Gakuin University, Sagamihara-shi, Kanagawa, Japan;Department of Industrial and Systems Engineering, Aoyama Gakuin University, Sagamihara-shi, Kanagawa, Japan;Department of Integrated Information Technology, Aoyama Gakuin University, Sagamihara-shi, Kanagawa, Japan

  • Venue:
  • AIRS'04 Proceedings of the 2004 international conference on Asian Information Retrieval Technology
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

We developed a summarizing system ABISYS based on the output of semantic analysis system SAGE. ABISYS extracts important words from an article and generates summary sentences according to the word meanings and the deep cases among the words in the output from SAGE. In this paper, we define five kinds of scores to evaluate the importance of a word respectively on repetition information, context information, position information, opinion word information and topic-focus information. We first calculate the above scores for each substantive and reflect them in a five-dimensional space. Then the probability of each substantive to be important is calculated using a pan-distance of Mahalanobis. Finally, we complement the indispensable cases for verbs and the Sahen nouns that have been selected as important words, and use them as the summary element words to generate easy-to-read Japanese sentences. We carried out a subjectivity evaluation for our system output by referring to the summaries made by human. In comparison with the subjectivity evaluations made for other summarizing systems, we found that the point of readability was on a par with other systems, while the point of content covering was much better. And 95% of the summary sentences generated by ABISYS were acknowledged as correct Japanese sentences.