COLING '86 Proceedings of the 11th conference on Computational linguistics
Automated text summarization and the SUMMARIST system
TIPSTER '98 Proceedings of a workshop on held at Baltimore, Maryland: October 13-15, 1998
We developed a summarization system, ABISYS, that works on the output of the semantic analysis system SAGE. ABISYS extracts important words from an article and generates summary sentences according to the word meanings and the deep cases among the words in SAGE's output. In this paper, we define five kinds of scores that evaluate the importance of a word, based respectively on repetition information, context information, position information, opinion-word information, and topic-focus information. We first calculate these scores for each substantive and plot them in a five-dimensional space. The probability that each substantive is important is then calculated using the Mahalanobis generalized distance. Finally, we supply the indispensable cases for the verbs and Sahen nouns that have been selected as important words, and use the results as summary element words to generate easy-to-read Japanese sentences. We carried out a subjective evaluation of the system output with reference to summaries written by humans. Compared with the subjective evaluations reported for other summarization systems, readability was on a par with the other systems, while content coverage was much better. In addition, 95% of the summary sentences generated by ABISYS were judged to be correct Japanese sentences.
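The scoring step described in the abstract can be illustrated with a minimal sketch. It assumes that each substantive is represented by a vector of five scores, and that a reference set of vectors for words already known to be important is available; a candidate word is then ranked by its squared Mahalanobis distance from the mean of that reference set. The abstract does not say how ABISYS turns the distance into a probability, so that step is left out. All function names, the reference-set idea, and the small `ridge` term (added only to keep the covariance matrix invertible) are assumptions made for illustration, not details taken from the paper.

```python
def mean_vector(rows):
    """Component-wise mean of a list of equal-length score vectors."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]


def covariance(rows, mean, ridge=0.0):
    """Sample covariance matrix; `ridge` (an assumption) is added to the diagonal."""
    n, d = len(rows), len(mean)
    cov = [[0.0] * d for _ in range(d)]
    for r in rows:
        diff = [x - m for x, m in zip(r, mean)]
        for i in range(d):
            for j in range(d):
                cov[i][j] += diff[i] * diff[j]
    cov = [[c / (n - 1) for c in row] for row in cov]
    for i in range(d):
        cov[i][i] += ridge
    return cov


def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("singular covariance matrix")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def mahalanobis_sq(x, mean, cov):
    """Squared Mahalanobis distance (x - mean)^T cov^{-1} (x - mean)."""
    diff = [xi - mi for xi, mi in zip(x, mean)]
    y = solve(cov, diff)  # y = cov^{-1} diff, without forming the inverse
    return sum(d * yi for d, yi in zip(diff, y))


def rank_candidates(reference, candidates, ridge=1e-6):
    """Rank candidate words by distance from the reference set, closest first.

    `reference` is a list of five-score vectors for words assumed important;
    `candidates` maps each word to its own five-score vector.
    """
    mean = mean_vector(reference)
    cov = covariance(reference, mean, ridge=ridge)
    scored = [(w, mahalanobis_sq(v, mean, cov)) for w, v in candidates.items()]
    return sorted(scored, key=lambda pair: pair[1])
```

Unlike a plain Euclidean distance, the Mahalanobis distance rescales each of the five scores by its variance and accounts for correlation between them, so a score type with a wide natural range does not dominate the ranking.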