Machine learning of generic and user-focused summarization
AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Advances in Automatic Text Summarization
Advances in Automatic Text Summarization
Summarization beyond sentence extraction: a probabilistic approach to sentence compression
Artificial Intelligence
Sentence reduction for automatic text summarization
ANLC '00 Proceedings of the sixth conference on Applied natural language processing
Cut and paste based text summarization
NAACL 2000 Proceedings of the 1st North American chapter of the Association for Computational Linguistics conference
BLEU: a method for automatic evaluation of machine translation
ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Why inverse document frequency?
NAACL '01 Proceedings of the second meeting of the North American Chapter of the Association for Computational Linguistics on Language technologies
Automatic evaluation of summaries using N-gram co-occurrence statistics
NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
Understanding the process of multi-document summarization: content selection, rewriting and evaluation
Syntactic simplification for improving content selection in multi-document summarization
COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Using N-Grams to understand the nature of summaries
HLT-NAACL-Short '04 Proceedings of HLT-NAACL 2004: Short Papers
The automatic creation of literature abstracts
IBM Journal of Research and Development
Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
CDDS: Constraint-driven document summarization models
Expert Systems with Applications: An International Journal
Applying a lightweight iterative merging chinese segmentation in web image annotation
MLDM'13 Proceedings of the 9th international conference on Machine Learning and Data Mining in Pattern Recognition
Hi-index | 0.98 |
Different persons often choose different contents in multi-document as summary. To optimize summarization, we will focus on the selection of content and seeking their valuable features. Statistical methods for automatic summarization are very important. In this paper, we research the correlation between the eigenvalue of content unit in the original document cluster and the probability of the content unit to be selected as a human summary based on a statistical method. When a Basic Element and word are considered as a content unit, we draw conclusions, in user-focus summarization. It is excellent that the BE is regarded as content unit granularity, and it is proved that the frequency eigenvalue of the BE is more suitable to embody content units' weightiness than the TFIDF value. Moreover, the paper reveals that the given topic on user-focus summarization is helpful for the selection of content unit and quality of summarization. They often choose those content units as a summary in which the emerging frequency is relatively high in the sentences including the content unit of a given topic and neighboring sentences. Through researching potential behavioural modes about manual summary, we will put these effect factors of summarization quality into the process of content unit selection and summary generation to optimize automatic summarization.