Foundations of statistical natural language processing
Trawling the Web for emerging cyber-communities
WWW '99 Proceedings of the eighth international conference on World Wide Web
Scalable collection summarization and selection
Proceedings of the fourth ACM conference on Digital libraries
Proceedings of the 9th international World Wide Web conference on Computer networks: the international journal of computer and telecommunications networking
Building a distributed full-text index for the Web
Proceedings of the 10th international conference on World Wide Web
Discovering Structure from Document Databases
PAKDD '99 Proceedings of the Third Pacific-Asia Conference on Methodologies for Knowledge Discovery and Data Mining
Data Mining
Grading knowledge: extracting degree information from texts
On using a quantum physics formalism for multidocument summarization
Journal of the American Society for Information Science and Technology
We present a multilevel model of discussions in USENET newsgroups that includes the use of statistical and linguistic methods to obtain lexical, semantic and discourse characteristics of the text. We expose constraints that make information extraction and summarization more amenable to analysis at different levels. Our model makes use of posting structure, times of posting, time spans, and length and depth of a thread in order to extract higher-level information on subject matter, interest level, topicality, and discussion trends.
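The structural features named in the abstract (length and depth of a thread, and its time span) can be illustrated with a minimal sketch. This is not the authors' model: the Post record, its field names, and the thread_features helper are hypothetical, and the sketch assumes each posting carries an identifier, an optional reference to the posting it replies to, and a timestamp.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Optional


@dataclass
class Post:
    # Hypothetical record layout; the field names are assumptions.
    msg_id: str
    parent_id: Optional[str]  # None for the posting that starts the thread
    posted_at: datetime


def thread_features(posts):
    """Return length, maximum reply depth, and time span of one thread."""
    by_id = {p.msg_id: p for p in posts}

    def depth(post):
        # Count reply hops up to the root; stop on a missing parent or a cycle.
        hops, seen = 0, set()
        while (post.parent_id is not None
               and post.parent_id in by_id
               and post.msg_id not in seen):
            seen.add(post.msg_id)
            post = by_id[post.parent_id]
            hops += 1
        return hops

    times = [p.posted_at for p in posts]
    return {
        "length": len(posts),
        "depth": max((depth(p) for p in posts), default=0),
        "time_span": max(times) - min(times) if times else timedelta(0),
    }


# A small made-up thread: one root, two direct replies, one nested reply.
t0 = datetime(1999, 5, 1, 12, 0)
example = [
    Post("a", None, t0),
    Post("b", "a", t0 + timedelta(hours=1)),
    Post("c", "b", t0 + timedelta(hours=3)),
    Post("d", "a", t0 + timedelta(hours=2)),
]
features = thread_features(example)
```

Features of this kind would only be inputs to the higher-level analysis the abstract describes (interest level, topicality, discussion trends); how the model combines them is not specified here.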