The huge volume of information published by thousands of on-line news sites calls for agile access, which motivates applying Information Retrieval techniques to the problem. This paper presents NowOnWeb, a news retrieval system that gathers articles from different on-line sources and provides news searching and browsing. The main problems addressed during the development of NowOnWeb were article recognition and extraction, redundancy detection, and text summarization. For each of them we provide an effective solution; combined, these solutions give rise to a system that reasonably satisfies the user's daily information needs.