Click-through prediction for news queries
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Expert Systems with Applications: An International Journal
RetriBlog: An architecture-centered framework for developing blog crawlers
Expert Systems with Applications: An International Journal
Towards social data platform: automatic topic-focused monitor for twitter stream
Proceedings of the VLDB Endowment
Hi-index | 0.00 |
Weblogs, and other forms of social media, differ from traditional web content in many ways. One of the most important differences is the highly temporal nature of the content. Applications that leverage social media content must, to be effective, have access to this data with minimal publication/acquisition latency. An effective weblog crawler should satisfy the following requirements: low latency, highly scalable, high data quality and appropriate network politeness. In this paper, we outline the weblog crawler implemented in the social streams project and summarize the challenges faced during development.