Authoritative sources in a hyperlinked environment
Journal of the ACM (JACM)
Discovery of Frequent Episodes in Event Sequences
Data Mining and Knowledge Discovery
Data streams: algorithms and applications
SODA '03 Proceedings of the fourteenth annual ACM-SIAM symposium on Discrete algorithms
WWW '03 Proceedings of the 12th international conference on World Wide Web
Clustering Data Streams: Theory and Practice
IEEE Transactions on Knowledge and Data Engineering
Newsjunkie: providing personalized newsfeeds via analysis of information novelty
Proceedings of the 13th international conference on World Wide Web
Automatic web news extraction using tree edit distance
Proceedings of the 13th international conference on World Wide Web
Detection of Significant Sets of Episodes in Event Sequences
ICDM '04 Proceedings of the Fourth IEEE International Conference on Data Mining
HLT '01 Proceedings of the first international conference on Human language technology research
QCS: a tool for querying, clustering, and summarizing documents
NAACL-Demonstrations '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology: Demonstrations - Volume 4
A framework for clustering evolving data streams
VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
The anatomy of a news search engine
WWW '05 Special interest tracks and posters of the 14th international conference on World Wide Web
nReader: reading news quickly, deeply and vividly
CHI '06 Extended Abstracts on Human Factors in Computing Systems
MAPS: approximate publish/subscribe functionality in peer-to-peer networks
Proceedings of the 1st international workshop on Advanced data processing in ubiquitous computing (ADPUC 2006)
Resource-adaptive real-time new event detection
Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Tracking multiple topics for finding interesting articles
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Usage-based ranking of distributed XML data
Proceedings of the 2008 ACM symposium on Applied computing
Predicting News Story Importance Using Language Features
WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
TC-SocialRank: Ranking the Social Web
WAW '09 Proceedings of the 6th International Workshop on Algorithms and Models for the Web-Graph
Scalable Web Mining with Newistic
PAKDD '09 Proceedings of the 13th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
Click-through prediction for news queries
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Mining and ranking streams of news stories using cross-stream sequential patterns
Proceedings of the 18th ACM conference on Information and knowledge management
Concordance-based entity-oriented search
Web Intelligence and Agent Systems
Entity-aware query processing for heterogeneous data with uncertainty and correlations
Proceedings of the 2009 EDBT/ICDT Workshops
Towards recency ranking in web search
Proceedings of the third ACM international conference on Web search and data mining
Autonomous news clustering and classification for an intelligent web portal
ISMIS'08 Proceedings of the 17th international conference on Foundations of intelligent systems
Probabilistic identification for hard to classify protocol
WISTP'08 Proceedings of the 2nd IFIP WG 11.2 international conference on Information security theory and practices: smart devices, convergence and next generation networks
Durable top-k search in document archives
Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
Recommendation in Internet forums and blogs
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
ADC '10 Proceedings of the Twenty-First Australasian Conference on Database Technologies - Volume 104
User comments for news recommendation in forum-based social media
Information Sciences: an International Journal
Challenges in personalized authority flow based ranking of social media
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Measuring the interestingness of articles in a limited user environment
Information Processing and Management: an International Journal
Learning the importance of latent topics to discover highly influential news items
KI'10 Proceedings of the 33rd annual German conference on Advances in artificial intelligence
Increasing patient safety using explanation-driven personalized content recommendation
Proceedings of the 1st ACM International Health Informatics Symposium
Leadership discovery when data correlatively evolve
World Wide Web
Mining news streams using cross-stream sequential patterns
RIAO '10 Adaptivity, Personalization and Fusion of Heterogeneous Information
Iterative Filtering in Reputation Systems
SIAM Journal on Matrix Analysis and Applications
Discovering authoritative news sources and top news stories
AIRS'06 Proceedings of the Third Asia conference on Information Retrieval Technology
A language modeling approach for temporal information needs
ECIR'2010 Proceedings of the 32nd European conference on Advances in Information Retrieval
A link-based ranking model for services
ODBASE'06/OTM'06 Proceedings of the 2006 Confederated international conference on On the Move to Meaningful Internet Systems: CoopIS, DOA, GADA, and ODBASE - Volume Part I
Selecting labels for news document clusters
NLDB'07 Proceedings of the 12th international conference on Applications of Natural Language to Information Systems
Processing continuous text queries featuring non-homogeneous scoring functions
Proceedings of the 21st ACM international conference on Information and knowledge management
Ranking news events by influence decay and information fusion for media and users
Proceedings of the 21st ACM international conference on Information and knowledge management
GeoRank: an efficient location-aware news feed ranking system
Proceedings of the 21st ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems
Using temporal bursts for query modeling
Information Retrieval
Feature identification for topical relevance assessment in feed search engines
Intelligent Data Analysis
Hi-index | 0.00 |
According to a recent survey made by Nielsen NetRatings, searching on news articles is one of the most important activity online. Indeed, Google, Yahoo, MSN and many others have proposed commercial search engines for indexing news feeds. Despite this commercial interest, no academic research has focused on ranking a stream of news articles and a set of news sources. In this paper, we introduce this problem by proposing a ranking framework which models: (1) the process of generation of a stream of news articles, (2) the news articles clustering by topics, and (3) the evolution of news story over the time. The ranking algorithm proposed ranks news information, finding the most authoritative news sources and identifying the most interesting events in the different categories to which news article belongs. All these ranking measures take in account the time and can be obtained without a predefined sliding window of observation over the stream. The complexity of our algorithm is linear in the number of pieces of news still under consideration at the time of a new posting. This allow a continuous on-line process of ranking. Our ranking framework is validated on a collection of more than 300,000 pieces of news, produced in two months by more then 2000 news sources belonging to 13 different categories (World, U.S, Europe, Sports, Business, etc). This collection is extracted from the index of comeToMyHead, an academic news search engine available online.