Expanding network communities from representative examples
ACM Transactions on Knowledge Discovery from Data (TKDD)
Name-ethnicity classification from open sources
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Identifying co-referential names across large corpora
CPM'06 Proceedings of the 17th Annual conference on Combinatorial Pattern Matching
Lydia: a system for large-scale news analysis
SPIRE'05 Proceedings of the 12th international conference on String Processing and Information Retrieval
Access: news and blog analysis for the social sciences
Proceedings of the 19th international conference on World wide web
Hi-index | 0.00 |
Large-scale analysis over historical news corpora provides us with unique opportunities to examine sociological issues with respect to local and mass media. In particular, we combine the Lydia named entity recognition system with an name-based ethnicity classification engine to examine issues of ethnic and geocentric sentiment/coverage bias in newspapers. We describe new methods for ethnicity and nationality detection for news entities (people), and build on this to identify interesting temporal, geospatial, and association trends in the coverage with respect to 13 distinct cultural/ethnic/linguistic (CEL) groups.