An architecture for detecting events in real-time using massive heterogeneous data sources
Proceedings of the 2nd International Workshop on Big Data, Streams and Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications
How the live web feels about events
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Inferring the origin locations of tweets with quantitative confidence
Proceedings of the 17th ACM conference on Computer supported cooperative work & social computing
Hi-index | 0.00 |
Location is prevalent in most applications nowadays, and is considered a first class citizen in social networks. Locational information is of great significance since it can be used to map information from the online back to the physical world, to contextualize information, or to provide localized recommendations through Location-Based Services (LBS), and can be extended to trajectories and itineraries by adding a time-dimension. Despite all that, location extraction in social networks usually relies on GPS-enabled devices, that provide accurate geodetic coordinates. Nevertheless, most users provide general textual information about their surroundings, such as the city they live in, county or state (or equivalents), without using a GPS-enabled device, which is still valuable information for a number of applications. In this paper, we tackle the problem of extracting location information, usually referred to as geocoding, from additional user-provided content. Instead of using sophisticated and complex algorithms, which are common in online map services but require heavy development and tuning, we rely on software and data which are available online and public ally accessible. We discuss the particularities of geocoding in online social networks and present a simple, lightweight, yet efficient approach for location extraction in such a setting. We finally evaluate our approach experimentally on a large corpus of Twitter users.