Information extraction from HTML: application of a general machine learning approach
AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Rank aggregation methods for the Web
Proceedings of the 10th international conference on World Wide Web
Modern Information Retrieval
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features
ECML '98 Proceedings of the 10th European Conference on Machine Learning
Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on Innovative Applications of Artificial Intelligence
Efficient similarity search and classification via rank aggregation
Proceedings of the 2003 ACM SIGMOD international conference on Management of data
Wrapper induction for information extraction
Wrapper induction for information extraction
Measuring praise and criticism: Inference of semantic orientation from association
ACM Transactions on Information Systems (TOIS)
Predicting the semantic orientation of adjectives
ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
The integration of business intelligence and knowledge management
IBM Systems Journal
Natural Language Engineering
Awarded Best Paper! - Scalable Centralized Bayesian Spam Mitigation with Bogofilter
LISA '04 Proceedings of the 18th USENIX conference on System administration
The Wisdom of Crowds
Determining the semantic orientation of terms through gloss classification
Proceedings of the 14th ACM international conference on Information and knowledge management
The Impact of Ranker Quality on Rank Aggregation Algorithms: Information vs. Robustness
ICDEW '06 Proceedings of the 22nd International Conference on Data Engineering Workshops
Beautiful Evidence
Applications of Voting Theory to Information Mashups
ICSC '08 Proceedings of the 2008 IEEE International Conference on Semantic Computing
Accessing the deep web: when good ideas go bad
Companion to the 23rd ACM SIGPLAN conference on Object-oriented programming systems languages and applications
MONGOOSE: MONitoring Global Online Opinions via Semantic Extraction
CLOUD '09 Proceedings of the 2009 IEEE International Conference on Cloud Computing
Context and Domain Knowledge Enhanced Entity Spotting in Informal Text
ISWC '09 Proceedings of the 8th International Semantic Web Conference
Using Wikipedia and Wiktionary in domain-specific information retrieval
CLEF'08 Proceedings of the 9th Cross-language evaluation forum conference on Evaluating systems for multilingual and multimodal information access
Ontology-driven automatic entity disambiguation in unstructured text
ISWC'06 Proceedings of the 5th international conference on The Semantic Web
Citizen sensor data mining, social media analytics and development centric web applications
Proceedings of the 20th international conference companion on World wide web
Hi-index | 0.00 |
Social Networks provide one of the most rapidly evolving data sets in existence today. Traditional Business Intelligence applications struggle to take advantage of such data sets in a timely manner. The BBC SoundIndex, developed by the authors and others, enabled real-time analytics of music popularity using data from a variety of Social Networks. We present this system as a grounding example of how to overcome the challenges of working with this data from social networks. We discuss a variety of technologies to implement near real-time data analytics to transform Social Intelligence into Business Intelligence and evaluate their effectiveness in the music domain. The SoundIndex project helped to highlight a number of key research areas, including named entity recognition and sentiment analysis in Informal English. It also drew attention to the importance of metadata aggregation in multimodal environments. We explored challenges such as drawing data from a wide set of sources spanning a myriad of modalities, developing adjudication techniques to harmonize inputs, and performing deep analytics on extremely challenging Informal English snippets. Ultimately, we seek to provide guidance on developing applications in a variety of domains that allow an analyst to rapidly grasp the evolution in the social landscape, and show how to validate such a system for a real-world application.