Hunting for the black swan: risk mining from text

Authors:
Jochen L. Leidner;Frank Schilder
Affiliations:
Thomson Reuters Corporation, Research & Development, MN;Thomson Reuters Corporation, Research & Development, MN
Venue:
ACLDemos '10 Proceedings of the ACL 2010 System Demonstrations
Year:
2010

Citing 7
Cited 0

Ramification analysis using causal mapping

Data & Knowledge Engineering
Web-scale information extraction in knowitall: (preliminary results)

Proceedings of the 13th international conference on World Wide Web
Automatic acquisition of hyponyms from large text corpora

COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 2
The Black Swan: The Impact of the Highly Improbable

The Black Swan: The Impact of the Highly Improbable
Looking for trouble

COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
May all your wishes come true: a study of wishes and how to recognize them

NAACL '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Predicting risk from financial reports with regression

NAACL '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics

Quantified Score

Hi-index	0.00

Visualization

Abstract

In the business world, analyzing and dealing with risk permeates all decisions and actions. However, to date, risk identification, the first step in the risk management cycle, has always been a manual activity with little to no intelligent software tool support. In addition, although companies are required to list risks to their business in their annual SEC filings in the USA, these descriptions are often very high-level and vague. In this paper, we introduce Risk Mining, which is the task of identifying a set of risks pertaining to a business area or entity. We argue that by combining Web mining and Information Extraction (IE) techniques, risks can be detected automatically before they materialize, thus providing valuable business intelligence. We describe a system that induces a risk taxonomy with concrete risks (e.g., interest rate changes) at its leaves and more abstract risks (e.g., financial risks) closer to its root node. The taxonomy is induced via a bootstrapping algorithms starting with a few seeds. The risk taxonomy is used by the system as input to a risk monitor that matches risk mentions in financial documents to the abstract risk types, thus bridging a lexical gap. Our system is able to automatically generate company specific "risk maps", which we demonstrate for a corpus of earnings report conference calls.