Discovery of extreme events-related communities in contrasting groups of physical system networks

  • Authors:
  • Zhengzhang Chen;William Hendrix;Hang Guan;Isaac K. Tetteh;Alok Choudhary;Fredrick Semazzi;Nagiza F. Samatova

  • Affiliations:
  • North Carolina State University, Raleigh, USA 27695 and Oak Ridge National Laboratory, Oak Ridge, USA 37831;North Carolina State University, Raleigh, USA 27695;Zhejiang University, Hangzhou, China 31000;North Carolina State University, Raleigh, USA 27695;Northwestern University, Evanston, USA 60201;North Carolina State University, Raleigh, USA 27695;North Carolina State University, Raleigh, USA 27695 and Oak Ridge National Laboratory, Oak Ridge, USA 37831

  • Venue:
  • Data Mining and Knowledge Discovery
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

The latent behavior of a physical system that can exhibit extreme events such as hurricanes or rainfalls, is complex. Recently, a very promising means for studying complex systems has emerged through the concept of complex networks. Networks representing relationships between individual objects usually exhibit community dynamics. Conventional community detection methods mainly focus on either mining frequent subgraphs in a network or detecting stable communities in time-varying networks. In this paper, we formulate a novel problem--detection of predictive and phase-biased communities in contrasting groups of networks, and propose an efficient and effective machine learning solution for finding such anomalous communities. We build different groups of networks corresponding to different system's phases, such as higher or low hurricane activity, discover phase-related system components as seeds to help bound the search space of community generation in each network, and use the proposed contrast-based technique to identify the changing communities across different groups. The detected anomalous communities are hypothesized (1) to play an important role in defining the target system's state(s) and (2) to improve the predictive skill of the system's states when used collectively in the ensemble of predictive models. When tested on the two important extreme event problems--identification of tropical cyclone-related and of African Sahel rainfall-related climate indices--our algorithm demonstrated the superior performance in terms of various skill and robustness metrics, including 8---16 % accuracy increase, as well as physical interpretability of detected communities. The experimental results also show the efficiency of our algorithm on synthetic datasets.