TY - JOUR
T1 - Discovery of extreme events-related communities in contrasting groups of physical system networks
AU - Chen, Zhengzhang
AU - Hendrix, William
AU - Guan, Hang
AU - Tetteh, Isaac K.
AU - Choudhary, Alok
AU - Semazzi, Fredrick
AU - Samatova, Nagiza F.
PY - 2013/9
Y1 - 2013/9
N2 - The latent behavior of a physical system that can exhibit extreme events such as hurricanes or rainfalls, is complex. Recently, a very promising means for studying complex systems has emerged through the concept of complex networks. Networks representing relationships between individual objects usually exhibit community dynamics. Conventional community detection methods mainly focus on either mining frequent subgraphs in a network or detecting stable communities in time-varying networks. In this paper, we formulate a novel problem - detection of predictive and phase-biased communities in contrasting groups of networks, and propose an efficient and effective machine learning solution for finding such anomalous communities. We build different groups of networks corresponding to different system's phases, such as higher or low hurricane activity, discover phase-related system components as seeds to help bound the search space of community generation in each network, and use the proposed contrast-based technique to identify the changing communities across different groups. The detected anomalous communities are hypothesized (1) to play an important role in defining the target system's state(s) and (2) to improve the predictive skill of the system's states when used collectively in the ensemble of predictive models. When tested on the two important extreme event problems - identification of tropical cyclone-related and of African Sahel rainfall-related climate indices - our algorithm demonstrated the superior performance in terms of various skill and robustness metrics, including 8-16 % accuracy increase, as well as physical interpretability of detected communities. The experimental results also show the efficiency of our algorithm on synthetic datasets.
AB - The latent behavior of a physical system that can exhibit extreme events such as hurricanes or rainfalls, is complex. Recently, a very promising means for studying complex systems has emerged through the concept of complex networks. Networks representing relationships between individual objects usually exhibit community dynamics. Conventional community detection methods mainly focus on either mining frequent subgraphs in a network or detecting stable communities in time-varying networks. In this paper, we formulate a novel problem - detection of predictive and phase-biased communities in contrasting groups of networks, and propose an efficient and effective machine learning solution for finding such anomalous communities. We build different groups of networks corresponding to different system's phases, such as higher or low hurricane activity, discover phase-related system components as seeds to help bound the search space of community generation in each network, and use the proposed contrast-based technique to identify the changing communities across different groups. The detected anomalous communities are hypothesized (1) to play an important role in defining the target system's state(s) and (2) to improve the predictive skill of the system's states when used collectively in the ensemble of predictive models. When tested on the two important extreme event problems - identification of tropical cyclone-related and of African Sahel rainfall-related climate indices - our algorithm demonstrated the superior performance in terms of various skill and robustness metrics, including 8-16 % accuracy increase, as well as physical interpretability of detected communities. The experimental results also show the efficiency of our algorithm on synthetic datasets.
KW - Community detection
KW - Comparative analysis
KW - Complex network analysis
KW - Extreme event prediction
KW - Network motif detection
KW - Spatio-temporal data mining
UR - http://www.scopus.com/inward/record.url?scp=84879420739&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84879420739&partnerID=8YFLogxK
U2 - 10.1007/s10618-012-0289-3
DO - 10.1007/s10618-012-0289-3
M3 - Article
AN - SCOPUS:84879420739
SN - 1384-5810
VL - 27
SP - 225
EP - 258
JO - Data Mining and Knowledge Discovery
JF - Data Mining and Knowledge Discovery
IS - 2
ER -