A grid portal for solving geoscience problems using distributed knowledge discovery services

  • Authors:
  • Gianluigi Folino;Agostino Forestiero;Giuseppe Papuzzo;Giandomenico Spezzano

  • Affiliations:
  • Institute of High-Performance Computing and Networking (ICAR), National Research Council (CNR), Via Pietro Bucci 41C, I-87036 Rende (CS), Italy;Institute of High-Performance Computing and Networking (ICAR), National Research Council (CNR), Via Pietro Bucci 41C, I-87036 Rende (CS), Italy;Institute of High-Performance Computing and Networking (ICAR), National Research Council (CNR), Via Pietro Bucci 41C, I-87036 Rende (CS), Italy;Institute of High-Performance Computing and Networking (ICAR), National Research Council (CNR), Via Pietro Bucci 41C, I-87036 Rende (CS), Italy

  • Venue:
  • Future Generation Computer Systems
  • Year:
  • 2010

Quantified Score

Hi-index 0.01

Visualization

Abstract

This paper describes our research effort to employ Grid technologies to enable the development of geoscience applications by integrating workflow technologies with data mining resources and a portal framework in unique work environment called MOSE. Using MOSE, a user can easily compose and execute geo-workflows for analyzing and managing natural disasters such as landslides, earthquakes, floods, wildfires, etc. MOSE is designed to be applicable both for the implementation of response strategies when emergencies occur and for disaster prevention. It takes advantage of the standardized resource access and workflow support for loosely coupled software components provided by web/grid services technologies. The integration of workflows with data mining services significantly improves data analysis. Geospatial data management and mining are critical areas of modern-day geosciences research. An important challenge for geospatial information mining is the distributed nature of the data. MOSE provides knowledge discovery services based on the WEKA data mining library and novel distributed data mining algorithms for spatial data analysis. A P2P bio-inspired algorithm for distributed spatial clustering as an example of distributed knowledge discovery service for intensive data analysis is presented. A real case application for the analysis of landslide hazard areas in the Campania Region near the Sarno area shows the advantages of using the portal.