Data mining middleware for wide-area high-performance networks

  • Authors:
  • Robert L. Grossman;Yunhong Gu;David Hanley;Michal Sabala;Joe Mambretti;Alex Szalay;Ani Thakar;Kazumi Kumazoe;Oie Yuji;Minsun Lee;Yoonjoo Kwon;Woojin Seok

  • Affiliations:
  • National Center for Data Mining, University of Illinois at Chicago;National Center for Data Mining, University of Illinois at Chicago;National Center for Data Mining, University of Illinois at Chicago;National Center for Data Mining, University of Illinois at Chicago;International Center for Advanced Internet Research, Northwestern University;Johns Hopkins University;Johns Hopkins University;Kitakyushu JGNII Research Center, Japan;Kitakyushu JGNII Research Center, Japan;Korea Institute of Science and Technology Information, Republic of Korea;Korea Institute of Science and Technology Information, Republic of Korea;Korea Institute of Science and Technology Information, Republic of Korea

  • Venue:
  • Future Generation Computer Systems - IGrid 2005: The global lambda integrated facility
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we describe two distributed, data intensive applications that were demonstrated at iGrid 2005 (iGrid Demonstration US 109 and iGrid Demonstration US121). One involves transporting astronomical data from the Sloan Digital Sky Survey (SDSS) and the other involves computing histograms from multiple high-volume data streams. Both rely on newly developed data transport and data mining middleware. Specifically, we describe a new version of the UDT network protocol called Composible-UDT, a file transfer utility based upon UDT called UDT-Gateway, and an application for building histograms on high-volume data flows called BESH (for Best Effort Streaming Histogram). For both demonstrations, we include a summary of the experimental studies performed at iGrid 2005.