Technical Section: EXOD: A tool for building and exploring a large graph of open datasets

  • Authors:
  • Tianyang Liu;Fatma Bouali;Gilles Venturini

  • Affiliations:
  • -;-;-

  • Venue:
  • Computers and Graphics
  • Year:
  • 2014

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present in this paper a tool called EXOD (EXploration of Open Datasets) for the visual analysis of a large collection of open datasets. EXOD aims at helping the users to find datasets of interest. EXOD starts with the download of a large collection of datasets from an open data web site. For each dataset, it extracts its meta-data and its content. To describe each dataset in a vector space, EXOD extracts features by using text mining techniques. It considers both the metadata and the content of each dataset. Using this feature space, EXOD establishes a proximity graph by computing the Relative Neighborhood Graph. Considering the size of the collection, EXOD uses a GPU-based implementation for building this graph. We visualize the graph using the Tulip software and provide a visual and interactive global map of the collection. We developed a specific plug-in for Tulip to download and open the datasets in an interactive way. All of the presented results concern the French Open Data. EXOD was able to process 293,000 datasets, and half of this collection was visualized in Tulip. We show how clusters and other information can be discovered and how the created links can be used for local and content-based exploration.