This paper presents EXOD (EXploration of Open Datasets), a tool for the visual analysis of a large collection of open datasets that aims to help users find datasets of interest. EXOD first downloads a large collection of datasets from an open data web site and, for each dataset, extracts its metadata and its content. To describe each dataset in a vector space, EXOD applies text mining techniques to both the metadata and the content to extract features. In this feature space, EXOD builds a proximity graph by computing the Relative Neighborhood Graph; given the size of the collection, it uses a GPU-based implementation for this step. We visualize the graph with the Tulip software, which provides a visual and interactive global map of the collection, and we developed a dedicated Tulip plug-in to download and open the datasets interactively. All presented results concern French Open Data. EXOD was able to process 293,000 datasets, and half of this collection was visualized in Tulip. We show how clusters and other information can be discovered and how the created links can be used for local and content-based exploration.
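To make the proximity-graph step concrete, the following is a minimal sketch of the standard Relative Neighborhood Graph definition: two items are linked unless some third item is strictly closer to both of them than they are to each other. It is not the GPU implementation described above. The function name `relative_neighborhood_graph`, the use of Euclidean distance, and the toy 2-D `features` are assumptions made for illustration; the abstract does not specify the distance measure or the feature dimensionality.

```python
from itertools import combinations
from math import dist


def relative_neighborhood_graph(points):
    """Return the RNG edges of `points` as index pairs (i, j) with i < j.

    Brute-force O(n^3) version, for illustration only.
    """
    n = len(points)
    # Precompute all pairwise distances. Euclidean distance is an
    # assumption here; any dissimilarity measure could be substituted.
    d = [[dist(p, q) for q in points] for p in points]
    edges = []
    for i, j in combinations(range(n), 2):
        # i and j are relative neighbors unless a third point k is
        # strictly closer to both of them than they are to each other.
        blocked = any(
            max(d[i][k], d[j][k]) < d[i][j]
            for k in range(n)
            if k != i and k != j
        )
        if not blocked:
            edges.append((i, j))
    return edges


# Hypothetical 2-D feature vectors standing in for dataset descriptions.
features = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0), (1.0, 5.0)]
rng_edges = relative_neighborhood_graph(features)
print(rng_edges)  # [(0, 1), (1, 2), (1, 3)]
```

In this toy example the pair (0, 2) is not linked because point 1 lies between them, while the outlying point 3 stays connected to the rest through its closest neighbor. Each pair test is independent of the others, so the cubic brute-force computation parallelizes naturally, which is one plausible reason a GPU suits a collection of this size.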