Greenstone: a comprehensive open-source digital library software system
DL '00 Proceedings of the fifth ACM conference on Digital libraries
Self-Organizing Maps
OpenDLib: A Digital Library Service System
ECDL '02 Proceedings of the 6th European Conference on Research and Advanced Technology for Digital Libraries
Fedora: an architecture for complex objects and their relationships
International Journal on Digital Libraries
Integrating Domain Knowledge in Equation Discovery
Computational Discovery of Scientific Knowledge
VICTORY: a 3D search engine over P2P and wireless P2P networks
Proceedings of the 4th Annual International Conference on Wireless Internet
Visual cluster analysis of trajectory data with interactive Kohonen maps
Information Visualization
DataCite - A Global Registration Agency for Research Data
COINFO '09 Proceedings of the 2009 Fourth International Conference on Cooperation and Promotion of Information Resources in Science and Technology
Clustering of time series data-a survey
Pattern Recognition
A visual digital library approach for time-oriented scientific primary data
ECDL'10 Proceedings of the 14th European conference on Research and advanced technology for digital libraries
ECDL'10 Proceedings of the 14th European conference on Research and advanced technology for digital libraries
Comparative Analysis of Multidimensional, Quantitative Data
IEEE Transactions on Visualization and Computer Graphics
Content-based layouts for exploratory metadata search in scientific research data
Proceedings of the 12th ACM/IEEE-CS joint conference on Digital Libraries
Guided discovery of interesting relationships between time series clusters and metadata properties
Proceedings of the 12th International Conference on Knowledge Management and Knowledge Technologies
A benchmark for content-based retrieval in bivariate data collections
TPDL'12 Proceedings of the Second international conference on Theory and Practice of Digital Libraries
Visual-interactive querying for multivariate research data repositories using bag-of-words
Proceedings of the 13th ACM/IEEE-CS joint conference on Digital libraries
Hi-index | 0.01 |
Increasing amounts of data are collected in many areas of research and application. The degree to which this data can be accessed, retrieved, and analyzed is decisive to obtain progress in fields such as scientific research or industrial production. We present a novel method supporting content-based retrieval and exploratory search in repositories of multivariate research data. In particular, functional dependencies are a key characteristic of data that researchers are often interested in. Our methods are able to describe the functional form of such dependencies, e.g., the relationship between inflation and unemployment in economics. Our basic idea is to use feature vectors based on the goodness-of-fit of a set of regression models, to describe the data mathematically. We denote this approach Regressional Features and use it for content-based search and, since our approach motivates an intuitive definition of interestingness, for exploring the most interesting data. We apply our method on considerable real-world research datasets, showing the usefulness of our approach for user-centered access to research data in a Digital Library system.