Analysis of a Classifier with Random Thresholds
Cybernetics and Systems Analysis
Properties of Numeric Codes for the Scheme of Random Subspaces RSC
Cybernetics and Systems Analysis
A Bayes-true data generator for evaluation of supervised and unsupervised learning methods
Pattern Recognition Letters
Structure and attributes community detection benchmark and a novel selection method
Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
Proceedings of the 7th Workshop on Social Network Mining and Analysis
Hi-index | 0.10 |
Dataset generators are useful for the evaluation of an algorithm's performance because they allow control of the characteristics and amount of data used for benchmarking. We propose a dataset generator called DataGen that allows varying the number of input features and output classes, the complexity and realizations of class regions, the distributions of data samples, the noise level, the number of data samples. A C language listing of basic DataGen version is provided.