Artificial dataset for clustering algorithms(Complete)
Download (103.3 MB) This item is shared privately
dataset
modified on 2018-09-27, 07:41 This file contains a number of randomly
generated datasets. The properties of each dataset are indicated in the
name of each respective file: 'C' indicates the number of classes, 'F'
indicates the number of features, 'Ne' indicates the number of objects
contained in each class, 'A' is related to the average separation
between classes and 'R' is an index used to differentiate distinct
random trials. So, for instance, the file C2F10N2Ne5A1.2R0 is a dataset
containing 2 classes, 10 features, 5 objects for each class and having a
typical separation between classes of 1.2. The methodology used for
generating the datasets is described in the accompanying reference.