SAMMON
Multidimensional Datasets for Cluster Analysis


SAMMON is a dataset directory which contains six sets of M-dimensional test data for multivariate data clustering.

Licensing:

The computer code and data files described and made available on this web page are distributed under the GNU LGPL license.

Related Data and Programs:

MARTINEZ, a dataset directory which contains datasets for computational statistics;

MDS, a dataset directory which contains datasets for multidimensional scaling;

PCL, a dataset directory which contains datasets from a gene expression experiment on Arabidopsis, which are candidates for data cluster analysis;

SAMMON_DATA, a MATLAB program which generates six sets of M-dimensional data for cluster analysis;

SPAETH, a dataset directory which contains datasets for cluster analysis;

SPAETH2, a dataset directory which contains datasets for cluster analysis.

References:

  1. Ronald Fisher,
    The use of multiple measurements in taxonomic problems,
    Annals of Eugenics,
    Volume 7, part II, 1936, pages 179-188.
  2. John Sammon,
    A nonlinear mapping for data structure analysis,
    IEEE Transactions on Computers,
    Volume C-18, Number 5, May 1969, pages 401-409.

Datasets:



Last revised on 01 September 2011