Get ACES

ACES is a machine learning toolbox for clustering analysis and visualization of both biological data and other types data. Given the biological data or their distance/probability matrix, ACES can automatically extract the features of each identity and cluster them by various widely used clustering algorithms. To facilitate the Hierarchical and k-means Clustering, the candidate centroids of clusters are first estimated by a novel distances and standard deviation based algorithm. To visualize the original data or distance matrix, Principle Component Analysis is used to reduce the dimensionality and extract three significant components for plotting them into 3D space. ACES also provides the interface for clustering analysis and visualization together with the attributes or sample information of each identi.ty. It is clear to show which attributes contribute to the clustering results.

Run ACES

Step 1: Get the latest version of ACES, together with some data for testing, from here. To run ACES just double-click on the latest ACES.jar file.

Step 2: Check the format of files. Original Data, Distance Matrix, Data Attributes

Step 3: Start your ACES journey.

How to use?

Load Original Data File:

Check the recommended format

Open a file

Distance Matrix:

Check the recommended format

Open a file or files

Clustering Analysis:

Get the number of Clusters

Show Hierarchical clustering results

Show K-means clustering results

Show DBSCAN clustering results

Attributes:

Check the recommended format

Open a SampleInfo file

Show all the Attributes in the SampleInfo file

Show the discriminative power of each Attribute

Select an Attribute to plot

Add Clusters Info to the SampleInfo file

Save the SampleInfo

Visualization:

Plot samples with clustering results

Plot samples with the selected attribute

Heat map of the samples

Heat map of the samples with clustering results

Heat map of the samples with the selected attribute

Original Sample Data Format

There are two choices to format your sample data file: File -> Formats -> Raw data

Format 1: The Label ID is shown in the one of columns. The data vector of each sample is distributed by rows.
Format 2: The Label ID is shown in the one of rows. The data vector of each sample is distributed by columns.

Run ACES

How to use?

Load Original Data File:

Distance Matrix:

Clustering Analysis:

Attributes:

Visualization:

Original Sample Data Format

Open a Sample Data file

Distance Matrix

Open a Distance Matrix file

Get the number of Clusters

Show Hierarchical clustering results

Show K-means clustering results

Show DBSCAN clustering results

Plot samples with clustering results (3D and 2D)

Heat map of the samples

Heat map of the samples with clustering results

Check the recommended format for attributes files

Open a SampleInfo file

Show all the Attributes in the SampleInfo file

Show the discriminative power of each Attribute

Select an Attribute to plot

Add Clusters Info to the SampleInfo file

Save the SampleInfo file

Plot samples with the selected attribute(3D and 2D)

Heat map of the samples with the selected attribute