Hierarchical-cluster-cancer

The following project was an attempt to cluster data on cervical cancer. How many groups we can clusters, and what is the optimum number of clusters.

Hierarchical clustering collects similar vectors after clusters of similar vectors are formed, the process repeats itself by making a bigger cluster compromising of the smaller cluster. The end cluster is the aggregate cluster consisting all the nested clusters. This process is useful in assessing how similar samples are to each other. We can also determine optimum number of clusters by using looking at average silehotte width

As we can see that optimal number of cluster is 2. So we base our clustering in taking 2 clusters

We get the following this might represent cancer patients from non cancer patients

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Hierarchical-cluster-cancer

Files

README.md

Latest commit

History

README.md

File metadata and controls

Hierarchical-cluster-cancer