Skip to content

Latest commit

 

History

History
13 lines (8 loc) · 1.02 KB

README.md

File metadata and controls

13 lines (8 loc) · 1.02 KB

Hierarchical-cluster-cancer

The following project was an attempt to cluster data on cervical cancer. How many groups we can clusters, and what is the optimum number of clusters.

Hierarchical clustering collects similar vectors after clusters of similar vectors are formed, the process repeats itself by making a bigger cluster compromising of the smaller cluster. The end cluster is the aggregate cluster consisting all the nested clusters. This process is useful in assessing how similar samples are to each other. We can also determine optimum number of clusters by using looking at average silehotte width selection_criterion

As we can see that optimal number of cluster is 2. So we base our clustering in taking 2 clusters opti_clusters

We get the following this might represent cancer patients from non cancer patients