Integrate K-Means Clustering with CRRao #109

sourish-cmi · 2023-03-07T02:22:29Z

Integrate K-means clustering with CRRao
Integrate K-means clustering with CRRao from Clustering.jl package.

The Clustering.jl package is weird because it wants data to be supplied as d x n, where d is the dimension of the data, i.e., number of variables, and n is the number of samples. However, this is the opposite practice of the Stat community. In the Statistics community, it must be supplied as n x d. So we need to fix it.

The possible solution would look like

container = fit(DataFrame, KMeansClustering(),K::Int64,...)

If somebody does not want to use all variables in the DataFrame, then the solution would look like

container = fit(VarName, DataFrame, KMeansClustering(),K::Int64,...)

Warning: The dimension of data input in Clustering.jl is n x d

The text was updated successfully, but these errors were encountered:

a-keshav · 2024-01-18T09:51:52Z

Hey, is this issue still open? and if yes, could you please assign it to me?

sourish-cmi · 2024-01-22T04:27:41Z

Hey, is this issue still open? and if yes, could you please assign it to me?

Sure why not - you can try it.

a-keshav · 2024-02-09T16:18:20Z

I have submitted a PR for this issue. In this implementation, the function returns a 'KmeansResult' object. One hurdle I see with the current implementation is that the attributes of the object returned are also of the form (d x n). Do you believe that instead of passing the object, the clustering results would be better passed as tuples?

sourish-cmi added enhancement New feature or request good first issue Good for newcomers labels Mar 7, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Integrate K-Means Clustering with CRRao #109

Integrate K-Means Clustering with CRRao #109

sourish-cmi commented Mar 7, 2023

a-keshav commented Jan 18, 2024 •

edited

Loading

sourish-cmi commented Jan 22, 2024

a-keshav commented Feb 9, 2024

Integrate K-Means Clustering with CRRao #109

Integrate K-Means Clustering with CRRao #109

Comments

sourish-cmi commented Mar 7, 2023

a-keshav commented Jan 18, 2024 • edited Loading

sourish-cmi commented Jan 22, 2024

a-keshav commented Feb 9, 2024

a-keshav commented Jan 18, 2024 •

edited

Loading