You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Integrate K-means clustering with CRRao
Integrate K-means clustering with CRRao from Clustering.jl package.
The Clustering.jl package is weird because it wants data to be supplied as d x n, where d is the dimension of the data, i.e., number of variables, and n is the number of samples. However, this is the opposite practice of the Stat community. In the Statistics community, it must be supplied as n x d. So we need to fix it.
I have submitted a PR for this issue. In this implementation, the function returns a 'KmeansResult' object. One hurdle I see with the current implementation is that the attributes of the object returned are also of the form (d x n). Do you believe that instead of passing the object, the clustering results would be better passed as tuples?
Integrate K-means clustering with CRRao
Integrate K-means clustering with CRRao from Clustering.jl package.
The Clustering.jl package is weird because it wants data to be supplied as
d x n
, whered
is the dimension of the data, i.e., number of variables, andn
is the number of samples. However, this is the opposite practice of the Stat community. In the Statistics community, it must be supplied asn x d
. So we need to fix it.The possible solution would look like
If somebody does not want to use all variables in the DataFrame, then the solution would look like
Warning: The dimension of data input in Clustering.jl is
n x d
The text was updated successfully, but these errors were encountered: