Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

output umap clusters (label, features and cluster id) #1149

Open
dgcovell opened this issue Sep 3, 2024 · 1 comment
Open

output umap clusters (label, features and cluster id) #1149

dgcovell opened this issue Sep 3, 2024 · 1 comment

Comments

@dgcovell
Copy link

dgcovell commented Sep 3, 2024

This is not a legitimate 'issue', rather a request for more information from umap results. I scanned the past issues and could find #1041 and #938 as somewhat relevant, but not exactly. If possible, using any example, except for the dynamic cases, could code be provided to generate umap clusters from the raw data. This should include all the labels and features (as well as latent space (coordinates??)). This may already be in the existing examples, but I am not finding it. If it is, please comment where.

Thanks,
BTW, as you know, umap results represent strong competition for existing tensor flow utilities.

@dgcovell
Copy link
Author

dgcovell commented Sep 4, 2024

For example:
umap.plot.points(mapper, labels=pendigits.target)

yields an embedding scatter plot. How do I associate these labels with mapper.embedding_ and export three columns to an excel file? So far I can get only the two column embeddings.

df = pd.DataFrame(mapper.embedding_)
df.to_excel('mapper_embeddings.xlsx')

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant