Collection of resources for embeddings
- khellific/anidb-series-embeddings at main : https://huggingface.co/datasets/khellific/anidb-series-embeddings/tree/main
- Hacker News OpenAI Embeddings | Kaggle : https://www.kaggle.com/datasets/julien040/hacker-news-openai-embeddings
- Cohere/wikipedia-22-12-en-embeddings · Datasets at Hugging Face : https://huggingface.co/datasets/Cohere/wikipedia-22-12-en-embeddings
- Glove Embeddings | Kaggle : https://www.kaggle.com/datasets/anmolkumar/glove-embeddings
- NLPL word embeddings repository : http://vectors.nlpl.eu/repository/
- RxRx19a COVID-19 Image Embeddings | Kaggle : https://www.kaggle.com/datasets/tunguz/rxrx19a
- Gensim Word Embeddings | Kaggle : https://www.kaggle.com/datasets/iezepov/gensim-embeddings-dataset
- 130k Images (512x512) - Universal Image Embeddings | Kaggle : https://www.kaggle.com/datasets/rhtsingh/130k-images-512x512-universal-image-embeddings
- Pre-trained Word Vectors for Spanish | Kaggle : https://www.kaggle.com/datasets/rtatman/pretrained-word-vectors-for-spanish
- Embeddings: GloVe, Crawl, etc. | torch cached | Kaggle : https://www.kaggle.com/datasets/leighplt/embeddings-glove-crawl-torch-cached
- fasttext embeddings | Kaggle : https://www.kaggle.com/datasets/abhishek/fasttext
- OpenAI Embeddings for New York Times Articles | Kaggle : https://www.kaggle.com/datasets/dilwong/openai-embeddings-for-new-york-times-articles?resource=download
- GitHub - erikbern/ann-benchmarks: Benchmarks of approximate nearest neighbor libraries in Python : https://github.com/erikbern/ann-benchmarks/tree/main#data-sets