From edf7668291a30d6c73dd0fb884a74d1d78e5786d Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Jorge=20Ant=C3=B3nio?= Date: Mon, 7 Oct 2024 16:30:56 +0100 Subject: [PATCH] improve (#2548) --- README.md | 1 + 1 file changed, 1 insertion(+) diff --git a/README.md b/README.md index a351ab667..4c84a0918 100644 --- a/README.md +++ b/README.md @@ -187,6 +187,7 @@ And then head over to - [`candle-sampling`](https://github.com/EricLBuehler/candle-sampling): Sampling techniques for Candle. - [`gpt-from-scratch-rs`](https://github.com/jeroenvlek/gpt-from-scratch-rs): A port of Andrej Karpathy's _Let's build GPT_ tutorial on YouTube showcasing the Candle API on a toy problem. - [`candle-einops`](https://github.com/tomsanbear/candle-einops): A pure rust implementation of the python [einops](https://github.com/arogozhnikov/einops) library. +- [`atoma-infer`](https://github.com/atoma-network/atoma-infer): A Rust library for fast inference at scale, leveraging FlashAttention2 for efficient attention computation, PagedAttention for efficient KV-cache memory management, and multi-GPU support. It is OpenAI api compatible. If you have an addition to this list, please submit a pull request.