Skip to content
View mindest's full-sized avatar

Block or report mindest

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A Zotero plugin for syncing items and notes into Notion

TypeScript 2,508 107 Updated Jan 12, 2025

AlphaFold 3 inference pipeline.

Python 5,822 699 Updated Jan 9, 2025

Open source code for AlphaFold 2.

Python 13,078 2,314 Updated Dec 20, 2024

Whisper realtime streaming for long speech-to-text transcription and translation

Python 2,303 287 Updated Jan 7, 2025

Port of OpenAI's Whisper model in C/C++

C++ 36,845 3,787 Updated Jan 9, 2025

Generative AI extensions for onnxruntime

C++ 574 145 Updated Jan 11, 2025

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 14,019 1,141 Updated May 23, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 9,125 1,057 Updated Jan 8, 2025

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,562 2,579 Updated Jan 7, 2025

vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs

Python 88 5 Updated Dec 20, 2024

A plugin that will automatically download PDFs of zotero items from sci-hub

TypeScript 3,624 183 Updated Jun 28, 2023

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 33,568 5,130 Updated Jan 12, 2025

Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.

Python 9,765 906 Updated Jan 9, 2025

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 15,290 1,237 Updated Dec 12, 2024

CUDA Templates for Linear Algebra Subroutines

C++ 5,983 1,036 Updated Jan 10, 2025

torchview: visualize pytorch models

Python 856 39 Updated Oct 29, 2024
Python 1,025 95 Updated Jan 4, 2024

Inference code for Llama models

Python 57,173 9,652 Updated Aug 18, 2024
Python 2 Updated Nov 9, 2022

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…

Python 2,074 346 Updated Jan 11, 2025

Fast and memory-efficient exact attention

Python 15,011 1,413 Updated Jan 11, 2025

ONNX Graph ToolBox - Operate on your ONNX model with ease, visualize ONNX LLM models containing thousands of nodes.

Python 11 1 Updated Jul 3, 2024

ROCm Communication Collectives Library (RCCL)

C++ 288 128 Updated Jan 11, 2025

A Chinese Hanzi variation of Wordle - 汉字 Wordle

TypeScript 1,306 190 Updated Aug 18, 2024

Benchmarking Deep Learning operations on different hardware

C++ 1,080 236 Updated Apr 25, 2021

imagination

Python 3 Updated Aug 26, 2024

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

C++ 15,261 2,995 Updated Jan 12, 2025

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

Jupyter Notebook 13,804 3,263 Updated Aug 12, 2024

Visualizer for neural network, deep learning and machine learning models

JavaScript 29,061 2,827 Updated Jan 12, 2025

《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases

Jupyter Notebook 14,225 2,946 Updated Dec 24, 2024
Next
Showing results