Stars
A Zotero plugin for syncing items and notes into Notion
Open source code for AlphaFold 2.
Whisper realtime streaming for long speech-to-text transcription and translation
Port of OpenAI's Whisper model in C/C++
Generative AI extensions for onnxruntime
llama3 implementation one matrix multiplication at a time
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
EmbeddedLLM / vllm
Forked from vllm-project/vllm. vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs
A plugin that automatically downloads PDFs of Zotero items from Sci-Hub
A high-throughput and memory-efficient inference and serving engine for LLMs
Build high-quality LLM apps - from prototyping and testing to production deployment and monitoring.
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…
Fast and memory-efficient exact attention
ONNX Graph ToolBox - Operate on your ONNX model with ease, visualize ONNX LLM models containing thousands of nodes.
A Chinese Hanzi variation of Wordle - 汉字 Wordle
Benchmarking Deep Learning operations on different hardware
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
Visualizer for neural network, deep learning and machine learning models
Hung-yi Lee Deep Learning Tutorial (recommended by Prof. Hung-yi Lee 👍, the "Apple Book" 🍎). PDF download: https://github.com/datawhalechina/leedl-tutorial/releases