- University of Science and Technology of China, Hefei, China
Stars
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you…
This package contains the original 2012 AlexNet code.
《开源大模型食用指南》 (A Practical Guide to Open-Source LLMs): tutorials, tailored for Chinese beginners, on quickly fine-tuning (full-parameter/LoRA) and deploying open-source LLMs and multimodal large models (MLLMs), both domestic and international, in a Linux environment.
DeepSeek-V3/R1 inference performance simulator
Analyze computation-communication overlap in V3/R1.
A high-throughput and memory-efficient inference and serving engine for LLMs
How to optimize common algorithms in CUDA.
A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.
🤱🏻 Turn any webpage into a desktop app with Rust; easily build lightweight, cross-platform desktop apps.
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
MoBA: Mixture of Block Attention for Long-Context LLMs
My learning notes and code for ML systems (ML SYS).
Disaggregated serving system for Large Language Models (LLMs).
Efficient and easy multi-instance LLM serving
yinfan98 / PaddleSpeech (forked from PaddlePaddle/PaddleSpeech)
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…
[ICLR 2025] DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads
A modular graph-based Retrieval-Augmented Generation (RAG) system
The globally designated official GitHub of 润学 ("Runology", the study of emigrating from China): compiles its purpose, principles, theory, and real-world examples of emigration; addresses the three big questions of why to leave, where to go, and how to leave; and aims to become the core religion and core belief of the new Chinese people.
SGLang is a fast serving framework for large language models and vision language models.
A flexible package manager that supports multiple versions, configurations, platforms, and compilers.
A low-latency & high-throughput serving engine for LLMs
A tool for bandwidth measurements on NVIDIA GPUs.
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Dynamic Memory Management for Serving LLMs without PagedAttention
Repository hosting code for "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).