Skip to content
View wangzihe1996's full-sized avatar

Block or report wangzihe1996

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results
Python 486 15 Updated Feb 27, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 41,659 5,656 Updated Mar 9, 2025

LLMs-from-scratch项目中文翻译

Jupyter Notebook 476 66 Updated Feb 8, 2025

Fully open reproduction of DeepSeek-R1

Python 22,424 2,011 Updated Mar 9, 2025

Repo for counting stars and contributing. Press F to pay respect to glorious developers.

270,396 21,114 Updated Oct 3, 2024

📻Terminal/ssh/telnet/serialport/RDP/VNC/sftp client(linux, mac, win)

JavaScript 11,938 981 Updated Mar 9, 2025

[ACL 2024] FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models

Python 97 13 Updated Dec 10, 2024

An opinionated list of awesome Python frameworks, libraries, software and resources.

Python 236,548 25,385 Updated Aug 11, 2024

A framework for few-shot evaluation of language models.

Python 8,174 2,182 Updated Mar 7, 2025

A Dataset for Multi-Turn Dialogue Reasoning

Python 304 40 Updated Oct 7, 2020

Powerful menu bar manager for macOS

Swift 17,218 307 Updated Jan 26, 2025

Awesome-LLM-Eval: a curated list of tools, datasets/benchmark, demos, leaderboard, papers, docs and models, mainly for Evaluation on LLMs. 一个由工具、基准/数据、演示、排行榜和大模型等组成的精选列表,主要面向基础大模型评测,旨在探求生成式AI的技术边界.

491 45 Updated Oct 25, 2024

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

C++ 22,546 5,678 Updated Mar 9, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,476 242 Updated Feb 20, 2025

Megatron's multi-modal data loader

Python 168 19 Updated Mar 7, 2025

Audio Large Language Models

Python 424 26 Updated Mar 9, 2025

Collection of papers for scalable automated alignment.

85 7 Updated Oct 22, 2024

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 7,713 618 Updated Mar 6, 2025
JavaScript 98 4 Updated Sep 10, 2024

《自然语言处理:大模型理论与实践》配套数据和代码

Python 53 5 Updated Dec 26, 2024

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…

Python 2,249 377 Updated Mar 9, 2025

A calmer internet, without any gimmicks.

JavaScript 26,644 696 Updated Mar 10, 2025

binary releases of VS Code without MS branding/telemetry/licensing

Shell 26,572 1,193 Updated Mar 9, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 43,527 5,327 Updated Mar 7, 2025

NCCL Tests

Cuda 1,020 264 Updated Feb 28, 2025

Monitor for displaying process traffic on Mac Status bar

Swift 614 37 Updated Sep 18, 2024

计算机自学指南

HTML 60,908 7,119 Updated Mar 3, 2025

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 5,798 505 Updated Mar 9, 2025
Next
Showing results