REIGN12
  • Tsinghua University

Organizations

@Open-Reasoner-Zero


Efficient Triton Kernels for LLM Training

Python 4,741 286 Updated Mar 28, 2025

Fully open reproduction of DeepSeek-R1

Python 23,450 2,132 Updated Mar 28, 2025

An extremely fast Python package and project manager, written in Rust.

Rust 46,827 1,318 Updated Mar 28, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 40,525 6,786 Updated Mar 27, 2025

The official implementation of the work "REPARO: Compositional 3D Assets Generation with Differentiable 3D Layout Alignment".

Python 109 1 Updated Sep 14, 2024

Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"

Python 476 35 Updated Mar 16, 2025

A modern download manager that supports all platforms. Built with Golang and Flutter.

Dart 18,617 1,297 Updated Mar 28, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,328 684 Updated Mar 28, 2025

🔥 A minimal training framework for scaling FLA models

Python 91 14 Updated Mar 22, 2025

Agentless🐱: an agentless approach to automatically solve software development problems

Python 1,600 168 Updated Dec 22, 2024

This repo contains the dataset and code for the paper "SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?"

Python 1,289 112 Updated Mar 13, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

6,946 230 Updated Mar 4, 2025
Python 4,084 328 Updated Mar 12, 2025

[ICLR 2025 Spotlight] Boltzmann-Aligned Inverse Folding Model as a Predictor of Mutational Effects on Protein-Protein Interactions

Python 20 Updated Mar 10, 2025

Official Repo for Open-Reasoner-Zero

Python 1,685 80 Updated Mar 5, 2025

Democratizing Reinforcement Learning for LLMs

Python 2,154 186 Updated Feb 16, 2025

A cat(1) clone with wings.

Rust 51,916 1,287 Updated Mar 18, 2025

s1: Simple test-time scaling

Python 6,086 711 Updated Mar 6, 2025
Python 573 20 Updated Mar 14, 2025
Python 262 16 Updated Mar 16, 2025

🤗 smolagents: a barebones library for agents that think in python code.

Python 15,932 1,406 Updated Mar 28, 2025

Keeps searching, reading webpages, and reasoning until it finds the answer (or exceeds the token budget)

TypeScript 3,718 347 Updated Mar 28, 2025

ripgrep recursively searches directories for a regex pattern while respecting your gitignore

Rust 51,359 2,092 Updated Feb 27, 2025

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 1,261 87 Updated Mar 26, 2025

Simple RL training for reasoning

Python 3,325 247 Updated Mar 26, 2025

A plugin manager for Fish

Shell 8,185 266 Updated Sep 10, 2024

CYaRon: Yet Another Random Olympic-iNformatics test data generator

Python 1,471 172 Updated Feb 22, 2025