- ๐ M.S. student at THU, with a CS background from BUPT.
- ๐ญ Diving into large language models and reasoning, with a long-term aim at AGI.
- ๐ Avid runner, fitness buff, and badminton player.
- ๐ฌ Open for chat and collaboration โ don't hesitate to reach out!
Pinned Loading
-
microsoft/rho
microsoft/rho PublicRepo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.
-
microsoft/ToRA
microsoft/ToRA PublicToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools [ICLR'24].
-
microsoft/ProphetNet
microsoft/ProphetNet PublicA research project for natural language generation, containing the official implementations by MSRA NLC team.
-
math-evaluation-harness
math-evaluation-harness PublicA simple toolkit for benchmarking LLMs on mathematical reasoning tasks. ๐งฎโจ
-
PaddlePaddle/Research
PaddlePaddle/Research Publicnovel deep learning research works with PaddlePaddle
-
CriticBench/CriticBench
CriticBench/CriticBench Public[ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.