
yaimath/pychain-lab

🧮 LLM Math Reasoning Lab

This repository is a collection of projects that research and implement techniques for applying LLMs to math problem solving.
Each subdirectory is an independent experimental project that explores how to solve math problems with LLMs, using a variety of models and reasoning methods.


📂 Subdirectories

| Project | Description |
| --- | --- |
| verifier | A verifier that checks the output of LLaDA-CoT. Based on a DeepSeek model and trained on the OpenMathReasoning dataset. |
| chain-of-thought | QLoRA-based Chain-of-Thought training, including retraining and routing models driven by incorrect answers. |
| discrete-diffusion-llm | A diffusion-style model that performs math reasoning in a semi-autoregressive manner. |
| tool-integrated-reasoning | An experiment implementing tool-call-based math reasoning (TIR) on top of a Nemotron model. |

📦 Package Installation

See the requirements.txt file in each subdirectory.


🚀 How to Run

See the README.md inside each project directory.
