Actions: hiyouga/LLaMA-Factory

Showing runs from all workflows
3,173 workflow runs

Add trust_remote_code Parameter and Set Default to False
tests #1639: Pull request #5819 synchronize by yafshar
December 12, 2024 13:16 Action required yafshar:remote_code
December 12, 2024 09:33 11s
Merge pull request #6317 from hiyouga/hiyouga/qwenvl_mrope
tests #1638: Commit c708ebd pushed by hiyouga
December 12, 2024 09:22 7m 43s main
Fine-tuning input is constructed unreasonably when using the Qwen data template
label_issue #1804: Issue #6318 opened by phbst
December 12, 2024 09:17 14s
[model] fix: qwen2vl mrope
tests #1637: Pull request #6317 opened by hiyouga
December 12, 2024 09:12 6m 47s hiyouga/qwenvl_mrope
Setting of padding_side
label_issue #1803: Issue #6316 opened by lllabmaster
December 12, 2024 09:08 14s
Help request: setting batch_size for Qwen2-7B DPO training on Ascend
label_issue #1802: Issue #6315 opened by liuanping
December 12, 2024 05:48 10s
support telechat2 model
tests #1636: Pull request #6313 opened by ge-xing
December 12, 2024 01:41 Action required ge-xing:main
Loss computation for the sharegpt data format may be incorrect
label_issue #1800: Issue #6312 opened by yangchao-zhou
December 11, 2024 09:42 12s
Error with single-node multi-GPU training
label_issue #1799: Issue #6311 opened by 122550888
December 11, 2024 09:42 15s
Add PEFT add_weighted_adapter() Function for Merging Multiple Adapters
tests #1635: Pull request #6310 synchronize by Dlemonha
December 11, 2024 08:53 Action required Dlemonha:dwt_llama_factory
Add PEFT add_weighted_adapter() Function for Merging Multiple Adapters
tests #1634: Pull request #6310 opened by Dlemonha
December 11, 2024 08:37 Action required Dlemonha:dwt_llama_factory
Should we support TGI-v3 serving model integration?
label_issue #1797: Issue #6308 opened by phanxuanphucnd
December 11, 2024 07:48 10s
llamafactory-cli webchat inference is very slow
label_issue #1796: Issue #6307 opened by eyexin
December 11, 2024 02:23 14s
Add an early stopping mechanism
label_issue #1795: Issue #6306 opened by huangshimai
December 11, 2024 02:23 19s
Help
label_issue #1794: Issue #6304 opened by Bing-a-ling7
December 10, 2024 13:52 12s
Loss curve plot is not output correctly when fine-tuning following the LLaMA-Factory QuickStart
label_issue #1793: Issue #6303 opened by qfikh
December 10, 2024 13:31 12s
max_steps: 256, streaming: true, buffer_size: 128 raises an error
label_issue #1792: Issue #6302 opened by StarDewXXX
December 10, 2024 12:03 14s
Qwen2.5 72B 32k full-parameter fine-tuning fails to run
label_issue #1791: Issue #6301 opened by iallblue
December 10, 2024 10:11 18s
Multimodal paligemma training: ValueError: Decompressed Data Too Large
label_issue #1790: Issue #6300 opened by CLL112
December 10, 2024 08:48 12s
ReFT is making a splash at OpenAI; are there plans to support ReFT?
label_issue #1789: Issue #6299 opened by zhoushaoxiang
December 10, 2024 08:03 12s
December 10, 2024 06:31 9s