Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

NLG评估DPO,不输出结果 #6321

Open
1 task done
sunxiaoyu12 opened this issue Dec 12, 2024 · 0 comments
Open
1 task done

NLG评估DPO,不输出结果 #6321

sunxiaoyu12 opened this issue Dec 12, 2024 · 0 comments
Labels
pending This problem is yet to be addressed

Comments

@sunxiaoyu12
Copy link

Reminder

  • I have read the README and searched the existing issues.

System Info

  • llamafactory version: 0.9.1.dev0
  • Platform: Linux-6.2.0-26-generic-x86_64-with-glibc2.35
  • Python version: 3.10.12
  • PyTorch version: 2.3.0a0+ebedce2 (GPU)
  • Transformers version: 4.46.1
  • Datasets version: 3.1.0
  • Accelerate version: 1.0.1
  • PEFT version: 0.12.0
  • TRL version: 0.9.6
  • GPU type: NVIDIA H100 80GB HBM3
  • DeepSpeed version: 0.15.4
  • Bitsandbytes version: 0.44.1

Reproduction

CUDA_VISIBLE_DEVICES=0,1,2,3 llamafactory-cli train \
--stage dpo \
--do_predict True \
--finetuning_type lora \
--model_name_or_path  model/Meta-Llama-3-8B-Instruct \
--adapter_name_or_path  saves/Meta-Llama-3-8B-Instruct/lora/dpo_train_2024-12-12-10-01-38 \
--flash_attn auto \
--dataset_dir data \
--eval_dataset dpo_zh_demo \
--template llama3 \
--cutoff_len 1024 \
--max_samples 1000 \
--preprocessing_num_workers 16 \
--predict_with_generate False \
--per_device_eval_batch_size 2 \
--max_new_tokens 512 \
--top_p 0.7 \
--temperature 0.95 \
--output_dir saves/Meta-Llama-3-8B-Instruct/lora/eval_nlg_dpo_2024-12-12-09 \
--overwrite_output_dir True

4457ca8d26e80cf0e03ef62851af1cf9

Expected behavior

训练阶段是DPO,微调方法是lora,训练完成之后,使用以上代码进行模型NLG评估,运行期间并未出错,但是却没有结果输出,请问这是怎么回事?是否是少了什么参数?

Others

No response

@github-actions github-actions bot added the pending This problem is yet to be addressed label Dec 12, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
pending This problem is yet to be addressed
Projects
None yet
Development

No branches or pull requests

1 participant