Issues: modelscope/ms-swift
Fine-tuning best practices for qwen2.5-72b-instruct and qwen2...
#2064
opened Sep 18, 2024 by
Jintao-Huang
Open
18
Issues list
Error when using llava1.6_llama3.1_8B: error occurs in lazy tokenizer: pixel values
#2409
opened Nov 7, 2024 by
JHL328
Help: during LoRA fine-tuning, how does the trainer_train function in /swift/llm/sft.py drive the weight updates?
#2402
opened Nov 7, 2024 by
woshidahunzi1
Question: why does 'load_dataset_from_local' call 'preprocess_func' twice?
#2396
opened Nov 6, 2024 by
lluo-Desktop
RuntimeError: unsupported operation: more than one element of the written-to tensor refers to a single memory location. Please clone the tensor before performing the operation
bug
#2375
opened Nov 2, 2024 by
xiaochounikuaixiao