I think it's worth a try. You can simply use the script at https://github.com/OpenLMLab/collie/blob/dev/examples/profile/memory_optim.py, just changing the 2048 to 4096 :)
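The change described above is a one-line edit to the sequence-length setting in the profiling script. As a rough sanity check before trying it, note that activation memory grows at least linearly with sequence length (and quadratically for vanilla attention), so doubling the context from 2048 to 4096 at least doubles activation memory. A minimal sketch, with hypothetical names (not taken from `memory_optim.py` itself):

```python
def activation_ratio(new_len: int, old_len: int, attention: str = "linear") -> float:
    """Lower-bound ratio of activation memory after a sequence-length change.

    'linear' covers most activations; 'quadratic' covers the attention
    score matrix under vanilla (non-flash) attention.
    """
    if attention == "quadratic":
        return (new_len / old_len) ** 2
    return new_len / old_len

# Going from 2048 to 4096:
print(activation_ratio(4096, 2048))               # 2.0 (most activations)
print(activation_ratio(4096, 2048, "quadratic"))  # 4.0 (attention scores)
```

So even with the memory optimizations in the example script, expect a substantially larger activation footprint at 4096 than at 2048; running the profiling script is the reliable way to find out whether it fits.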
I was wondering if it is possible to train on 4 A6000s with the 4096 context length option.
I think it's probably not possible, but I'm asking just in case.