
Can unlimiformer work with common fine-tuning methods? #23

Open
mrlzh opened this issue Jul 18, 2023 · 1 comment

mrlzh commented Jul 18, 2023

I set max_source_length to 10240 and trained T5 models, but training ran out of CUDA memory.
I would like to know whether Unlimiformer can be used together with common fine-tuning methods such as LoRA.

abertsch72 (Owner) commented
Hi @mrlzh , thanks for your interest!

We haven't tried using Unlimiformer with LoRA, but there isn't a theoretical reason that they wouldn't work together. If you try it, please let us know how it goes!
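As a rough back-of-envelope sketch (not Unlimiformer's or the repo's actual code), the reason LoRA could plausibly help with the OOM above: LoRA freezes the base weights and trains only two low-rank factors A (d x r) and B (r x k) per adapted matrix, so gradient and optimizer-state memory, a large share of fine-tuning memory, shrinks sharply. The numbers below assume a t5-base-sized 768 x 768 attention projection and rank 8, both illustrative choices:

```python
def trainable_params(d, k, rank=None):
    """Trainable parameters for one d x k weight matrix.

    rank=None means full fine-tuning; otherwise LoRA with the given rank.
    """
    if rank is None:
        return d * k            # every base weight gets a gradient
    return rank * (d + k)       # only the low-rank A and B factors are trained

# Example: one 768 x 768 projection (t5-base sized), LoRA rank 8.
full = trainable_params(768, 768)          # 589,824 trainable params
lora = trainable_params(768, 768, rank=8)  # 12,288 trainable params (~2%)
print(full, lora, f"{lora / full:.1%}")
```

Note that LoRA only reduces gradient/optimizer memory; the activation memory from a 10,240-token input still scales with sequence length, so LoRA alone may not be enough, and techniques like gradient checkpointing or a smaller batch size may also be needed.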
