Replies: 1 comment
-
Hi, great to hear you're exploring hyperparameter tuning for the Llama-2 model using Ludwig! Deep learning is an empirical science, so please take this with a grain of salt. Based on my experience, here are some hyperparameters that you might consider tuning:
Others, feel free to chime in, and if you give some of these suggestions a try, let me know whether this lines up!
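For the two hyperparameters named in the question (learning rate and epochs), a random search over a small space could be sketched roughly as below. This is plain standard-library Python, not Ludwig's hyperopt API. The names `SEARCH_SPACE`, `sample_trial`, and `sample_trials` are hypothetical, and the bounds are placeholder assumptions rather than recommendations from this thread.

```python
import math
import random

# Hypothetical search space. The bounds are illustrative placeholders,
# not values recommended anywhere in this thread.
SEARCH_SPACE = {
    "learning_rate": (1e-5, 1e-3),  # sampled log-uniformly
    "epochs": (1, 5),               # sampled as an integer, bounds inclusive
}


def sample_trial(rng):
    """Draw one candidate configuration from SEARCH_SPACE."""
    lr_low, lr_high = SEARCH_SPACE["learning_rate"]
    # Sample uniformly in log space so each order of magnitude is equally likely.
    learning_rate = math.exp(rng.uniform(math.log(lr_low), math.log(lr_high)))
    ep_low, ep_high = SEARCH_SPACE["epochs"]
    return {"learning_rate": learning_rate, "epochs": rng.randint(ep_low, ep_high)}


def sample_trials(n, seed=0):
    """Draw n candidate configurations reproducibly from a fixed seed."""
    rng = random.Random(seed)
    return [sample_trial(rng) for _ in range(n)]


if __name__ == "__main__":
    for trial in sample_trials(5):
        print(trial)
```

Sampling the learning rate in log space is a common choice because useful values often span several orders of magnitude. Each sampled dictionary would then be passed to whatever training run you use for evaluation.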
-
Hi,
I am exploring hyperparameter tuning of the Llama-2 model through Ludwig.
Since LLMs have a large number of hyperparameters, which are the most important ones to consider when fine-tuning (for example, learning rate and epochs)?
Can anyone share a list of hyperparameters and corresponding value ranges as a starting point, based on your experience fine-tuning LLMs?
Thanks