How to use train and test split with the recipes? #2222
Labels
enhancement
New feature or request
triaged
This issue has been assigned an owner and appropriate label
Dear torchtune team,
With sft trainer we can do
train_dataset=ds["train"],
eval_dataset=ds["validation"],
if it is a split from huggingface dataset.
I wonder under a fine-tuning recipes with instruction dataset, how is this achieved, particular in a YAML configuration file? With current example on the tutorial: split: train, I feel that the whole dataset is used for training. Should we prepare json/csv files before hand with spitted on train/test/validation set?
Thanks
The text was updated successfully, but these errors were encountered: