How can I run the mmlu task in offline mode? #2394

95jinchul · 2024-10-11T07:48:25Z

In the #1223, there are solution for offline mode.
So, I try run mmlu task using below yaml setting.

However, in the case of mmlu, it is difficult to transfer data_files to data_kwargs because it is mapped to group configuration.

Usually, datasets are imported in the following way.

for name in ['all', 'abstract_algebra', 'anatomy', 'astronomy', 'business_ethics', 'clinical_knowledge', 'college_biology', 'college_chemistry', 'college_computer_science', 'college_mathematics', 'college_medicine', 'college_physics ', 'computer_security', 'conceptual_physics', 'econometrics', ... ]: dataset = load_dataset("hails/mmlu_no_train", f'{name}') dataset.save_to_disk(f"dataset/mmlu/{name}")

So, is there any way to load this save_to_disk file into load_dataset? I would like to import it from harness as is without going through hf_hub, but errors always occur and difficulties arise.

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How can I run the mmlu task in offline mode? #2394

How can I run the mmlu task in offline mode? #2394

95jinchul commented Oct 11, 2024

How can I run the mmlu task in offline mode? #2394

How can I run the mmlu task in offline mode? #2394

Comments

95jinchul commented Oct 11, 2024