Adding support for latest OLMo architectures #331
base: main
Conversation
Some comments... seems okay, but the tokenizer checks should be changed to not assume the user has hf_olmo installed.
@@ -0,0 +1,115 @@
+ARG CUDA
do we want to add documentation about building with this image or nah? We had it before.
model_name_or_path: ai2-adapt-dev/OLMo-medium-peteish7-anneal-from-928646-50B-nowup-dclm07-flan
model_revision: main
use_flash_attn: false
tokenizer_name: ai2-adapt-dev/OLMo-medium-peteish7-anneal-from-928646-50B-nowup-dclm07-flan
I would double-check with Luca that this is the right tokenizer. Sometimes conversions from the OLMo repo and tokenizers get jumbled...
open_instruct/dpo_tune.py (outdated)
@@ -672,7 +679,7 @@ def load_model():
     0,
     1,
 ], "LlamaTokenizer should only add one special token - the pad_token, or no tokens if pad token present."
-elif isinstance(tokenizer, GPTNeoXTokenizerFast):
+elif isinstance(tokenizer, GPTNeoXTokenizerFast) or isinstance(tokenizer, OLMoTokenizerFast):
Won't this error if hf_olmo is not installed? Maybe we should do:
elif isinstance(tokenizer, GPTNeoXTokenizerFast) or (check_hf_olmo_availability() and isinstance(tokenizer, OLMoTokenizerFast)):
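For reference, a minimal sketch of the guarded check being suggested. `check_hf_olmo_availability` is the helper added in this PR (assumed here to be implementable with `importlib`); the `is_gptneox_or_olmo_tokenizer` wrapper is hypothetical, just to make the short-circuit behavior concrete:

```python
# Sketch: guard the OLMoTokenizerFast check so hf_olmo is never touched
# when the package is absent. Names follow the suggestion above.
import importlib.util

from transformers import GPTNeoXTokenizerFast


def check_hf_olmo_availability() -> bool:
    # True if hf_olmo is importable, without actually importing it.
    return importlib.util.find_spec("hf_olmo") is not None


def is_gptneox_or_olmo_tokenizer(tokenizer) -> bool:
    # Hypothetical wrapper: the OLMo import and isinstance check only run
    # after the availability check succeeds, so this never raises
    # ModuleNotFoundError on machines without hf_olmo.
    if isinstance(tokenizer, GPTNeoXTokenizerFast):
        return True
    if check_hf_olmo_availability():
        from hf_olmo import OLMoTokenizerFast
        return isinstance(tokenizer, OLMoTokenizerFast)
    return False
```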
open_instruct/finetune.py (outdated)
@@ -643,7 +650,7 @@ def main(args: FlatArguments):
     0,
     1,
 ], "LlamaTokenizer should only add one special token - the pad_token, or no tokens if pad token present."
-elif isinstance(tokenizer, GPTNeoXTokenizerFast):
+elif isinstance(tokenizer, GPTNeoXTokenizerFast) or isinstance(tokenizer, OLMoTokenizerFast):  # noqa
Same comment as above:

elif isinstance(tokenizer, GPTNeoXTokenizerFast) or (check_hf_olmo_availability() and isinstance(tokenizer, OLMoTokenizerFast)):
if package_exists:
    try:
        # Primary method to get the package version
        package_version = importlib.metadata.version(pkg_name)
Curious why doing this over something like:

try:
    import hf_olmo
except ImportError:
    ...

is preferable. Don't mind it, just curious.
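One practical difference, sketched below: checking distribution metadata answers "is it installed?" without executing hf_olmo's import-time code, and it also surfaces the installed version. This mirrors the quoted snippet; the surrounding `is_package_available` helper and its exact signature are assumptions here:

```python
# Sketch of the metadata-based availability check around the quoted lines.
# Unlike `try: import hf_olmo`, this never runs the package's import-time
# code, and it treats a stray module without metadata as "not installed".
import importlib.metadata
import importlib.util


def is_package_available(pkg_name: str) -> bool:
    package_exists = importlib.util.find_spec(pkg_name) is not None
    if package_exists:
        try:
            # Primary method to get the package version
            package_version = importlib.metadata.version(pkg_name)
        except importlib.metadata.PackageNotFoundError:
            # A module exists on sys.path but is not an installed
            # distribution, so treat it as unavailable.
            package_exists = False
    return package_exists


print(is_package_available("hf_olmo"))
```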
Looks good! Do you have a testing run?
Yup @vwxyzjn, have trained a couple of models. Most recent one: https://wandb.ai/ai2-llm/open_instruct_internal/runs/ny1m6zwx?nw=nwusernatolambert (plus another that already had eval scores, see Slack).
LGTM! Nice change and thanks @natolambert!
# Conflicts:
#   open_instruct/mix_data.py
#   requirements.txt