Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Any lessons from Imbue for training-in-the-large? #270

Open
lukstafi opened this issue Jul 4, 2024 · 1 comment
Open

Any lessons from Imbue for training-in-the-large? #270

lukstafi opened this issue Jul 4, 2024 · 1 comment
Labels
explore Priority below "enhancement", non-blocking for milestones

Comments

@lukstafi
Copy link
Collaborator

lukstafi commented Jul 4, 2024

https://imbue.com/research/70b-infrastructure/

"In the span of a few months, with a small team of researchers and engineers, we trained a 70B parameter model from scratch on our own infrastructure that outperformed zero-shot GPT-4o on reasoning-related tasks.

Today, we’re sharing an end-to-end guide for setting up the required infrastructure: from bringing up the initial cluster and installing the OS, to automatically recovering from errors encountered during training."

@lukstafi lukstafi added the explore Priority below "enhancement", non-blocking for milestones label Jul 4, 2024
@lukstafi
Copy link
Collaborator Author

Also, from llm.c:
karpathy/llm.c#677

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
explore Priority below "enhancement", non-blocking for milestones
Projects
None yet
Development

No branches or pull requests

1 participant