Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

NeMo is not friendly to HF compatibility. #12166

Open
dyang67 opened this issue Feb 13, 2025 · 0 comments
Open

NeMo is not friendly to HF compatibility. #12166

dyang67 opened this issue Feb 13, 2025 · 0 comments
Assignees

Comments

@dyang67
Copy link

dyang67 commented Feb 13, 2025

Why isn’t NeMo compatible with HF’s checkpoints?

Each conversion causes many compatibility issues, significantly increasing Development costs and time.

Since HF models are updated daily, AI models' architectures undergo frequent changes, ensuring compatibility is quite challenging.

Give an example.

Mistral offers various models, such as NeMo-12B, Mistral-Small 22B, and the latest Mistral 24B. However, NeMo can only perfectly convert Mistral-7B and Mixtral-7*8B. With significant changes in Mistral 24B, converting HF checkpoints requires substantial effort and meticulous validation, creating a considerable challenge for developers.

Has NeMo considered being more compatible with HF? Directly load HF models without checkpoint conversion?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants