Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Language capabilities #247

Open
janEbert opened this issue Jan 8, 2025 · 4 comments
Open

Language capabilities #247

janEbert opened this issue Jan 8, 2025 · 4 comments

Comments

@janEbert
Copy link

janEbert commented Jan 8, 2025

Congrats on the amazing model release! I've been unable to find which languages you actually consider "supported" for the model, since the paper just mentions that you were "expanding multilingual coverage beyond English and Chinese" without further data. If you don't have a concrete answer to that question, maybe you can respond which languages the model has been instruction-tuned with. If you can open up the per-language sampling/mixture rate in the data distribution, that'd also be very helpful.

I also noticed that the paper says you used the non-English part from the MMMLU dataset. In the paper, the HF repo is referenced, which does not contain an English part (I'm assuming you mean that the original English MMLU wasn't included), but which does contain a Chinese part.
I'm assuming that you did not include English in this evaluation set in order to not skew the multilingual results, because English is one of the two main powerful languages of the model. Shouldn't Chinese have been filtered as well to achieve a similar reduction of result skewing?

Thank you and good luck continuing the great work! :)

Copy link

github-actions bot commented Feb 8, 2025

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. If you believe this issue is still relevant, please leave a comment to keep it open. Thank you for your contributions!

@github-actions github-actions bot added the stale label Feb 8, 2025
@janEbert
Copy link
Author

janEbert commented Feb 9, 2025

Bla.

@github-actions github-actions bot removed the stale label Feb 10, 2025
Copy link

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. If you believe this issue is still relevant, please leave a comment to keep it open. Thank you for your contributions!

@github-actions github-actions bot added the stale label Mar 12, 2025
@MikeTheSnowman
Copy link

Bla again

@github-actions github-actions bot removed the stale label Mar 16, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants