Your script to quantize the instructor models simply doesn't work. #85
Comments
It works just fine for me. For GPU, you need to seek other solutions (https://discuss.pytorch.org/t/does-dynamic-quantization-support-gpu/119231).
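For readers following along, here is a minimal sketch of what dynamic quantization looks like in PyTorch and why the result stays on the CPU. It is not the script from this repo's readme: a small toy `nn.Sequential` stands in for the INSTRUCTOR model (an assumption made so the snippet runs without downloading weights), and the layer sizes are arbitrary.

```python
import torch
import torch.nn as nn

# Toy stand-in for the Linear-heavy part of an embedding model.
# This is a hypothetical model, not INSTRUCTOR itself.
model = nn.Sequential(
    nn.Linear(16, 32),
    nn.ReLU(),
    nn.Linear(32, 8),
).eval()

# Dynamic quantization: Linear weights are stored as int8, while
# activations are quantized on the fly at inference time.
quantized = torch.quantization.quantize_dynamic(
    model, {nn.Linear}, dtype=torch.qint8
)

x = torch.randn(4, 16)  # CPU tensor; the quantized kernels run on CPU
with torch.no_grad():
    out = quantized(x)

print(type(quantized[0]))     # the Linear layer was swapped for a quantized one
print(out.shape, out.device)
```

The quantized `Linear` modules rely on CPU kernels, which is why the float32 model can run on CUDA while the dynamically quantized one cannot.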
Do you have a sample script or series of commands that you used? I've tried it verbatim.
I just used the code from the quantization section in the readme of this repo.
Thanks. I can't get it to work. If I try to get the error messages or log messages, would you be willing to help me just a little bit?
I figured out how to dynamically quantize the instructor-xl model, but at the point where it's supposed to create the embeddings, I want it to use GPU acceleration (CUDA), just like it does when I use the float32 version of the model. Is that possible? If I understand the comments above, it's not. What about quantizing the model beforehand, NOT using the "dynamic" method? I've been struggling with this for months, so any help would be much appreciated. The link above is to a discussion from back in 2021, and "seek other solutions" doesn't point me in the right direction. I'm looking at bitsandbytes but couldn't find a solution there either. Here is the portion of the script I'm trying to use:
Hi, Thanks a lot for your interest in the INSTRUCTOR! The following seems to work for me:
Hope this helps!
Here is what I get with this script:
I'm on an M1 Mac with sentence_transformers==2.2.2 (I also had the problem with the token, #106).
I've tried every which way and just can't get it to work. There are no other examples on the Internet of it working either.