-
Notifications
You must be signed in to change notification settings - Fork 149
Issues: predibase/lorax
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Quantization appears to be broken, at least for AWQ and BnB
#722
opened Dec 21, 2024 by
codybum
2 of 4 tasks
Attention not working properly in FlashRobertaModel and FlashBertModel
#694
opened Nov 22, 2024 by
sgiorgis
2 of 4 tasks
Not able to host Llama3.2-11b on Azure A100 80GB server
#679
opened Nov 13, 2024 by
alokgupta1996
2 of 4 tasks
Throughput and Latency degradation with a single LoRA adapter on A100 40 GB
#670
opened Nov 8, 2024 by
kaushikmitr
2 of 4 tasks
Things started failing after new commit into main
#661
opened Oct 30, 2024 by
gane5hvarma
2 of 4 tasks
Issue: recognizing a base causal language model as an embedding model
#657
opened Oct 24, 2024 by
veezbo
2 of 4 tasks
Unexpected response with long-context model (Phi-3)
#651
opened Oct 17, 2024 by
prd-tuong-nguyen
2 of 4 tasks
Phi 3.5 vision (4B model)
enhancement
New feature or request
#637
opened Oct 8, 2024 by
CheeseAndMeat
2 tasks done
seems like when max total token is so huge like 130000, and in the request if there is no max new token the response will be wrong
#601
opened Sep 11, 2024 by
ejiang-eog
1 of 4 tasks
Fail to run server with prefix-caching option
#599
opened Sep 11, 2024 by
prd-tuong-nguyen
2 of 4 tasks
Passing a
--revision
causes failure in loading tokenizer config
#563
opened Aug 1, 2024 by
chiragjn
2 of 4 tasks
if LoRAX is based on punica kernels will it be able to support LoRA Adapters for Mistral NeMO 12B?
#549
opened Jul 20, 2024 by
tensimixt
LORAX_USE_GLOBAL_HF_TOKEN is not applied at the first time of calling adapter from huggingface private hub
#541
opened Jul 16, 2024 by
monologg
2 of 4 tasks
RuntimeError: CUDA error: no kernel image is available for execution on the device
#535
opened Jul 3, 2024 by
nethi
3 of 4 tasks
Previous Next
ProTip!
no:milestone will show everything without a milestone.