Skip to content

Support GPTQ/Marlin format quantization (4bit weight, f16 input) (#89) #285

Support GPTQ/Marlin format quantization (4bit weight, f16 input) (#89)

Support GPTQ/Marlin format quantization (4bit weight, f16 input) (#89) #285