Skip to content

Support GPTQ/Marlin format quantization (4bit weight, f16 input) #284

Support GPTQ/Marlin format quantization (4bit weight, f16 input)

Support GPTQ/Marlin format quantization (4bit weight, f16 input) #284