Skip to content

Support GPTQ/Marlin format quantization (4bit weight, f16 input) #284

Support GPTQ/Marlin format quantization (4bit weight, f16 input)

Support GPTQ/Marlin format quantization (4bit weight, f16 input) #284

Annotations

1 error and 9 warnings

Typos

failed Oct 14, 2024 in 4s