transformers/docs/source/en/quantization
Marc Sun 96a074fa7e
Add new quant method (#32047)
* Add new quant method

* update

* fix multi-device

* add test

* add offload

* style

* style

* add simple example

* initial doc

* docstring

* style again

* works ?

* better docs

* switch to non persistent

* remove print

* fix init

* code review
2024-07-22 20:21:59 +02:00
aqlm.md
awq.md docs: fix broken link (#31370) 2024-06-12 11:33:00 +01:00
bitsandbytes.md
contribute.md
eetq.md
fbgemm_fp8.md Add new quant method (#32047) 2024-07-22 20:21:59 +02:00
gptq.md
hqq.md
optimum.md
overview.md Add new quant method (#32047) 2024-07-22 20:21:59 +02:00
quanto.md