mirror of
https://github.com/saymrwulf/transformers.git
synced 2026-05-14 20:58:08 +00:00
* Add TorchAOHfQuantizer Summary: Enable loading torchao quantized model in huggingface. Test Plan: local test Reviewers: Subscribers: Tasks: Tags: * Fix a few issues * style * Added tests and addressed some comments about dtype conversion * fix torch_dtype warning message * fix tests * style * TorchAOConfig -> TorchAoConfig * enable offload + fix memory with multi-gpu * update torchao version requirement to 0.4.0 * better comments * add torch.compile to torchao README, add perf number link --------- Co-authored-by: Marc Sun <marc@huggingface.co> |
||
|---|---|---|
| .. | ||
| aqlm.md | ||
| awq.md | ||
| bitsandbytes.md | ||
| contribute.md | ||
| eetq.md | ||
| fbgemm_fp8.md | ||
| gptq.md | ||
| hqq.md | ||
| optimum.md | ||
| overview.md | ||
| quanto.md | ||
| torchao.md | ||