mirror of
https://github.com/saymrwulf/transformers.git
synced 2026-05-14 20:58:08 +00:00
* Fix the incorrect permutation of gguf * rename num_kv_heads Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com> * add typing to num_kv_heads Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com> * rename variables * refactor permute function name * update the expected text of the llama3 q4 test --------- Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com> |
||
|---|---|---|
| .. | ||
| aqlm_integration | ||
| autoawq | ||
| bnb | ||
| eetq_integration | ||
| ggml | ||
| gptq | ||
| hqq | ||
| quanto_integration | ||