transformers/tests
Ilyas Moutawwakil 4fc708f98c
Exllama kernels support for AWQ models (#28634)
* added exllama kernels support for awq models

* doc

* style

* Update src/transformers/modeling_utils.py

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

* refactor

* moved exllama post init to after device dispatching

* bump autoawq version

* added exllama test

* style

* configurable exllama kernels

* copy exllama_config from gptq

* moved exllama version check to post init

* moved to quantization dockerfile

---------

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
2024-03-05 03:22:48 +01:00
benchmark
bettertransformer
deepspeed fix failing trainer ds tests (#29057) 2024-02-16 17:18:45 +05:30
extended
fixtures
fsdp Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
generation GenerationConfig validate both constraints and force_words_ids (#29163) 2024-02-27 01:43:52 +01:00
models Add UDOP (#22940) 2024-03-04 18:49:02 +01:00
optimization
peft_integration FIX [CI]: Fix failing tests for peft integration (#29330) 2024-02-29 03:56:16 +01:00
pipelines [tests] enable automatic speech recognition pipeline tests on XPU (#29308) 2024-03-04 08:24:38 +01:00
quantization Exllama kernels support for AWQ models (#28634) 2024-03-05 03:22:48 +01:00
repo_utils
sagemaker Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
tokenization Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
tools
trainer 🚨 Fully revert atomic checkpointing 🚨 (#29370) 2024-03-04 06:17:42 -05:00
utils Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
__init__.py
test_backbone_common.py
test_cache_utils.py Llama: fix batched generation (#29109) 2024-02-20 10:23:17 +00:00
test_configuration_common.py
test_configuration_utils.py Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
test_feature_extraction_common.py
test_feature_extraction_utils.py
test_image_processing_common.py Raise unused kwargs image processor (#29063) 2024-02-20 16:20:20 +01:00
test_image_processing_utils.py
test_image_transforms.py
test_modeling_common.py Adding SegGPT (#27735) 2024-02-26 18:17:19 +00:00
test_modeling_flax_common.py [Flax] Update no init test for Flax v0.7.1 (#28735) 2024-01-26 18:20:39 +00:00
test_modeling_flax_utils.py Enable safetensors conversion from PyTorch to other frameworks without the torch requirement (#27599) 2024-01-23 10:28:23 +01:00
test_modeling_tf_common.py Add tf_keras imports to prepare for Keras 3 (#28588) 2024-01-30 17:26:36 +00:00
test_modeling_tf_utils.py Add tf_keras imports to prepare for Keras 3 (#28588) 2024-01-30 17:26:36 +00:00
test_modeling_utils.py Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
test_pipeline_mixin.py Image Feature Extraction pipeline (#28216) 2024-02-05 14:50:07 +00:00
test_processing_common.py Don't save processor_config.json if a processor has no extra attribute (#28584) 2024-01-19 09:59:14 +00:00
test_sequence_feature_extraction_common.py
test_tokenization_common.py Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
test_tokenization_utils.py Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00