mirror of
https://github.com/saymrwulf/transformers.git
synced 2026-05-15 21:01:19 +00:00
* Initial work * More or less finished with first draft * Update src/transformers/modeling_utils.py Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> * Update src/transformers/modeling_utils.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Fix randomly initialized weights * Update src/transformers/modeling_utils.py Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr> * Address review comments * Rename DeepSpeed folder to temporarily fix the test issue? * Revert to try if Accelerate fix works * Use latest Accelerate release * Quality and fixes * Style * Quality * Add doc * Test + fix * More blocks Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr> |
||
|---|---|---|
| .. | ||
| callback.mdx | ||
| configuration.mdx | ||
| data_collator.mdx | ||
| deepspeed.mdx | ||
| feature_extractor.mdx | ||
| keras_callbacks.mdx | ||
| logging.mdx | ||
| model.mdx | ||
| onnx.mdx | ||
| optimizer_schedules.mdx | ||
| output.mdx | ||
| pipelines.mdx | ||
| processors.mdx | ||
| text_generation.mdx | ||
| tokenizer.mdx | ||
| trainer.mdx | ||