transformers

mirror of https://github.com/saymrwulf/transformers.git synced 2026-05-14 20:58:08 +00:00

History

Sugawara 6daa9cb515 add GPTNeoXForSequenceClassification (#22671 ) * add GPTNeoXForSequenceClassification * move the labels to logits.device (ref: #22561) * fix		2023-04-10 11:52:23 -04:00
..
internal	Generate: `TextIteratorStreamer` (streamer for gradio) (#22501 )	2023-04-03 15:04:37 +01:00
main_classes	Fix quantization docs typo (#22666 )	2023-04-10 08:53:53 -04:00
model_doc	add GPTNeoXForSequenceClassification (#22671 )	2023-04-10 11:52:23 -04:00
tasks	add GPTNeoXForSequenceClassification (#22671 )	2023-04-10 11:52:23 -04:00
_config.py
_toctree.yml	Add GPTBigCode model (Optimized GPT2 with MQA from Santacoder & BigCode) (#22575 )	2023-04-10 10:57:21 +02:00
accelerate.mdx
add_new_model.mdx	🚨🚨🚨 Enforce single model initialization (#21431 )	2023-02-09 15:46:26 -05:00
add_new_pipeline.mdx
add_tensorflow_model.mdx
attention.mdx	Refactor model summary (#21408 )	2023-02-15 10:35:14 -08:00
autoclass_tutorial.mdx
benchmarks.mdx
bertology.mdx	update: bertology paper (#22012 )	2023-03-08 07:54:30 -05:00
big_models.mdx
community.mdx	Fix en documentation typos (#21799 )	2023-02-27 08:36:36 +01:00
contributing.md
converting_tensorflow_models.mdx
create_a_model.mdx
custom_models.mdx
debugging.mdx
fast_tokenizers.mdx
generation_strategies.mdx	Generate: add API warning to streamers (#22659 )	2023-04-07 14:15:20 -04:00
glossary.mdx	docs: New terms and updates to glossary (#21982 )	2023-03-13 19:09:37 -04:00
hpo_train.mdx
index.mdx	Add GPTBigCode model (Optimized GPT2 with MQA from Santacoder & BigCode) (#22575 )	2023-04-10 10:57:21 +02:00
installation.mdx	Can't install tf2 on M1 Chip by default (#22046 )	2023-03-09 07:44:58 -05:00
migration.mdx
model_sharing.mdx	Fix `PushToHubCallback` import in Share a model docs (#21457 )	2023-02-06 09:26:22 -05:00
model_summary.mdx	Refactor model summary (#21408 )	2023-02-15 10:35:14 -08:00
multilingual.mdx
notebooks.md
pad_truncation.mdx	Example of pad_to_multiple_of for padding and truncation guide & docstring update (#22278 )	2023-03-20 14:18:55 -04:00
perf_hardware.mdx
perf_infer_cpu.mdx
perf_infer_gpu_many.mdx
perf_infer_gpu_one.mdx	[`Doc`] Fix int8 docs (#21487 )	2023-02-07 15:09:27 +01:00
perf_infer_special.mdx
perf_train_cpu.mdx	Add perf numbers for perf_train_cpu (#20974 )	2023-02-06 09:20:43 -05:00
perf_train_cpu_many.mdx
perf_train_gpu_many.mdx
perf_train_gpu_one.mdx
perf_train_special.mdx
perf_train_tpu.mdx
perf_train_tpu_tf.mdx	Typos/fixes to link syntax (#21450 )	2023-02-07 15:19:19 +00:00
performance.mdx
perplexity.mdx	Fix bug in perplexity guide calculations and update perplexity numbers. Fixes #22348 (#22411 )	2023-03-28 09:09:17 -04:00
philosophy.mdx
pipeline_tutorial.mdx	Update 2 doctest expected values for torch 2.0.0 (#22148 )	2023-03-14 09:13:16 +00:00
pipeline_webserver.mdx	Update quality tooling for formatting (#21480 )	2023-02-06 18:10:56 -05:00
pr_checks.mdx	Cleanup quality (#21493 )	2023-02-07 12:27:31 -05:00
preprocessing.mdx
quicktour.mdx	Fix 2 quicktour file doctest (#21742 )	2023-02-23 09:41:28 +01:00
run_scripts.mdx
sagemaker.mdx
serialization.mdx	Add Mega: Moving Average Equipped Gated Attention (#21766 )	2023-03-24 08:17:27 -04:00
task_summary.mdx	Remove trailing 'extractive' word from en documentation (#21594 )	2023-02-13 10:09:00 -05:00
tasks_explained.mdx	Update task summary (#21067 )	2023-02-02 11:41:27 -08:00
testing.mdx	[`tests`] add `accelerate` marker (#21743 )	2023-02-27 12:33:34 +01:00
tf_xla.mdx
tokenizer_summary.mdx
torchscript.mdx
training.mdx
troubleshooting.mdx	Removed BLIP mention from the troubleshooting guide (#21872 )	2023-03-01 08:26:25 -05:00