transformers/docs/source/en
Sugawara 6daa9cb515
add GPTNeoXForSequenceClassification (#22671)
* add GPTNeoXForSequenceClassification

* move the labels to logits.device (ref: #22561)

* fix
2023-04-10 11:52:23 -04:00
internal Generate: TextIteratorStreamer (streamer for gradio) (#22501) 2023-04-03 15:04:37 +01:00
main_classes Fix quantization docs typo (#22666) 2023-04-10 08:53:53 -04:00
model_doc add GPTNeoXForSequenceClassification (#22671) 2023-04-10 11:52:23 -04:00
tasks add GPTNeoXForSequenceClassification (#22671) 2023-04-10 11:52:23 -04:00
_config.py
_toctree.yml Add GPTBigCode model (Optimized GPT2 with MQA from Santacoder & BigCode) (#22575) 2023-04-10 10:57:21 +02:00
accelerate.mdx
add_new_model.mdx 🚨🚨🚨 Enforce single model initialization (#21431) 2023-02-09 15:46:26 -05:00
add_new_pipeline.mdx
add_tensorflow_model.mdx
attention.mdx Refactor model summary (#21408) 2023-02-15 10:35:14 -08:00
autoclass_tutorial.mdx
benchmarks.mdx
bertology.mdx update: bertology paper (#22012) 2023-03-08 07:54:30 -05:00
big_models.mdx
community.mdx Fix en documentation typos (#21799) 2023-02-27 08:36:36 +01:00
contributing.md
converting_tensorflow_models.mdx
create_a_model.mdx
custom_models.mdx
debugging.mdx
fast_tokenizers.mdx
generation_strategies.mdx Generate: add API warning to streamers (#22659) 2023-04-07 14:15:20 -04:00
glossary.mdx docs: New terms and updates to glossary (#21982) 2023-03-13 19:09:37 -04:00
hpo_train.mdx
index.mdx Add GPTBigCode model (Optimized GPT2 with MQA from Santacoder & BigCode) (#22575) 2023-04-10 10:57:21 +02:00
installation.mdx Can't install tf2 on M1 Chip by default (#22046) 2023-03-09 07:44:58 -05:00
migration.mdx
model_sharing.mdx Fix PushToHubCallback import in Share a model docs (#21457) 2023-02-06 09:26:22 -05:00
model_summary.mdx Refactor model summary (#21408) 2023-02-15 10:35:14 -08:00
multilingual.mdx
notebooks.md
pad_truncation.mdx Example of pad_to_multiple_of for padding and truncation guide & docstring update (#22278) 2023-03-20 14:18:55 -04:00
perf_hardware.mdx
perf_infer_cpu.mdx
perf_infer_gpu_many.mdx
perf_infer_gpu_one.mdx [Doc] Fix int8 docs (#21487) 2023-02-07 15:09:27 +01:00
perf_infer_special.mdx
perf_train_cpu.mdx Add perf numbers for perf_train_cpu (#20974) 2023-02-06 09:20:43 -05:00
perf_train_cpu_many.mdx
perf_train_gpu_many.mdx
perf_train_gpu_one.mdx
perf_train_special.mdx
perf_train_tpu.mdx
perf_train_tpu_tf.mdx Typos/fixes to link syntax (#21450) 2023-02-07 15:19:19 +00:00
performance.mdx
perplexity.mdx Fix bug in perplexity guide calculations and update perplexity numbers. Fixes #22348 (#22411) 2023-03-28 09:09:17 -04:00
philosophy.mdx
pipeline_tutorial.mdx Update 2 doctest expected values for torch 2.0.0 (#22148) 2023-03-14 09:13:16 +00:00
pipeline_webserver.mdx Update quality tooling for formatting (#21480) 2023-02-06 18:10:56 -05:00
pr_checks.mdx Cleanup quality (#21493) 2023-02-07 12:27:31 -05:00
preprocessing.mdx
quicktour.mdx Fix 2 quicktour file doctest (#21742) 2023-02-23 09:41:28 +01:00
run_scripts.mdx
sagemaker.mdx
serialization.mdx Add Mega: Moving Average Equipped Gated Attention (#21766) 2023-03-24 08:17:27 -04:00
task_summary.mdx Remove trailing 'extractive' word from en documentation (#21594) 2023-02-13 10:09:00 -05:00
tasks_explained.mdx Update task summary (#21067) 2023-02-02 11:41:27 -08:00
testing.mdx [tests] add accelerate marker (#21743) 2023-02-27 12:33:34 +01:00
tf_xla.mdx
tokenizer_summary.mdx
torchscript.mdx
training.mdx
troubleshooting.mdx Removed BLIP mention from the troubleshooting guide (#21872) 2023-03-01 08:26:25 -05:00