| Name | Last commit | Date |
| --- | --- | --- |
| .. | | |
| albert | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| align | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| altclip | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| audio_spectrogram_transformer | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| auto | | |
| autoformer | Compute dropout_probability only in training mode (#24486) | 2023-06-26 18:36:47 +02:00 |
| bark | Update Bark generation configs and tests (#25409) | 2023-08-09 18:28:02 +02:00 |
| bart | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| barthez | | |
| bartpho | | |
| beit | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| bert | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| bert_generation | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| bert_japanese | | |
| bertweet | | |
| big_bird | Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420) | 2023-06-22 16:11:27 +02:00 |
| bigbird_pegasus | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| biogpt | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| bit | Update old existing feature extractor references (#24552) | 2023-06-29 10:17:36 +01:00 |
| blenderbot | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| blenderbot_small | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| blip | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| blip_2 | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| bloom | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| bridgetower | Check models used for common tests are small (#24824) | 2023-07-14 14:43:19 -04:00 |
| byt5 | | |
| camembert | | |
| canine | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| chinese_clip | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| clap | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| clip | Fix test_model_parallelism (#25359) | 2023-08-08 10:48:45 +02:00 |
| clipseg | Fix test_model_parallelism (#25359) | 2023-08-08 10:48:45 +02:00 |
| codegen | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| conditional_detr | Check models used for common tests are small (#24824) | 2023-07-14 14:43:19 -04:00 |
| convbert | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| convnext | Update old existing feature extractor references (#24552) | 2023-06-29 10:17:36 +01:00 |
| convnextv2 | | |
| cpm | | |
| cpmant | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| ctrl | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| cvt | Make more test models smaller (#25005) | 2023-07-24 10:08:47 -04:00 |
| data2vec | Fix test_model_parallelism (#25359) | 2023-08-08 10:48:45 +02:00 |
| deberta | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| deberta_v2 | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| decision_transformer | | |
| deformable_detr | Fix some bugs for two stage training of deformable detr (#25045) | 2023-08-02 11:30:36 +01:00 |
| deit | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| deta | Make more test models smaller (#25005) | 2023-07-24 10:08:47 -04:00 |
| detr | Check models used for common tests are small (#24824) | 2023-07-14 14:43:19 -04:00 |
| dinat | Update old existing feature extractor references (#24552) | 2023-06-29 10:17:36 +01:00 |
| dinov2 | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| distilbert | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| dit | Update old existing feature extractor references (#24552) | 2023-06-29 10:17:36 +01:00 |
| donut | | |
| dpr | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| dpt | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| efficientformer | Update old existing feature extractor references (#24552) | 2023-06-29 10:17:36 +01:00 |
| efficientnet | Update InstructBLIP & Align values after rescale update (#25209) | 2023-08-03 11:01:10 +01:00 |
| electra | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| encodec | Make more test models smaller (#25005) | 2023-07-24 10:08:47 -04:00 |
| encoder_decoder | | |
| ernie | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| ernie_m | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| esm | Fix test_model_parallelism (#25359) | 2023-08-08 10:48:45 +02:00 |
| falcon | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| flaubert | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| flava | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| fnet | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| focalnet | | |
| fsmt | | |
| funnel | Big TF test cleanup (#24282) | 2023-06-16 15:40:49 +01:00 |
| git | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| glpn | Update old existing feature extractor references (#24552) | 2023-06-29 10:17:36 +01:00 |
| gpt2 | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| gpt_bigcode | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| gpt_neo | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| gpt_neox | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| gpt_neox_japanese | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| gpt_sw3 | | |
| gptj | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| gptsan_japanese | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| graphormer | Make more test models smaller (#25005) | 2023-07-24 10:08:47 -04:00 |
| groupvit | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| herbert | | |
| hubert | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| ibert | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| imagegpt | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| informer | Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420) | 2023-06-22 16:11:27 +02:00 |
| instructblip | Update InstructBLIP & Align values after rescale update (#25209) | 2023-08-03 11:01:10 +01:00 |
| jukebox | Set TF32 flag for PyTorch cuDNN backend (#25075) | 2023-07-25 08:04:48 -04:00 |
| layoutlm | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| layoutlmv2 | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| layoutlmv3 | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| layoutxlm | Update old existing feature extractor references (#24552) | 2023-06-29 10:17:36 +01:00 |
| led | Speed up TF tests by reducing hidden layer counts (#24595) | 2023-06-30 16:30:33 +01:00 |
| levit | Make more test models smaller (#25005) | 2023-07-24 10:08:47 -04:00 |
| lilt | Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420) | 2023-06-22 16:11:27 +02:00 |
| llama | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| longformer | Fix more offload edge cases (#25342) | 2023-08-07 17:45:41 +02:00 |
| longt5 | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| luke | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| lxmert | Big TF test cleanup (#24282) | 2023-06-16 15:40:49 +01:00 |
| m2m_100 | | |
| marian | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| markuplm | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| mask2former | Add mask2former fp16 support (#25093) | 2023-08-07 20:07:29 +01:00 |
| maskformer | Add mask2former fp16 support (#25093) | 2023-08-07 20:07:29 +01:00 |
| mbart | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| mbart50 | | |
| mega | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| megatron_bert | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| megatron_gpt2 | | |
| mgp_str | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| mluke | | |
| mobilebert | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| mobilenet_v1 | Update old existing feature extractor references (#24552) | 2023-06-29 10:17:36 +01:00 |
| mobilenet_v2 | Update old existing feature extractor references (#24552) | 2023-06-29 10:17:36 +01:00 |
| mobilevit | Make more test models smaller (#25005) | 2023-07-24 10:08:47 -04:00 |
| mobilevitv2 | Make more test models smaller (#25005) | 2023-07-24 10:08:47 -04:00 |
| mpnet | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| mpt | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| mra | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| mt5 | | |
| musicgen | [MusicGen] Fix integration tests (#25169) | 2023-07-28 18:50:15 +01:00 |
| mvp | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| nat | Update old existing feature extractor references (#24552) | 2023-06-29 10:17:36 +01:00 |
| nezha | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| nllb | | |
| nllb_moe | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| nystromformer | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| oneformer | Use small config for OneFormerModelTest.test_model_with_labels (#25383) | 2023-08-08 17:15:34 +02:00 |
| openai | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| opt | Fix test_model_parallelism (#25359) | 2023-08-08 10:48:45 +02:00 |
| owlvit | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| pegasus | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| pegasus_x | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| perceiver | Fix last models for common tests that are too big. (#25058) | 2023-07-25 07:56:04 -04:00 |
| phobert | | |
| pix2struct | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| plbart | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| poolformer | Update old existing feature extractor references (#24552) | 2023-06-29 10:17:36 +01:00 |
| prophetnet | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| pvt | Update tiny model info. and pipeline testing (#25213) | 2023-07-31 19:35:33 +02:00 |
| qdqbert | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| rag | Big TF test cleanup (#24282) | 2023-06-16 15:40:49 +01:00 |
| realm | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| reformer | | |
| regnet | Update old existing feature extractor references (#24552) | 2023-06-29 10:17:36 +01:00 |
| rembert | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| resnet | Update old existing feature extractor references (#24552) | 2023-06-29 10:17:36 +01:00 |
| roberta | Fix test_model_parallelism (#25359) | 2023-08-08 10:48:45 +02:00 |
| roberta_prelayernorm | Fix test_model_parallelism (#25359) | 2023-08-08 10:48:45 +02:00 |
| roc_bert | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| roformer | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| rwkv | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| sam | Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420) | 2023-06-22 16:11:27 +02:00 |
| segformer | Fix last models for common tests that are too big. (#25058) | 2023-07-25 07:56:04 -04:00 |
| sew | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| sew_d | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| speech_encoder_decoder | | |
| speech_to_text | Update some torchscript tests after #24505 (#24566) | 2023-06-29 16:05:24 +02:00 |
| speech_to_text_2 | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| speecht5 | add generate method to SpeechT5ForTextToSpeech (#25233) | 2023-08-03 14:12:07 +01:00 |
| splinter | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| squeezebert | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| swiftformer | Fix last models for common tests that are too big. (#25058) | 2023-07-25 07:56:04 -04:00 |
| swin | Update old existing feature extractor references (#24552) | 2023-06-29 10:17:36 +01:00 |
| swin2sr | | |
| swinv2 | Update old existing feature extractor references (#24552) | 2023-06-29 10:17:36 +01:00 |
| switch_transformers | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| t5 | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| table_transformer | Fix last models for common tests that are too big. (#25058) | 2023-07-25 07:56:04 -04:00 |
| tapas | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| time_series_transformer | Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420) | 2023-06-22 16:11:27 +02:00 |
| timesformer | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| timm_backbone | Fix last models for common tests that are too big. (#25058) | 2023-07-25 07:56:04 -04:00 |
| transfo_xl | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| trocr | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| tvlt | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| umt5 | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| unispeech | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| unispeech_sat | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| upernet | Fix last models for common tests that are too big. (#25058) | 2023-07-25 07:56:04 -04:00 |
| videomae | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| vilt | Fix test_model_parallelism (#25359) | 2023-08-08 10:48:45 +02:00 |
| vision_encoder_decoder | Update old existing feature extractor references (#24552) | 2023-06-29 10:17:36 +01:00 |
| vision_text_dual_encoder | Fix VisionTextDualEncoderIntegrationTest (#24661) | 2023-07-05 13:44:30 +02:00 |
| visual_bert | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| vit | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| vit_hybrid | Fix test_model_parallelism (#25359) | 2023-08-08 10:48:45 +02:00 |
| vit_mae | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| vit_msn | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| vivit | Update InstructBLIP & Align values after rescale update (#25209) | 2023-08-03 11:01:10 +01:00 |
| wav2vec2 | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| wav2vec2_conformer | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| wav2vec2_phoneme | | |
| wav2vec2_with_lm | | |
| wavlm | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| whisper | Update some torchscript tests after #24505 (#24566) | 2023-06-29 16:05:24 +02:00 |
| x_clip | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| xglm | Fix test_model_parallelism (#25359) | 2023-08-08 10:48:45 +02:00 |
| xlm | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| xlm_prophetnet | | |
| xlm_roberta | | |
| xlm_roberta_xl | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| xlnet | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| xmod | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| yolos | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| yoso | CI with num_hidden_layers=2 🚀🚀🚀 (#25266) | 2023-08-02 20:22:36 +02:00 |
| __init__.py | | |