mirror of
https://github.com/saymrwulf/transformers.git
synced 2026-05-14 20:58:08 +00:00
* hidden layers, huh, what are they good for (absolutely nothing) * Some tests break with 1 hidden layer, use 2 * Use 1 hidden layer in a few slow models * Use num_hidden_layers=2 everywhere * Slightly higher tol for groupvit * Slightly higher tol for groupvit |
||
|---|---|---|
| .. | ||
| __init__.py | ||
| test_modeling_data2vec_audio.py | ||
| test_modeling_data2vec_text.py | ||
| test_modeling_data2vec_vision.py | ||
| test_modeling_tf_data2vec_vision.py | ||