Mirror of https://github.com/saymrwulf/transformers.git, synced 2026-05-14 20:58:08 +00:00
* refactor LlamaAttention * minimal changes * fix llama * update * modular gemmas * modular nits * modular updates * nits * simplify * gpt2 * more modualr and fixes * granite * modular modular modular * nits * update * qwen2 + starcoder2 * mostly gemma2 * Update image_processing_auto.py * fix * Update modular_starcoder2.py * fix * remove all copied from attentions * remove gcv * make fix-copies * oups * oups2.0 * fix some modulars + all copied from * should be good now * revert unwanted changes * Update modeling_decision_transformer.py * finish cleanup * Update modeling_olmo.py * consistency * re-add gradient checkpointing attribute * fix * style * make config necessary * bis * bis * Update modeling_my_new_model2.py * is_causal attr * fix * remove past kv return from decoder layer * fix * default rope config * correctly fix rope config * fix bias * fix gpt2 attention output * fix test * fix inits * fix default sdpa * fix default sdpa implementation * harmonize classes * fix mistral * fix sliding window models * mixtral * be more explicit * style * fix * several fixes * Update modeling_dbrx.py * fix test * olmo + phi * rotary * syle * phi * phi again * again * kwargs * Update test_modeling_common.py * skip fx tracing tests * Update modeling_utils.py * gemma 2 * again * Update modeling_recurrent_gemma.py * gemma2 * granite * style * starcoder * Update sdpa_attention.py * switch args * Update modeling_mllama.py * fix * cache type tests * gpt2 * Update test_modeling_common.py * fix * consistency * fix shape with encoder * should be the last one * tests non model * most comments * small oupsi * be more explicit in modulars * more explicit modulars * CIs! it works locally * add kwargs to _flash_attention_forward --------- Co-authored-by: Cyril Vallez <cyril.vallez@gmail.com>

| Name | | |
|---|---|---|
| test_module | ||
| tf_ops | ||
| add_pipeline_model_mapping_to_test.py | ||
| check_bad_commit.py | ||
| check_build.py | ||
| check_config_attributes.py | ||
| check_config_docstrings.py | ||
| check_copies.py | ||
| check_doc_toc.py | ||
| check_docstrings.py | ||
| check_doctest_list.py | ||
| check_dummies.py | ||
| check_inits.py | ||
| check_model_tester.py | ||
| check_modular_conversion.py | ||
| check_repo.py | ||
| check_self_hosted_runner.py | ||
| check_support_list.py | ||
| check_table.py | ||
| check_tf_ops.py | ||
| create_dependency_mapping.py | ||
| create_dummy_models.py | ||
| custom_init_isort.py | ||
| deprecate_models.py | ||
| download_glue_data.py | ||
| extract_warnings.py | ||
| get_ci_error_statistics.py | ||
| get_github_job_time.py | ||
| get_modified_files.py | ||
| get_previous_daily_ci.py | ||
| get_test_info.py | ||
| important_models.txt | ||
| models_to_deprecate.py | ||
| modular_model_converter.py | ||
| not_doctested.txt | ||
| notification_service.py | ||
| notification_service_doc_tests.py | ||
| notification_service_quantization.py | ||
| past_ci_versions.py | ||
| patch_helper.py | ||
| pr_slow_ci_models.py | ||
| print_env.py | ||
| process_bad_commit_report.py | ||
| process_circleci_workflow_test_reports.py | ||
| process_test_artifacts.py | ||
| release.py | ||
| set_cuda_devices_for_ci.py | ||
| slow_documentation_tests.txt | ||
| sort_auto_mappings.py | ||
| split_doctest_jobs.py | ||
| split_model_tests.py | ||
| tests_fetcher.py | ||
| update_metadata.py | ||
| update_tiny_models.py | ||