mirror of
https://github.com/saymrwulf/transformers.git
synced 2026-05-15 21:01:19 +00:00
* REALM initial commit * Retriever OK (Update new_gelu). * Encoder prediction score OK * Encoder pretrained model OK * Update retriever comments * Update docs, tests, and imports * Prune unused models * Make embedder as a module `RealmEmbedder` * Add RealmRetrieverOutput * Update tokenization * Pass all tests in test_modeling_realm.py * Prune RealmModel * Update docs * Add training test. * Remove completed TODO * Style & Quality * Prune `RealmModel` * Fixup * Changes: 1. Remove RealmTokenizerFast 2. Update docstrings 3. Add a method to RealmTokenizer to handle candidates tokenization. * Fix up * Style * Add tokenization tests * Update `from_pretrained` tests * Apply suggestions * Style & Quality * Copy BERT model * Fix comment to avoid docstring copying * Make RealmBertModel private * Fix bug * Style * Basic QA * Save * Complete reader logits * Add searcher * Complete searcher & reader * Move block records init to constructor * Fix training bug * Add some outputs to RealmReader * Add finetuned checkpoint variable names parsing * Fix bug * Update REALM config * Add RealmForOpenQA * Update convert_tfrecord logits * Fix bugs * Complete imports * Update docs * Update naming * Add brute-force searcher * Pass realm model tests * Style * Exclude RealmReader from common tests * Fix * Fix * convert docs * up * up * more make style * up * upload * up * Fix * Update src/transformers/__init__.py * adapt testing * change modeling code * fix test * up * up * up * correct more * make retriever work * update * make style * finish main structure * Resolve merge conflict * Make everything work * Style * Fixup * Fixup * Update training test * fix retriever * remove hardcoded path * Fix * Fix modeling test * Update model links * Initial retrieval test * Fix modeling test * Complete retrieval tests * Fix * style * Fix tests * Fix docstring example * Minor fix of retrieval test * Update license headers and docs * Apply suggestions from code review * Style * Apply suggestions from code review * Add an example to RealmEmbedder * Fix Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> |
||
|---|---|---|
| .. | ||
| internal | ||
| main_classes | ||
| model_doc | ||
| _config.py | ||
| _toctree.yml | ||
| add_new_model.mdx | ||
| add_new_pipeline.mdx | ||
| benchmarks.mdx | ||
| bertology.mdx | ||
| community.mdx | ||
| contributing.md | ||
| converting_tensorflow_models.mdx | ||
| custom_datasets.mdx | ||
| debugging.mdx | ||
| examples.md | ||
| fast_tokenizers.mdx | ||
| glossary.mdx | ||
| index.mdx | ||
| installation.mdx | ||
| migration.mdx | ||
| model_sharing.mdx | ||
| model_summary.mdx | ||
| multilingual.mdx | ||
| notebooks.md | ||
| parallelism.mdx | ||
| performance.mdx | ||
| perplexity.mdx | ||
| philosophy.mdx | ||
| pr_checks.mdx | ||
| preprocessing.mdx | ||
| quicktour.mdx | ||
| sagemaker.mdx | ||
| serialization.mdx | ||
| task_summary.mdx | ||
| testing.mdx | ||
| tokenizer_summary.mdx | ||
| training.mdx | ||
| troubleshooting.mdx | ||