mirror of
https://github.com/saymrwulf/transformers.git
synced 2026-05-14 20:58:08 +00:00
update with #s of sentences/tokens (#6546)
parent 63144701ed
commit fb7330b30e
1 changed file with 2 additions and 0 deletions
@@ -15,6 +15,8 @@ tags:
* Newscrawl 300k portion of the [Leipzig Corpora](https://wortschatz.uni-leipzig.de/en/download/irish)
* Private news corpus crawled with [Corpus Crawler](https://github.com/google/corpuscrawler)
(2125804 sentences, 47419062 tokens, as reckoned by wc)
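The counts above are attributed to `wc`, but the commit does not say which flags were used. As a rough sketch only, assuming the corpus is stored one sentence per line so that `wc -l` gives sentences and `wc -w` gives whitespace-separated tokens, comparable numbers could be computed like this. The function name `wc_counts` and the sample text are hypothetical and not part of the model card.

```python
import io


def wc_counts(stream):
    """Approximate `wc -l` and `wc -w` for a text stream.

    Assumption: one sentence per line, tokens separated by whitespace.
    Python's str.split() treats some Unicode spaces as separators,
    so results can differ slightly from wc on unusual input.
    """
    lines = 0
    tokens = 0
    for line in stream:
        # wc -l counts newline characters, so an unterminated
        # final line is not counted as a line.
        if line.endswith("\n"):
            lines += 1
        tokens += len(line.split())
    return lines, tokens


# Hypothetical two-sentence sample, one sentence per line.
sample = io.StringIO("Tá an aimsir go maith.\nIs maith liom tae.\n")
sentences, tokens = wc_counts(sample)
print(f"{sentences} sentences, {tokens} tokens")
```

Note that tokens counted this way are whitespace-delimited words, not the subword pieces produced by the model's tokenizer.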
```
from transformers import pipeline
fill_mask = pipeline("fill-mask", model="jimregan/BERTreach", tokenizer="jimregan/BERTreach")