Stas Bekman
443b0cad96
rename the function to match the rest of the test convention ( #5692 )
2020-07-13 18:09:49 +08:00
onepointconsulting
74843695eb
Added first description of the model ( #5672 )
...
Added general description, information about the tags and also some example usage code.
2020-07-13 02:53:48 -04:00
Kevin Canwen Xu
0befb51327
Pipeline model type check ( #5679 )
...
* Add model type check for pipelines
* rename func
* Fix the init parameters
* Fix format
* rollback unnecessary refactor
2020-07-12 12:34:21 +08:00
Kevin Canwen Xu
dc31a72f50
Add Microsoft's CodeBERT ( #5683 )
...
* Add Microsoft's CodeBERT
* link style
* single modal
* unused import
2020-07-11 21:37:30 +08:00
Sylvain Gugger
7fad617dc1
Document model outputs ( #5673 )
...
* Document model outputs
* Update docs/source/main_classes/output.rst
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
2020-07-10 17:31:02 -04:00
Sylvain Gugger
df983b7483
Deprecate old past arguments ( #5671 )
2020-07-10 17:25:52 -04:00
Tomo Lazovich
cdf4cd7068
[squad] add version tag to squad cache ( #5669 )
2020-07-10 16:34:21 -04:00
Patrick von Platen
223084e42b
Add Reformer to notebooks
2020-07-10 18:34:25 +02:00
Julien Chaumond
201d23f285
Update The Big Table of Tasks
...
Co-Authored-By: Suraj Patil <surajp815@gmail.com>
Co-Authored-By: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2020-07-10 18:07:29 +02:00
Bashar Talafha
82f7bbbd93
Update README.md ( #5617 )
...
* Update README.md
* Update README.md
2020-07-10 11:43:27 -04:00
Manuel Romero
bf497376ee
Create README.md ( #5572 )
2020-07-10 11:42:49 -04:00
kolk
3653d01f2a
Create README.md for electra-base-squad2 ( #5574 )
2020-07-10 11:39:44 -04:00
Txus
aa69c81f29
Add freshly trained `base` version ( #5621 )
2020-07-10 11:39:04 -04:00
Teven
227e0a406d
Fixed use of memories in XLNet (caching for language generation + warning when loading improper memoryless model) ( #5632 )
...
* Pytorch gpu => cpu proper device
* Memoryless XLNet warning + fixed memories during generation
* Revert "Pytorch gpu => cpu proper device"
This reverts commit 93489b36
* made black happy
* TF generation with memories
* dim => axis
* added padding_text to TF XL models
* Added comment, added TF
2020-07-10 17:38:36 +02:00
Manuel Romero
3b7b646563
Create README.md ( #5638 )
2020-07-10 11:38:23 -04:00
Manuel Romero
0039b965db
Create model card ( #5655 )
...
Create model card for T5-small fine-tuned on SQUAD v2
2020-07-10 11:38:11 -04:00
Nils Reimers
46982d612f
Create README.md - Model card ( #5657 )
...
Model card for sentence-transformers/bert-base-nli-cls-token
2020-07-10 11:38:03 -04:00
Nils Reimers
c483803d1b
Create README.md - Model card ( #5658 )
...
Model card for sentence-transformers/bert-base-nli-max-tokens
2020-07-10 11:37:56 -04:00
Sylvain Gugger
edfd82f5ff
Change model outputs types to self-document outputs ( #5438 )
...
* [WIP] Proposal for model outputs
* All Bert models
* Make CI green maybe?
* Fix ONNX test
* Isolate ModelOutput from pt and tf
* Formatting
* Add Electra models
* Auto-generate docstrings from outputs
* Add TF outputs
* Add some BERT models
* Revert TF side
* Remove last traces of TF changes
* Fail with a clear error message
* Add Albert and work through Bart
* Add CTRL and DistilBert
* Formatting
* Progress on Bart
* Renames and finish Bart
* Formatting
* Fix last test
* Add DPR
* Finish Electra and add FlauBERT
* Add GPT2
* Add Longformer
* Add MMBT
* Add MobileBert
* Add GPT
* Formatting
* Add Reformer
* Add Roberta
* Add T5
* Add Transformer XL
* Fix test
* Add XLM + fix XLMForTokenClassification
* Style + XLMRoberta
* Add XLNet
* Formatting
* Add doc of return_tuple arg
2020-07-10 11:36:53 -04:00
Suraj Parmar
fa265230a2
Create Model card for RoBERTa-hindi-guj-san ( #5661 )
2020-07-10 11:34:23 -04:00
Sylvain Gugger
b2747af543
Improvements to PretrainedConfig documentation ( #5642 )
...
* Update PretrainedConfig doc
* Formatting
* Small fixes
* Forgotten args and more cleanup
2020-07-10 10:31:47 -04:00
Julien Chaumond
bfacb2e34f
[model_card] BART for ELI5
...
cc @yjernite
2020-07-10 08:10:24 -04:00
Nils Reimers
2e6bb0e9c3
Create README.md ( #5652 )
2020-07-10 05:41:10 -04:00
Julien Chaumond
552e4591f5
[model_card] Add meta + fix link to image
...
(hotlinking to image works on GitHub but not on external sites)
cc @bashartalafha
2020-07-10 05:07:33 -04:00
Teven
02a0b43014
Fixed TextGenerationPipeline on torch + GPU ( #5629 )
...
* Pytorch gpu => cpu proper device
* Memoryless XLNet warning + fixed memories during generation
* Revert "Memoryless XLNet warning + fixed memories during generation"
This reverts commit 3d3251ff
* Took the operations on the generated_sequence out of the ensure_device scope
2020-07-09 16:29:32 -04:00
Sylvain Gugger
760f726e51
Add forum link in the docs ( #5637 )
2020-07-09 15:13:22 -04:00
Stas Bekman
bfeaae2235
fix 404 ( #5616 )
2020-07-09 15:12:29 -04:00
Lysandre Debut
b25f7802de
Should check that torch TPU is available ( #5636 )
2020-07-09 13:54:32 -04:00
Lysandre Debut
3cc23eee06
More explicit error when failing to tensorize overflowing tokens ( #5633 )
2020-07-09 13:35:21 -04:00
Lysandre
b9d8af07e6
Update stable doc
2020-07-09 11:06:23 -04:00
Lysandre Debut
1158e56551
Correct extension ( #5631 )
2020-07-09 11:03:07 -04:00
Lysandre
5c82bf6831
Update stable doc
2020-07-09 10:16:13 -04:00
Lysandre Debut
0533cf4706
Test XLA examples ( #5583 )
...
* Test XLA examples
* Style
* Using `require_torch_tpu`
* Style
* No need for pytest
2020-07-09 09:19:19 -04:00
Funtowicz Morgan
3bd55199cd
QA pipeline BART compatible ( #5496 )
...
* Ensure padding and question cannot have higher probs than context.
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>
* Add bart to the list of tokenizers adding two <sep> tokens for squad_convert_example_to_feature
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>
* Format.
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>
* Addressing @patrickvonplaten comments.
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>
* Addressing @patrickvonplaten comments about masking non-context element when generating the answer.
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>
* Addressing @sshleifer comments.
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>
* Make sure we mask CLS after handling impossible answers
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>
* Mask in the correct vectors ...
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>
2020-07-09 15:11:40 +02:00
Stas Bekman
fa5423b169
doc fixes ( #5613 )
2020-07-08 19:52:44 -04:00
Txus
7d0ef00420
Add newly trained `calbert-tiny-uncased` ( #5599 )
...
* Create README.md
Add newly trained `calbert-tiny-uncased` (complete rewrite with SentencePiece)
* Add Exbert link
* Apply suggestions from code review
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-07-08 17:54:51 -04:00
Lorenzo Ampil
0cc4eae0e6
Fix Inconsistent NER Grouping (Pipeline) ( #4987 )
...
* Add B I handling to grouping
* Add fix to include separate entity as last token
* move last_idx definition outside loop
* Use first entity in entity group as reference for entity type
* Add test cases
* Take out extra class accidentally added
* Return tf ner grouped test to original
* Take out redundant last entity
* Get last_idx safely
Co-authored-by: ColleterVi <36503688+ColleterVi@users.noreply.github.com>
* Fix first entity comment
* Create separate functions for group_sub_entities and group_entities (splitting call method to testable functions)
* Take out unnecessary last_idx
* Remove additional forward pass test
* Move token classification basic tests to separate class
* Move token classification basic tests back to monocolumninputtestcase
* Move base ner tests to nerpipelinetests
* Take out unused kwargs
* Add back mandatory_keys argument
* Add unitary tests for group_entities in _test_ner_pipeline
* Fix last entity handling
* Fix grouping function used
* Add typing to group_sub_entities and group_entities
Co-authored-by: ColleterVi <36503688+ColleterVi@users.noreply.github.com>
2020-07-08 16:18:17 -04:00
Suraj Patil
82ce8488bb
create model cards for qg models ( #5610 )
2020-07-08 16:08:56 -04:00
Bashar Talafha
d6b6ab11f0
Create README.md ( #5601 )
2020-07-08 16:07:48 -04:00
Patrick von Platen
40d98ebf50
Update benchmark notebook ( #5603 )
...
* Created with Colaboratory
* delete old file
2020-07-08 16:03:59 +02:00
Sylvain Gugger
281e394889
Update question template ( #5585 )
2020-07-08 08:46:35 -04:00
Patrick von Platen
f82a2a5e8e
[Benchmark] Add benchmarks for TF Training ( #5594 )
...
* tf_train
* adapt timing for tpu
* fix timing
* fix timing
* fix timing
* fix timing
* update notebook
* add tests
2020-07-08 12:11:09 +02:00
Ji Xin
cfbb982974
Add DeeBERT (entropy-based early exiting for *BERT) ( #5477 )
...
* Add deebert code
* Add readme of deebert
* Add test for deebert
Update test for Deebert
* Update DeeBert (README, class names, function refactoring); remove requirements.txt
* Format update
* Update test
* Update readme and model init methods
2020-07-08 08:17:59 +08:00
Joe Davison
b4b33fdf25
Guide to fixed-length model perplexity evaluation ( #5449 )
...
* add first draft ppl guide
* upload imgs
* expand on strides
* ref typo
* rm superfluous past var
* add tokenization disclaimer
2020-07-07 16:04:15 -06:00
Patrick von Platen
fde217c679
readme for benchmark ( #5363 )
2020-07-07 23:21:23 +02:00
Sam Shleifer
d6eab53058
mbart.prepare_translation_batch: pass through kwargs ( #5581 )
2020-07-07 13:46:05 -04:00
Sam Shleifer
353b8f1e7a
Add mbart-large-cc25, support translation finetuning ( #5129 )
...
improve unittests for finetuning, especially w.r.t. testing frozen parameters
fix freeze_embeds for T5
add streamlit setup.cfg
2020-07-07 13:23:01 -04:00
Julien Chaumond
141492448b
Create xlm-roberta-large-finetuned-conll03-german-README.md
...
cc @BramVanroy
2020-07-07 13:15:10 -04:00
Patrick von Platen
4dc65591b5
[Almost all TF models] TF clean up: add missing CLM / MLM loss; fix T5 naming and keras compile ( #5395 )
...
* add first version of clm tf
* make style
* add more tests for bert
* update tf clm loss
* fix tests
* correct tf ner script
* add mlm loss
* delete bogus file
* clean tf auto model + add tests
* finish adding clm loss everywhere
* fix training in distilbert
* fix flake8
* save intermediate
* fix tf t5 naming
* remove prints
* finish up
* up
* fix tf gpt2
* fix new test utils import
* fix flake8
* keep backward compatibility
* Update src/transformers/modeling_tf_albert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/transformers/modeling_tf_auto.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/transformers/modeling_tf_electra.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/transformers/modeling_tf_roberta.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/transformers/modeling_tf_mobilebert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/transformers/modeling_tf_auto.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/transformers/modeling_tf_bert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/transformers/modeling_tf_distilbert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* apply Sylvain's suggestions
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2020-07-07 18:15:53 +02:00
Suraj Patil
33e43edddc
[docs] fix model_doc links in model summary ( #5566 )
...
* fix model_doc links
* update model links
2020-07-07 11:06:12 -04:00