Commit graph

4475 commits

Author SHA1 Message Date
Stas Bekman bfeaae2235
fix 404 (#5616) 2020-07-09 15:12:29 -04:00
Lysandre Debut b25f7802de
Should check that torch TPU is available (#5636) 2020-07-09 13:54:32 -04:00
Lysandre Debut 3cc23eee06
More explicit error when failing to tensorize overflowing tokens (#5633) 2020-07-09 13:35:21 -04:00
Lysandre b9d8af07e6 Update stable doc 2020-07-09 11:06:23 -04:00
Lysandre Debut 1158e56551
Correct extension (#5631) 2020-07-09 11:03:07 -04:00
Lysandre 5c82bf6831 Update stable doc 2020-07-09 10:16:13 -04:00
Lysandre Debut 0533cf4706
Test XLA examples (#5583)
* Test XLA examples

* Style

* Using `require_torch_tpu`

* Style

* No need for pytest
2020-07-09 09:19:19 -04:00
Funtowicz Morgan 3bd55199cd
QA pipeline BART compatible (#5496)
* Ensure padding and question cannot have higher probs than context.

Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Add bart to the list of tokenizers adding two <sep> tokens for squad_convert_example_to_feature

Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Format.

Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Addressing @patrickvonplaten comments.

Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Addressing @patrickvonplaten comments about masking non-context element when generating the answer.

Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Addressing @sshleifer comments.

Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Make sure we mask CLS after handling impossible answers

Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Mask in the correct vectors ...

Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>
2020-07-09 15:11:40 +02:00
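The masking this PR describes (padding and question tokens must never out-score the context) can be sketched as follows. This is an illustrative simplification, not the pipeline's actual internals; the `p_mask` convention (1 = not part of the context) is an assumption borrowed from the SQuAD feature format.

```python
import math

def mask_non_context(start_logits, p_mask):
    """Set logits of non-context positions (question, padding, special
    tokens) to -inf so they can never out-score the context.
    p_mask[i] == 1 marks position i as NOT part of the context."""
    return [logit if m == 0 else float("-inf")
            for logit, m in zip(start_logits, p_mask)]

def softmax(logits):
    mx = max(logits)
    exps = [math.exp(x - mx) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

logits = [5.0, 2.0, 3.0, 1.0]   # position 0 (a question token) scores highest
p_mask = [1, 0, 0, 1]           # positions 0 and 3 are question/padding
probs = softmax(mask_non_context(logits, p_mask))
```

After masking, all probability mass lands on context positions, so an answer span can never start on the question or on padding.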
Stas Bekman fa5423b169
doc fixes (#5613) 2020-07-08 19:52:44 -04:00
Txus 7d0ef00420
Add newly trained `calbert-tiny-uncased` (#5599)
* Create README.md

Add newly trained `calbert-tiny-uncased` (complete rewrite with SentencePiece)

* Add Exbert link

* Apply suggestions from code review

Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-07-08 17:54:51 -04:00
Lorenzo Ampil 0cc4eae0e6
Fix Inconsistent NER Grouping (Pipeline) (#4987)
* Add B I handling to grouping

* Add fix to include separate entity as last token

* move last_idx definition outside loop

* Use first entity in entity group as reference for entity type

* Add test cases

* Take out extra class accidentally added

* Return tf ner grouped test to original

* Take out redundant last entity

* Get last_idx safely

Co-authored-by: ColleterVi <36503688+ColleterVi@users.noreply.github.com>

* Fix first entity comment

* Create separate functions for group_sub_entities and group_entities (splitting call method to testable functions)

* Take out unnecessary last_idx

* Remove additional forward pass test

* Move token classification basic tests to separate class

* Move token classification basic tests back to monocolumninputtestcase

* Move base ner tests to nerpipelinetests

* Take out unused kwargs

* Add back mandatory_keys argument

* Add unitary tests for group_entities in _test_ner_pipeline

* Fix last entity handling

* Fix grouping function used

* Add typing to group_sub_entities and group_entities

Co-authored-by: ColleterVi <36503688+ColleterVi@users.noreply.github.com>
2020-07-08 16:18:17 -04:00
Suraj Patil 82ce8488bb
create model cards for qg models (#5610) 2020-07-08 16:08:56 -04:00
Bashar Talafha d6b6ab11f0
Create README.md (#5601) 2020-07-08 16:07:48 -04:00
Patrick von Platen 40d98ebf50
Update benchmark notebook (#5603)
* Created with Colaboratory

* delete old file
2020-07-08 16:03:59 +02:00
Sylvain Gugger 281e394889
Update question template (#5585) 2020-07-08 08:46:35 -04:00
Patrick von Platen f82a2a5e8e
[Benchmark] Add benchmarks for TF Training (#5594)
* tf_train

* adapt timing for tpu

* fix timing

* fix timing

* fix timing

* fix timing

* update notebook

* add tests
2020-07-08 12:11:09 +02:00
Ji Xin cfbb982974
Add DeeBERT (entropy-based early exiting for *BERT) (#5477)
* Add deebert code

* Add readme of deebert

* Add test for deebert

Update test for DeeBERT

* Update DeeBert (README, class names, function refactoring); remove requirements.txt

* Format update

* Update test

* Update readme and model init methods
2020-07-08 08:17:59 +08:00
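DeeBERT's entropy-based early exit attaches a small classifier to every layer and stops as soon as a layer's prediction is confident enough. A minimal sketch of that decision rule, with made-up layer outputs and threshold (the function names are illustrative, not the example's API):

```python
import math

def entropy(probs):
    """Shannon entropy (in nats) of a probability distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0)

def early_exit(layer_probs, threshold):
    """Return (layer_index, probs) for the first off-ramp whose
    prediction entropy falls below the threshold; fall back to the
    final layer if no earlier one is confident enough."""
    for i, probs in enumerate(layer_probs):
        if entropy(probs) < threshold:
            return i, probs
    return len(layer_probs) - 1, layer_probs[-1]

# per-layer classifier outputs, growing more confident with depth
layers = [[0.5, 0.5], [0.7, 0.3], [0.95, 0.05]]
layer, probs = early_exit(layers, threshold=0.3)
```

Lower entropy means a sharper, more confident distribution, so lowering the threshold trades speed for accuracy.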
Joe Davison b4b33fdf25
Guide to fixed-length model perplexity evaluation (#5449)
* add first draft ppl guide

* upload imgs

* expand on strides

* ref typo

* rm superfluous past var

* add tokenization disclaimer
2020-07-07 16:04:15 -06:00
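The guide's stride trick (overlapping windows where each window scores only its new tokens, so every token is scored exactly once with as much left context as the model allows) can be sketched like this; the helper name and tuple layout are made up for illustration:

```python
def stride_windows(seq_len, max_len, stride):
    """Yield (begin, end, target_len) windows: each window covers
    tokens [begin, end), but only the last target_len tokens are
    scored, so every token contributes to the loss exactly once."""
    prev_end = 0
    for begin in range(0, seq_len, stride):
        end = min(begin + max_len, seq_len)
        target_len = end - prev_end   # only the tokens not yet scored
        yield begin, end, target_len
        prev_end = end
        if end == seq_len:
            break

windows = list(stride_windows(seq_len=10, max_len=4, stride=2))
```

Summing the per-window negative log-likelihoods over `target_len` tokens and exponentiating the mean gives the fixed-length perplexity the guide describes.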
Patrick von Platen fde217c679
readme for benchmark (#5363) 2020-07-07 23:21:23 +02:00
Sam Shleifer d6eab53058
mbart.prepare_translation_batch: pass through kwargs (#5581) 2020-07-07 13:46:05 -04:00
Sam Shleifer 353b8f1e7a
Add mbart-large-cc25, support translation finetuning (#5129)
improve unittests for finetuning, especially w.r.t testing frozen parameters
fix freeze_embeds for T5
add streamlit setup.cfg
2020-07-07 13:23:01 -04:00
Julien Chaumond 141492448b
Create xlm-roberta-large-finetuned-conll03-german-README.md
cc @BramVanroy
2020-07-07 13:15:10 -04:00
Patrick von Platen 4dc65591b5
[Almost all TF models] TF clean up: add missing CLM / MLM loss; fix T5 naming and keras compile (#5395)
* add first version of clm tf

* make style

* add more tests for bert

* update tf clm loss

* fix tests

* correct tf ner script

* add mlm loss

* delete bogus file

* clean tf auto model + add tests

* finish adding clm loss everywhere

* fix training in distilbert

* fix flake8

* save intermediate

* fix tf t5 naming

* remove prints

* finish up

* up

* fix tf gpt2

* fix new test utils import

* fix flake8

* keep backward compatibility

* Update src/transformers/modeling_tf_albert.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/modeling_tf_auto.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/modeling_tf_electra.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/modeling_tf_roberta.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/modeling_tf_mobilebert.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/modeling_tf_auto.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/modeling_tf_bert.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/modeling_tf_distilbert.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* apply sylvains suggestions

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2020-07-07 18:15:53 +02:00
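A causal-LM loss like the ones added in this PR boils down to shifting the labels by one position and ignoring masked positions. A framework-agnostic sketch (the `-100` ignore index follows the transformers convention; the toy uniform distribution is purely illustrative):

```python
import math

def clm_shift(input_ids):
    """Causal LM: position i predicts token i+1, so drop the last
    logit position and the first label position."""
    return input_ids[:-1], input_ids[1:]

def nll_loss(log_probs, labels, ignore_index=-100):
    """Mean negative log-likelihood over positions whose label is
    not the ignore index."""
    terms = [-lp[y] for lp, y in zip(log_probs, labels) if y != ignore_index]
    return sum(terms) / len(terms)

inputs, labels = clm_shift([10, 11, 12, 13])
# toy per-position log-probabilities over a 14-token vocabulary
uniform = [math.log(1 / 14)] * 14
loss = nll_loss([uniform, uniform, uniform], labels)
```

The MLM variant skips the shift entirely: labels stay aligned with their positions and every non-masked label is set to the ignore index.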
Suraj Patil 33e43edddc
[docs] fix model_doc links in model summary (#5566)
* fix model_doc links

* update model links
2020-07-07 11:06:12 -04:00
Quentin Lhoest 4fedc1256c
Fix tests imports dpr (#5576)
* fix test imports

* fix max_length

* style

* fix tests
2020-07-07 16:35:12 +02:00
Sam Shleifer d4886173b2
[Bart] enable test_torchscript, update test_tie_weights (#5457)
* Passing all but one torchscript test

* Style

* move comment

* remove unneeded assert
2020-07-07 10:06:48 -04:00
Suraj Patil e49393c361
[examples] Add trainer support for question-answering (#4829)
* add SquadDataset

* add DataCollatorForQuestionAnswering

* update __init__

* add run_squad with trainer

* add DataCollatorForQuestionAnswering in __init__

* pass data_collator to trainer

* doc tweak

* Update run_squad_trainer.py

* Update __init__.py

* Update __init__.py

Co-authored-by: Julien Chaumond <chaumond@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2020-07-07 08:57:08 -04:00
Quentin Lhoest fbd8792195
Add DPR model (#5279)
* beginning of dpr modeling

* wip

* implement forward

* remove biencoder + better init weights

* export dpr model to embed model for nlp lib

* add new api

* remove old code

* make style

* fix dumb typo

* don't load bert weights

* docs

* docs

* style

* move the `k` parameter

* fix init_weights

* add pretrained configs

* minor

* update config names

* style

* better config

* style

* clean code based on PR comments

* change Dpr to DPR

* fix config

* switch encoder config to a dict

* style

* inheritance -> composition

* add messages in assert statements

* add dpr reader tokenizer

* one tokenizer per model

* fix base_model_prefix

* fix imports

* typo

* add convert script

* docs

* change tokenizers conf names

* style

* change tokenizers conf names

* minor

* minor

* fix wrong names

* minor

* remove unused convert functions

* rename convert script

* use return_tensors in tokenizers

* remove n_questions dim

* move generate logic to tokenizer

* style

* add docs

* docs

* quality

* docs

* add tests

* style

* add tokenization tests

* DPR full tests

* Stay true to the attention mask building

* update docs

* missing param in bert input docs

* docs

* style

Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
2020-07-07 08:56:12 -04:00
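DPR's retrieval step reduces to ranking passages by the dot product between the question embedding and each passage embedding. A toy sketch with hand-picked low-dimensional vectors (the real model produces dense embeddings from separate question and context encoders; the function name here is illustrative):

```python
def dpr_scores(question_emb, passage_embs):
    """Rank passages by dot product with the question embedding,
    as in DPR's retrieval scoring."""
    return [sum(q * p for q, p in zip(question_emb, pe))
            for pe in passage_embs]

question = [1.0, 0.0, 2.0]
passages = [[1.0, 1.0, 1.0],   # weak match
            [0.0, 1.0, 0.0],   # orthogonal: no match
            [2.0, 0.0, 2.0]]   # strong match
scores = dpr_scores(question, passages)
best = max(range(len(scores)), key=scores.__getitem__)
```

Because scoring is a plain dot product, passage embeddings can be precomputed and searched with an approximate nearest-neighbor index.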
Savaş Yıldırım d2a9399115
Update model card (#5491) 2020-07-07 18:43:49 +08:00
Savaş Yıldırım 2e653d89d7
Update model card (#5492) 2020-07-07 18:43:34 +08:00
Savaş Yıldırım beaf60e589
bert-turkish-text-classification model card (#5493) 2020-07-07 18:43:09 +08:00
Manuel Romero e6eba8419c
electra-small-finetuned-squadv1 model card (#5430)
* Create model card

Create model card for electra-small-discriminator finetuned on SQUAD v1.1

* Set right model path in code example
2020-07-07 18:41:42 +08:00
Vitalii Radchenko 43b7ad5df5
ukr-roberta-base model card (#5514) 2020-07-07 18:40:23 +08:00
Manuel Romero 87aa857d7e
roberta-base-1B-1-finetuned-squadv1 model card (#5515) 2020-07-07 18:39:09 +08:00
Moseli Motsoehli c7d96b60e4
zuBERTa model card (#5536)
* Create README

* Update README.md

Co-authored-by: Kevin Canwen Xu <canwenxu@126.com>
2020-07-07 18:38:15 +08:00
Manuel Romero b95dfcf110
roberta-base-1B-1-finetuned-squadv2 model card (#5523) 2020-07-07 18:33:42 +08:00
Abel 6912265711
Make T5 compatible with ONNX (#5518)
* Default decoder inputs to encoder ones for T5 if neither are specified.

* Fixing typo, now all tests are passing.

* Changing einsum to operations supported by onnx

* Adding a test to ensure T5 can be exported to ONNX with opset > 9

* Modified test for onnx export to make it faster

* Styling changes.

* Styling changes.

* Changing notation for matrix multiplication

Co-authored-by: Abel Riboulot <tkai@protomail.com>
2020-07-07 11:32:29 +02:00
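The einsum-to-matmul change above can be illustrated with NumPy, assuming an attention-score pattern like `bnqd,bnkd->bnqk` (the shapes here are arbitrary; the exact subscripts used in T5 are an assumption):

```python
import numpy as np

# (batch, heads, query_len, head_dim) and (batch, heads, key_len, head_dim)
q = np.arange(2 * 3 * 4 * 5, dtype=float).reshape(2, 3, 4, 5)
k = np.arange(2 * 3 * 6 * 5, dtype=float).reshape(2, 3, 6, 5)

# einsum form: concise, but not supported by older ONNX opsets
scores_einsum = np.einsum("bnqd,bnkd->bnqk", q, k)

# equivalent matmul + transpose, built from ops ONNX can export
scores_matmul = q @ k.transpose(0, 1, 3, 2)
```

Both forms contract over the head dimension `d`, so they produce identical `(batch, heads, query_len, key_len)` score tensors.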
Patrick von Platen 989ae326b5
[Reformer] Adapt Reformer MaskedLM Attn mask (#5560)
* fix attention mask

* fix slow test

* refactor attn masks

* fix fp16 generate test
2020-07-07 10:48:06 +02:00
Shashank Gupta 3dcb748e31
Added data collator for permutation (XLNet) language modeling and related calls (#5522)
* Added data collator for XLNet language modeling and related calls

Added DataCollatorForXLNetLanguageModeling in data/data_collator.py
to generate necessary inputs for language modeling training with
XLNetLMHeadModel. Also added related arguments, logic and calls in
examples/language-modeling/run_language_modeling.py.

Resolves: #4739, #2008 (partially)

* Changed name to `DataCollatorForPermutationLanguageModeling`

Changed the name of `DataCollatorForXLNetLanguageModeling` to the more general `DataCollatorForPermutationLanguageModeling`.
Removed the `--mlm` flag requirement for the new collator and defined a separate `--plm_probability` flag for its use.
CTRL uses a CLM loss just like GPT and GPT-2, so should work out of the box with this script (provided `past` is taken care of
similar to `mems` for XLNet).
Changed calls and imports appropriately.

* Added detailed comments, changed variable names

Added more detailed comments to `DataCollatorForPermutationLanguageModeling` in `data/data_collator.py` to explain how it works. Also cleaned up variable names and made them more informative.

* Added tests for new data collator

Added tests in `tests/test_trainer.py` for DataCollatorForPermutationLanguageModeling based on those in DataCollatorForLanguageModeling. A specific test has been added to check for odd-length sequences.

* Fixed styling issues
2020-07-07 10:17:37 +02:00
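The span-masking strategy this collator uses can be sketched roughly as follows. This is a simplification under stated assumptions: the real `DataCollatorForPermutationLanguageModeling` additionally builds the permutation order and target mapping tensors that XLNet's two-stream attention needs, and the defaults here mirror but are not guaranteed to match its exact parameters.

```python
import random

def plm_spans(seq_len, plm_probability=1 / 6, max_span_length=5, seed=0):
    """Pick non-overlapping spans of tokens to predict, XLNet-style:
    each span of length L is placed somewhere inside a surrounding
    context of length L / plm_probability, so roughly plm_probability
    of all tokens end up masked."""
    rng = random.Random(seed)
    masked = [False] * seq_len
    cur = 0
    while cur < seq_len:
        span = rng.randint(1, max_span_length)
        context = int(span / plm_probability)
        start = cur + rng.randint(0, context - span)
        for i in range(start, min(start + span, seq_len)):
            masked[i] = True
        cur += context   # jump past this context before placing the next span
    return masked

masked = plm_spans(seq_len=64)
ratio = sum(masked) / len(masked)
```

The `--plm_probability` flag the PR introduces controls exactly this span-to-context ratio, replacing the `--mlm` probability used by the BERT-style collator.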
Lysandre 1d2332861f Post v3.0.2 release commit 2020-07-06 18:56:47 -04:00
Lysandre b0892fa0e8 Release: v3.0.2 2020-07-06 18:49:44 -04:00
Sylvain Gugger f1e2e423ab
Fix fast tokenizers too (#5562) 2020-07-06 18:45:01 -04:00
Anthony MOI 5787e4c159
Various tokenizers fixes (#5558)
* BertTokenizerFast - Do not specify strip_accents by default

* Bump tokenizers to new version

* Add test for AddedToken serialization
2020-07-06 18:27:53 -04:00
Sylvain Gugger 21f28c34b7
Fix #5507 (#5559)
* Fix #5507

* Fix formatting
2020-07-06 17:26:48 -04:00
Lysandre Debut 9d9b872b66
The `add_space_before_punct_symbol` is only for TransfoXL (#5549) 2020-07-06 12:17:05 -04:00
Lysandre Debut d6b0b9d451
GPT2 tokenizer should not output token type IDs (#5546)
* GPT2 tokenizer should not output token type IDs

* Same for OpenAIGPT
2020-07-06 11:33:57 -04:00
Sylvain Gugger 7833b21a5a
Fix #5544 (#5551) 2020-07-06 11:22:24 -04:00
Thomas Wolf c473484087
Fix the tokenization warning noted in #5505 (#5550)
* fix warning

* style and quality
2020-07-06 11:15:25 -04:00
Lysandre 1bbc28bee7 Imports organization 2020-07-06 10:27:10 -04:00
Mohamed Taher Alrefaie 1bc13697b1
Update convert_pytorch_checkpoint_to_tf2.py (#5531)
fixed ImportError: cannot import name 'hf_bucket_url'
2020-07-06 09:55:10 -04:00