huggingface-transformers

Commit graph

Author	SHA1	Message	Date
Adriano Diniz	3363a19b12	Create README.md (#5152 ) * Create README.md * Apply suggestions from code review Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-06-22 17:59:33 -04:00
Michaël Benesty	0cca61925c	Add link to new comunity notebook (optimization) (#5195 ) * Add link to new comunity notebook (optimization) related to https://github.com/huggingface/transformers/issues/4842#event-3469184635 This notebook is about benchmarking model training with/without dynamic padding optimization. https://github.com/ELS-RD/transformers-notebook Using dynamic padding on MNLI provides a 4.7 times training time reduction, with max pad length set to 512. The effect is strong because few examples are >> 400 tokens in this dataset. IRL, it will depend of the dataset, but it always bring improvement and, after more than 20 experiments listed in this [article](https://towardsdatascience.com/divide-hugging-face-transformers-training-time-by-2-or-more-21bf7129db9q-21bf7129db9e?source=friends_link&sk=10a45a0ace94b3255643d81b6475f409), it seems to not hurt performance. Following advice from @patrickvonplaten I do the PR myself :-) * Update notebooks/README.md Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2020-06-22 23:47:33 +02:00
Lee Haau-Sing	1c5cd8e5f5	Add README.md (nyu-mll) (#5174 ) * nyu-mll: roberta on smaller datasets * Update README.md * Update README.md Co-authored-by: Alex Warstadt <alexwarstadt@gmail.com>	2020-06-22 17:24:27 -04:00
Sylvain Gugger	c439752482	Switch master/stable doc and add older releases (#5193 )	2020-06-22 16:38:53 -04:00
Sylvain Gugger	417e492f1e	Quick tour (#5145 ) * Quicktour part 1 * Update * All done * Typos Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com> * Address comments in quick tour * Update docs/source/quicktour.rst Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Update from feedback Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-06-22 16:08:09 -04:00
Thomas Wolf	75e1eed8d1	Cleaner warning when loading pretrained models (#4557 ) * Cleaner warning when loading pretrained models This make more explicit logging messages when using the various `from_pretrained` methods. It also make these messages as `logging.warning` because it's a common source of silent mistakes. * Update src/transformers/modeling_utils.py Co-authored-by: Julien Chaumond <chaumond@gmail.com> * Update src/transformers/modeling_utils.py Co-authored-by: Julien Chaumond <chaumond@gmail.com> * style and quality Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-06-22 21:58:47 +02:00
Lysandre Debut	4e741efa92	Have documentation fail on warning (#5189 ) * Have documentation fail on warning * Force ci failure * Revert "Force ci failure" This reverts commit f0a4666ec2eb4cd00a4da48af3357defc63324a0.	2020-06-22 15:49:50 -04:00
Sylvain Gugger	1262495a91	Add TF auto model to the docs + fix sphinx warnings (#5187 )	2020-06-22 14:43:52 -04:00
Adriano Diniz	88429c57bc	Create README.md (#5165 )	2020-06-22 13:49:14 -04:00
Manuel Romero	76ee9c8bc9	Create README.md (#5107 ) * Create README.md @julien-c check out that dataset meta tag is right * Fix typo Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-06-22 13:47:30 -04:00
Manuel Romero	bf493d5569	Model card for t5-base-finetuned-emotion (recognition) (#5179 )	2020-06-22 13:45:45 -04:00
Patrick von Platen	e9ef21175e	improve doc (#5185 )	2020-06-22 19:00:11 +02:00
Thomas Wolf	ebc36108dc	[tokenizers] Fix #5081 and improve backward compatibility (#5125 ) * fix #5081 and improve backward compatibility (slightly) * add nlp to setup.cfg - style and quality * align default to previous default * remove test that doesn't generalize	2020-06-22 17:25:43 +02:00
Malte	d2a7c86dc3	Check if `text` is set to avoid IndexError (#4209 ) Fix for https://github.com/huggingface/transformers/issues/3809	2020-06-22 11:09:05 -04:00
Iz Beltagy	90f4b24520	Add support for gradient checkpointing in BERT (#4659 ) * add support for gradient checkpointing in BERT * fix unit tests * isort * black * workaround for `torch.utils.checkpoint.checkpoint` not accepting bool * Revert "workaround for `torch.utils.checkpoint.checkpoint` not accepting bool" This reverts commit 5eb68bb804f5ffbfc7ba13c45a47717f72d04574. * workaround for `torch.utils.checkpoint.checkpoint` not accepting bool Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-06-22 10:47:14 -04:00
Joseph Liu	f4e1f02210	Output hidden states (#4978 ) * Configure all models to use output_hidden_states as argument passed to foward() * Pass all tests * Remove cast_bool_to_primitive in TF Flaubert model * correct tf xlnet * add pytorch test * add tf test * Fix broken tests * Configure all models to use output_hidden_states as argument passed to foward() * Pass all tests * Remove cast_bool_to_primitive in TF Flaubert model * correct tf xlnet * add pytorch test * add tf test * Fix broken tests * Refactor output_hidden_states for mobilebert * Reset and remerge to master Co-authored-by: Joseph Liu <joseph.liu@coinflex.com> Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>	2020-06-22 10:10:45 -04:00
Kevin Canwen Xu	866a8ccabb	Add model cards for Microsoft's MiniLM (#5178 ) * Add model cards for Microsoft's MiniLM * XLMRobertaTokenizer * format * Add thumbnail * finishing up	2020-06-22 21:48:14 +08:00
RafaelWO	b99ad457f4	Added feature to move added tokens in vocabulary for Transformer-XL (#4953 ) * Fixed resize_token_embeddings for transfo_xl model * Fixed resize_token_embeddings for transfo_xl. Added custom methods to TransfoXLPreTrainedModel for resizing layers of the AdaptiveEmbedding. * Updated docstring * Fixed resizinhg cutoffs; added check for new size of embedding layer. * Added test for resize_token_embeddings * Fixed code quality * Fixed unchanged cutoffs in model.config * Added feature to move added tokens in tokenizer. * Fixed code quality * Added feature to move added tokens in tokenizer. * Fixed code quality * Fixed docstring, renamed sym to oken. Co-authored-by: Rafael Weingartner <rweingartner.its-b2015@fh-salzburg.ac.at>	2020-06-22 15:40:52 +02:00
Sylvain Gugger	eb0ca71ef6	Update glossary (#5148 ) * Update glossary * Update docs/source/glossary.rst Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2020-06-22 08:30:49 -04:00
Patrick von Platen	fa0be6d761	Benchmarks (#4912 ) * finish benchmark * fix isort * fix setup cfg * retab * fix time measuring of tf graph mode * fix tf cuda * clean code * better error message	2020-06-22 12:06:56 +02:00
Zihao Fu	18a0150bfa	fix bart doc (#5132 ) fix bart doc	2020-06-22 10:58:28 +02:00
Mikael Souza	3fe75c7f70	Fixing docs for Encoder Decoder Config (#5171 )	2020-06-22 10:51:17 +02:00
flozi00	59345cc87f	Typo (#5147 )	2020-06-22 10:49:23 +02:00
Ilya Boytsov	bc3a0c0607	[examples] fixes arguments for summarization finetune scripts (#5157 ) Authored-by: i.boytsov <i.boytsov@MAC867.local>	2020-06-21 11:51:21 -04:00
Tim Suchanek	68e19f1c22	Fix typo in root README (#5073 )	2020-06-20 23:00:04 +08:00
Kevin Canwen Xu	c0c577cf8f	Fix PABEE's result table (#5158 )	2020-06-20 22:56:39 +08:00
Julien Chaumond	aa6a29bc25	SummarizationPipeline: init required task name (#5086 ) * SummarizationPipeline: init required task name * Update src/transformers/pipelines.py Co-authored-by: Sam Shleifer <sshleifer@gmail.com> * Apply suggestions from code review Co-authored-by: Sam Shleifer <sshleifer@gmail.com>	2020-06-20 03:16:30 -04:00
Kevin Canwen Xu	2fd28d4363	Add BERT Loses Patience (Patience-based Early Exit) (#5078 ) * Add BERT Loses Patience (Patience-based Early Exit) * update model archive * update format * sort import * flake8 * Add results * full results * align the table * refactor to inherit * default per gpu eval = 1 * Formatting * Formatting * isort * modify readme * Add check * Fix format * Fix format * Doc strings * ALBERT & BERT for sequence classification don't inherit from the original anymore * Remove incorrect comments * Remove incorrect comments * Remove incorrect comments * Sync up with new code * Sync up with new code * Add a test * Add a test * Add a test * Add a test * Add a test * Add a test * Finishing up!	2020-06-20 13:41:46 +08:00
Zhu Baohe	f1679d7c48	Fix dropout in TFMobileBert (#5150 )	2020-06-20 13:21:19 +08:00
Kevin Canwen Xu	5ed94b2312	Update note to avoid confusion (#5131 )	2020-06-20 10:13:34 +08:00
Lysandre	d97b4176e5	Correct device assignment	2020-06-19 21:58:28 -04:00
Vasily Shamporov	9a3f91088c	Add MobileBert (#4901 ) * Add MobileBert * Quality + Conversion script * style * Update src/transformers/modeling_mobilebert.py * Links to S3 * Style * TFMobileBert Slight fixes to the pytorch MobileBert Style * MobileBertForMaskedLM (PT + TF) * MobileBertForNextSentencePrediction (PT + TF) * MobileFor{MultipleChoice, TokenClassification} (PT + TF) ss * Tests + Auto * Doc * Tests * Addressing @sgugger's comments * Adressing @patrickvonplaten's comments * Style * Style * Integration test * style * Model card Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-06-19 16:38:36 -04:00
Sam Shleifer	f45e873910	[bart-mnli] Fix class flipping bug (#5141 )	2020-06-19 13:33:24 -04:00
Erick Rocha Fonseca	e33929ef1e	Fix in Reformer Config documentation (#5138 )	2020-06-19 15:41:31 +02:00
Sam Shleifer	84be482f66	AutoTokenizer supports mbart-large-en-ro (#5121 )	2020-06-18 20:47:37 -04:00
Sam Shleifer	2db1e2f415	[cleanup] remove redundant code in SummarizationDataset (#5119 )	2020-06-18 20:34:48 -04:00
Sylvain Gugger	5f721ad6e4	Fix #5114 (#5122 )	2020-06-18 19:20:04 -04:00
Pri Oberoi	a258982af3	Add missing arg in 02-transformers notebook (#5085 ) * Add missing arg when creating model * Fix typos * Remove from_tf flag when creating model	2020-06-18 19:04:04 -04:00
Deniz	32e94cff64	tf add resize_token_embeddings method (#4351 ) * resize token embeddings * add tokens * add tokens * add tokens * add t5 token method * add t5 token method * add t5 token method * typo * debugging input * debugging input * debug * debug * debug * trying to set embedding tokens properly * set embeddings for generation head too * set embeddings for generation head too * debugging * debugging * enable generation * add base method * add base method * add base method * return logits in the main call * reverting to generation * revert back * set embeddings for the bert main layer * description * fix conflicts * logging * set base model as self * refactor * tf_bert add method * tf_bert add method * tf_bert add method * tf_bert add method * tf_bert add method * tf_bert add method * tf_bert add method * tf_bert add method * v0 * v0 * finalize * final * black * add tests * revert back the emb call * comments * comments * add the second test * add vocab size condig * add tf models * add tf models. add common tests * remove model specific embedding tests * stylish * remove files * stylez * Update src/transformers/modeling_tf_transfo_xl.py change the error. Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * adding unchanged weight test Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-06-18 18:41:26 -04:00
Lysandre Debut	973433260e	Pin `sphinx-rtd-theme` (#5128 )	2020-06-18 18:07:59 -04:00
Sam Shleifer	8a377c3d6e	[fix] Move _adjust_logits above postprocess to fix Marian.generate (#5126 )	2020-06-18 18:06:27 -04:00
Sam Shleifer	3d3e605aff	[cleanup] generate_beam_search comments (#5115 )	2020-06-18 16:30:24 -04:00
Suraj Patil	ca2d0f98c4	ElectraForMultipleChoice (#4954 ) * add ElectraForMultipleChoice * add test_for_multiple_choice * add ElectraForMultipleChoice in auto model * add ElectraForMultipleChoice in all_model_classes * add SequenceSummary related parameters * get rid pooler, use SequenceSummary instead * add electra multiple choice test Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-06-18 14:59:35 -04:00
Ori Garin	279d8e24f7	support local_files_only option for tf models (#5116 )	2020-06-18 13:47:05 -04:00
Julien Chaumond	355954ffca	Create distilbert-base-uncased-distilled-squad-README.md	2020-06-18 05:17:45 -04:00
Suraj Patil	18177a1a60	lm_labels => labels (#5080 )	2020-06-18 09:16:29 +02:00
Lysandre	efeb75b805	Remove misleading comment closes #4958	2020-06-17 18:24:35 -04:00
Saurabh Misra	bb154ac50c	Fixing TPU training by disabling wandb.watch gradients logging for TPU (#4926 )	2020-06-17 18:04:11 -04:00
Suraj Patil	fb6cccb863	fix qa example (#4929 )	2020-06-17 17:54:16 -04:00
Karthikeyan Singaravelan	38bba9cdd5	Fix deprecation warnings due to invalid escape sequences. (#4924 )	2020-06-17 17:46:58 -04:00


4475 commits (all branches)
