Граф коммитов

2214 Коммитов

Автор SHA1 Сообщение Дата
Sharat Chikkerur 34dd431bc3
Adding additional models to text classification
Adding additional models for text classification. 
Also adding languages based on the list of currently supported models
['albert-base-v1',
 'albert-base-v2',
 'albert-large-v1',
 'albert-large-v2',
 'albert-xlarge-v1',
 'albert-xlarge-v2',
 'albert-xxlarge-v1',
 'albert-xxlarge-v2',
 'bart-large',
 'bart-large-cnn',
 'bart-large-mnli',
 'bart-large-xsum',
 'bert-base-cased',
 'bert-base-cased-finetuned-mrpc',
 'bert-base-chinese',
 'bert-base-dutch-cased',
 'bert-base-finnish-cased-v1',
 'bert-base-finnish-uncased-v1',
 'bert-base-german-cased',
 'bert-base-german-dbmdz-cased',
 'bert-base-german-dbmdz-uncased',
 'bert-base-japanese',
 'bert-base-japanese-char',
 'bert-base-japanese-char-whole-word-masking',
 'bert-base-japanese-whole-word-masking',
 'bert-base-multilingual-cased',
 'bert-base-multilingual-uncased',
 'bert-base-uncased',
 'bert-large-cased',
 'bert-large-cased-whole-word-masking',
 'bert-large-cased-whole-word-masking-finetuned-squad',
 'bert-large-uncased',
 'bert-large-uncased-whole-word-masking',
 'bert-large-uncased-whole-word-masking-finetuned-squad',
 'camembert-base',
 'distilbert-base-cased',
 'distilbert-base-cased-distilled-squad',
 'distilbert-base-german-cased',
 'distilbert-base-multilingual-cased',
 'distilbert-base-uncased',
 'distilbert-base-uncased-distilled-squad',
 'distilbert-base-uncased-finetuned-sst-2-english',
 'distilroberta-base',
 'flaubert-base-cased',
 'flaubert-base-uncased',
 'flaubert-large-cased',
 'flaubert-small-cased',
 'mbart-large-en-ro',
 'roberta-base',
 'roberta-base-openai-detector',
 'roberta-large',
 'roberta-large-mnli',
 'roberta-large-openai-detector',
 'umberto-commoncrawl-cased-v1',
 'umberto-wikipedia-uncased-v1',
 'xlm-clm-ende-1024',
 'xlm-clm-enfr-1024',
 'xlm-mlm-100-1280',
 'xlm-mlm-17-1280',
 'xlm-mlm-en-2048',
 'xlm-mlm-ende-1024',
 'xlm-mlm-enfr-1024',
 'xlm-mlm-enro-1024',
 'xlm-mlm-tlm-xnli15-1024',
 'xlm-mlm-xnli15-1024',
 'xlm-roberta-base',
 'xlm-roberta-large',
 'xlm-roberta-large-finetuned-conll02-dutch',
 'xlm-roberta-large-finetuned-conll02-spanish',
 'xlm-roberta-large-finetuned-conll03-english',
 'xlm-roberta-large-finetuned-conll03-german',
 'xlnet-base-cased',
 'xlnet-large-cased']
2020-06-13 21:21:30 -04:00
Said Bleik a011680b0c
Merge pull request #593 from microsoft/emawa/mnli_download_preprocessed_tsv_data
Feat: add functionality to download MNLI preprocessed tsv data.
2020-06-11 09:49:17 -04:00
Said Bleik e1c476ff63
Merge pull request #594 from microsoft/update-notice-patch-1
Update NOTICE.txt
2020-06-11 09:49:02 -04:00
Said Bleik 96d1076405
Update NOTICE.txt 2020-06-10 23:10:04 -04:00
Emmanuel Awa 2091c3895e Feat: add functionality to download MNLI preprocessed tsv data.
Leverage NYU Jiant Toolkit preprocessed tsv data source
2020-06-08 17:38:50 +00:00
Said Bleik 0650ccf256
Merge pull request #592 from microsoft/add-xlm-cg-patch-1
added xlm
2020-06-01 13:26:15 -04:00
Said Bleik 6227ac0f84
added xlm 2020-06-01 12:56:00 -04:00
Said Bleik ab25969e51
Merge pull request #590 from microsoft/bleik/add-models
Bleik/add models - minor edits
2020-05-20 10:28:53 -04:00
saidbleik db9f076d68 minor edits 2020-05-20 14:26:05 +00:00
saidbleik af4df5c06f add load model fn 2020-05-20 14:23:25 +00:00
Said Bleik aa7340c34f
Merge pull request #588 from mikaelsouza/fixing-capitalization-and-table-headers
Fixing capitalization and table headers
2020-05-13 15:28:34 -04:00
Mikael Souza a363eca50d Added references table header 2020-05-13 12:01:16 -04:00
Mikael Souza 9d4743458e Fixed capitalization in README.md 2020-05-13 12:00:46 -04:00
Said Bleik e02e3b5525
Merge pull request #586 from microsoft/bleik/add-models
update utils and examples
2020-05-13 00:20:56 -04:00
Said Bleik 30afaa4619
Merge pull request #587 from microsoft/azure-pipelines
Update gpu_unit_tests_linux.yml for Azure Pipelines
2020-05-13 00:20:40 -04:00
Said Bleik b34a71566b
Merge branch 'staging' into azure-pipelines 2020-05-13 00:20:28 -04:00
Said Bleik cd50ef868d Update gpu_unit_tests_linux.yml for Azure Pipelines 2020-05-13 00:16:23 -04:00
saidbleik 2e92e319b3 Merge branch 'bleik/add-models' of https://github.com/microsoft/nlp-recipes into bleik/add-models 2020-05-12 22:06:45 +00:00
saidbleik ad323753e4 fix ner test 2020-05-12 22:06:10 +00:00
Said Bleik 477db37474
Merge branch 'staging' into bleik/add-models 2020-05-11 12:02:06 -04:00
saidbleik 9d9ef2001b resolve conflicts 2020-05-11 14:56:11 +00:00
saidbleik 78726ff471 update summarization models 2020-05-08 22:38:54 +00:00
saidbleik 8f965d44e5 qa updates 2020-05-08 20:12:04 +00:00
saidbleik c1aaf2592e common updates 2020-05-08 20:11:34 +00:00
saidbleik e418fc9b75 update seq classification utils 2020-05-08 17:33:28 +00:00
saidbleik 2c3b38b1c9 update ner utils 2020-05-08 17:32:48 +00:00
saidbleik 71921dacb4 update NER example 2020-05-08 17:31:11 +00:00
saidbleik d331f321cd minor edit 2020-05-05 15:18:26 +00:00
saidbleik eab92fa40d update NER example 2020-05-05 02:46:25 +00:00
Daisy Deng 75e70a2470
Merge pull request #584 from microsoft/daden/minilm
Daden/minilm Add MiniLM for Abstractive Summarization
2020-04-20 23:37:34 -04:00
Daisy Deng 886e44ba46 cleanup and add documentation 2020-04-21 01:30:36 +00:00
Daisy Deng 4a46755fe7 add timer for fitting time 2020-04-21 01:03:28 +00:00
Daisy Deng b2183d850f fix notebook 2020-04-17 19:20:48 +00:00
Daisy Deng ac90fd7766 modify the notebook according to review comments 2020-04-17 17:59:04 +00:00
Daisy Deng 910ff02644 increase the duration of the test before canceling due to timeout 2020-04-16 19:15:43 +00:00
Daisy Deng b04b38354b update documentation 2020-04-16 18:59:10 +00:00
Daisy Deng 3508bb24c4 add integration test 2020-04-16 18:51:13 +00:00
Daisy Deng 73961dbb8d add minilm notebook 2020-04-16 03:06:41 +00:00
Daisy Deng 8dbb0f9834 add printout to inform user about using the cached file 2020-04-16 03:06:05 +00:00
Daisy Deng bf218ec069 handle uncased model; add bertconfig for minilm 2020-04-14 18:49:33 +00:00
Daisy Deng bb1b5ceee2 add top_n when creating S2SAbsSumDataset; minor change according to the latest decoding script 2020-04-14 04:43:11 +00:00
Daisy Deng f43070f3f3 add minilm into tests 2020-04-11 03:07:58 +00:00
Daisy Deng da0fb13d53 Merge branch 'staging' of https://github.com/microsoft/nlp-recipes into daden/minilm 2020-04-10 21:51:32 +00:00
Daisy Deng 11551c5937 add minilm components 2020-04-10 21:51:05 +00:00
Said Bleik ebdff5278d
Merge pull request #580 from microsoft/staging
staging to master
2020-04-08 11:23:39 -04:00
Said Bleik 302228c63e
Merge branch 'master' into staging 2020-04-08 11:18:09 -04:00
Daisy Deng 01e8296b78
Merge pull request #582 from narayanacharya6/narayanacharya6/fix-local-entailment-example-link
Fixes link to example Python notebook for MultiNLI local environment …
2020-04-07 21:33:49 -04:00
Daisy Deng 6e6e08cd96 Update cpu_unit_tests_linux.yml for Azure Pipelines 2020-04-07 16:41:05 -04:00
Narayan Acharya cda24655e9 Fixes link to example Python notebook for MultiNLI local environment entailment example in README. 2020-04-07 15:37:53 -04:00
Daisy Deng fc61c3f713
Merge pull request #579 from microsoft/daden/unilm_cpu
Daden/unilm cpu  upgrade unilm utilities to use the latest s2s-ft package
2020-04-06 13:59:54 -04:00