Hong Lu
|
05d40f2e73
|
Added bert image to notebook.
|
2019-05-29 15:04:15 -04:00 |
Hong Lu
|
aaf0114cd7
|
Removed old scripts.
|
2019-05-29 14:57:23 -04:00 |
Hong Lu
|
074df70067
|
Removed old notebook.
|
2019-05-29 14:42:57 -04:00 |
Hong Lu
|
52bd027555
|
Added helper function for postprocessing token classification results.
|
2019-05-29 14:39:58 -04:00 |
Said Bleik
|
c30d63e459
|
Merge branch 'bleik' of https://github.com/Microsoft/NLP into bleik
|
2019-05-28 23:17:59 -04:00 |
Said Bleik
|
5a81055e70
|
updated device utils and bert seq classifier
|
2019-05-28 23:16:19 -04:00 |
Said Bleik
|
ba716d109a
|
Merge pull request #70 from microsoft/datasets
Datasets
|
2019-05-28 13:39:39 -04:00 |
Said Bleik
|
96b3015096
|
Merge pull request #72 from microsoft/abhiram-embedding-fix
Fix to limit the memory usage when using fasttext embedding loaders
|
2019-05-28 13:02:39 -04:00 |
Said Bleik
|
71fe5a4eb1
|
Merge pull request #69 from eisber/bleik
added missing pandas to environment.yml
|
2019-05-28 13:02:02 -04:00 |
Abhiram E
|
36d7411bec
|
Fix to limit the memory usage when using fasttext embedding loaders. Code changes to use the simpler version
|
2019-05-28 12:04:57 -04:00 |
miguelgfierro
|
7ffc3cb6f6
|
Merge branch 'datasets' of https://github.com/Microsoft/NLP into datasets
|
2019-05-28 16:06:52 +01:00 |
miguelgfierro
|
835492509b
|
minor 🐛 in readme
|
2019-05-28 15:57:56 +01:00 |
miguelgfierro
|
aee2197db4
|
add bigger tolerance
|
2019-05-28 13:58:05 +00:00 |
Markus Cozowicz
|
156ea29bb0
|
added missing pandas
|
2019-05-28 14:42:21 +01:00 |
miguelgfierro
|
403457b3c3
|
refactor 💥
|
2019-05-28 13:38:02 +01:00 |
Said Bleik
|
2dc37f87eb
|
Merge pull request #67 from microsoft/test
Test
|
2019-05-24 23:34:03 -04:00 |
Said Bleik
|
4fa1aa8bcd
|
Merge pull request #66 from microsoft/rijai/componentgovernance
Adding component governance tool to build pipeline.
|
2019-05-24 23:31:16 -04:00 |
Hong Lu
|
52cc16fb9b
|
Updated token classifier api.
|
2019-05-24 18:09:56 -04:00 |
Hong Lu
|
5258c9cd7e
|
Added some utility functions to the common script. Will be merged with common.py later.
|
2019-05-24 18:09:04 -04:00 |
Richin Jain
|
620f3ebe8c
|
Adding component governance tool to build pipeline.
|
2019-05-24 15:12:19 -04:00 |
miguelgfierro
|
3c1708a21d
|
readme update 📝
|
2019-05-24 18:32:28 +00:00 |
miguelgfierro
|
c8fc93d4b6
|
🐛
|
2019-05-24 17:57:53 +00:00 |
miguelgfierro
|
f0936bd9b1
|
added papermill
|
2019-05-24 16:54:17 +00:00 |
miguelgfierro
|
0f2fcd4f83
|
added new notebooks
|
2019-05-24 15:00:18 +00:00 |
miguelgfierro
|
f03f712cfa
|
added data integration tests with notebooks
|
2019-05-24 14:26:36 +00:00 |
miguelgfierro
|
03b3b387a6
|
refactoring tests
|
2019-05-24 14:06:22 +01:00 |
Said Bleik
|
2daeb1716e
|
Merge pull request #64 from microsoft/casey-gensen-noblank
Gensen noblank bugfix + Add preprocessing tests
|
2019-05-22 15:14:42 -04:00 |
Casey Hong
|
a1da16f391
|
use fixture directly
|
2019-05-22 12:42:01 -04:00 |
Casey Hong
|
1cd36ccff7
|
fix snli noblank bug and add preprocessing tests
|
2019-05-21 23:00:56 -04:00 |
Said Bleik
|
63e546ab3c
|
updated prerocessing, utils, classification
|
2019-05-21 16:45:23 -04:00 |
Hong Lu
|
2473e1a75c
|
Black auto formatting.
|
2019-05-20 18:53:57 -04:00 |
Hong Lu
|
3d1c1862d9
|
Removed old data utils script.
|
2019-05-20 14:08:39 -04:00 |
Hong Lu
|
4a41ec41e8
|
Added a constant file.
|
2019-05-20 14:00:12 -04:00 |
Hong Lu
|
1393c74fb3
|
Minor updates for data class updates.
|
2019-05-20 13:59:38 -04:00 |
Hong Lu
|
9919a7bd35
|
Remived InputFeature class. Use namedtuple instead of class for input data.
|
2019-05-20 13:58:54 -04:00 |
Hong Lu
|
e81138ad08
|
Changed optimizer and number of epochs configuration.
|
2019-05-20 13:58:16 -04:00 |
Said Bleik
|
49bb116474
|
update seq classifer
|
2019-05-17 10:04:46 -04:00 |
Hong Lu
|
eef85dea41
|
Consolidated all configuration classes into a single class.
|
2019-05-16 18:11:21 -04:00 |
Hong Lu
|
7ca29691ae
|
Consolidated some utility functions into BertTokenClassifier.
|
2019-05-16 18:10:47 -04:00 |
Hong Lu
|
d87dfbc2af
|
Minor edits and added docstring.
|
2019-05-16 18:10:14 -04:00 |
Hong Lu
|
2732da2717
|
Updated NER notebook with new BertTokenClassifier class.
|
2019-05-16 18:09:40 -04:00 |
Hong Lu
|
14543fbd52
|
Added yaml configuration file for NER example.
|
2019-05-16 18:08:50 -04:00 |
Miguel González-Fierro
|
9bd941d2f8
|
Merge pull request #61 from microsoft/abhiram-gensim-limit
Added option to limit number of word vectors for glove and word2vec
|
2019-05-16 13:04:41 +01:00 |
Abhiram E
|
ce6d783adf
|
Separated the asserts in tests
|
2019-05-15 10:51:56 -04:00 |
Miguel González-Fierro
|
7aa740606d
|
Merge pull request #59 from microsoft/issue_template
issue template
|
2019-05-15 15:17:51 +01:00 |
Abhiram E
|
52d720e9bf
|
Added option to limit number of word vectors for glove and word2vec
|
2019-05-15 00:22:37 -04:00 |
miguelgfierro
|
a5144f2626
|
issue template
|
2019-05-14 12:21:40 +01:00 |
Said Bleik
|
33da65e0a3
|
Merge pull request #58 from microsoft/janhavi-update-snliNB
[Fix] SNLI notebook and preprocess.py
|
2019-05-13 21:45:01 -04:00 |
Janhavi Mahajan
|
1ed2c4dc0a
|
feat(bug fix) updated snli notebook with to_lowercase_all() instead of to_lowercase() that expects a column name list. Fixed None object returning in to_lowercase when column name list is not passed
|
2019-05-13 18:14:31 -04:00 |
Said Bleik
|
e9c17a961e
|
update BERTSequenceClassifier and notebook
|
2019-05-13 15:18:21 -04:00 |