Граф коммитов

2209 Коммитов

Автор SHA1 Сообщение Дата
Hong Lu 05d40f2e73 Added bert image to notebook. 2019-05-29 15:04:15 -04:00
Hong Lu aaf0114cd7 Removed old scripts. 2019-05-29 14:57:23 -04:00
Hong Lu 074df70067 Removed old notebook. 2019-05-29 14:42:57 -04:00
Hong Lu 52bd027555 Added helper function for postprocessing token classification results. 2019-05-29 14:39:58 -04:00
Said Bleik c30d63e459 Merge branch 'bleik' of https://github.com/Microsoft/NLP into bleik 2019-05-28 23:17:59 -04:00
Said Bleik 5a81055e70 updated device utils and bert seq classifier 2019-05-28 23:16:19 -04:00
Said Bleik ba716d109a
Merge pull request #70 from microsoft/datasets
Datasets
2019-05-28 13:39:39 -04:00
Said Bleik 96b3015096
Merge pull request #72 from microsoft/abhiram-embedding-fix
Fix to limit the memory usage when using fasttext embedding loaders
2019-05-28 13:02:39 -04:00
Said Bleik 71fe5a4eb1
Merge pull request #69 from eisber/bleik
added missing pandas to environment.yml
2019-05-28 13:02:02 -04:00
Abhiram E 36d7411bec Fix to limit the memory usage when using fasttext embedding loaders. Code changes to use the simpler version 2019-05-28 12:04:57 -04:00
miguelgfierro 7ffc3cb6f6 Merge branch 'datasets' of https://github.com/Microsoft/NLP into datasets 2019-05-28 16:06:52 +01:00
miguelgfierro 835492509b minor 🐛 in readme 2019-05-28 15:57:56 +01:00
miguelgfierro aee2197db4 add bigger tolerance 2019-05-28 13:58:05 +00:00
Markus Cozowicz 156ea29bb0 added missing pandas 2019-05-28 14:42:21 +01:00
miguelgfierro 403457b3c3 refactor 💥 2019-05-28 13:38:02 +01:00
Said Bleik 2dc37f87eb
Merge pull request #67 from microsoft/test
Test
2019-05-24 23:34:03 -04:00
Said Bleik 4fa1aa8bcd
Merge pull request #66 from microsoft/rijai/componentgovernance
Adding component governance tool to build pipeline.
2019-05-24 23:31:16 -04:00
Hong Lu 52cc16fb9b Updated token classifier api. 2019-05-24 18:09:56 -04:00
Hong Lu 5258c9cd7e Added some utility functions to the common script. Will be merged with common.py later. 2019-05-24 18:09:04 -04:00
Richin Jain 620f3ebe8c Adding component governance tool to build pipeline. 2019-05-24 15:12:19 -04:00
miguelgfierro 3c1708a21d readme update 📝 2019-05-24 18:32:28 +00:00
miguelgfierro c8fc93d4b6 🐛 2019-05-24 17:57:53 +00:00
miguelgfierro f0936bd9b1 added papermill 2019-05-24 16:54:17 +00:00
miguelgfierro 0f2fcd4f83 added new notebooks 2019-05-24 15:00:18 +00:00
miguelgfierro f03f712cfa added data integration tests with notebooks 2019-05-24 14:26:36 +00:00
miguelgfierro 03b3b387a6 refactoring tests 2019-05-24 14:06:22 +01:00
Said Bleik 2daeb1716e
Merge pull request #64 from microsoft/casey-gensen-noblank
Gensen noblank bugfix + Add preprocessing tests
2019-05-22 15:14:42 -04:00
Casey Hong a1da16f391 use fixture directly 2019-05-22 12:42:01 -04:00
Casey Hong 1cd36ccff7 fix snli noblank bug and add preprocessing tests 2019-05-21 23:00:56 -04:00
Said Bleik 63e546ab3c updated prerocessing, utils, classification 2019-05-21 16:45:23 -04:00
Hong Lu 2473e1a75c Black auto formatting. 2019-05-20 18:53:57 -04:00
Hong Lu 3d1c1862d9 Removed old data utils script. 2019-05-20 14:08:39 -04:00
Hong Lu 4a41ec41e8 Added a constant file. 2019-05-20 14:00:12 -04:00
Hong Lu 1393c74fb3 Minor updates for data class updates. 2019-05-20 13:59:38 -04:00
Hong Lu 9919a7bd35 Remived InputFeature class. Use namedtuple instead of class for input data. 2019-05-20 13:58:54 -04:00
Hong Lu e81138ad08 Changed optimizer and number of epochs configuration. 2019-05-20 13:58:16 -04:00
Said Bleik 49bb116474 update seq classifer 2019-05-17 10:04:46 -04:00
Hong Lu eef85dea41 Consolidated all configuration classes into a single class. 2019-05-16 18:11:21 -04:00
Hong Lu 7ca29691ae Consolidated some utility functions into BertTokenClassifier. 2019-05-16 18:10:47 -04:00
Hong Lu d87dfbc2af Minor edits and added docstring. 2019-05-16 18:10:14 -04:00
Hong Lu 2732da2717 Updated NER notebook with new BertTokenClassifier class. 2019-05-16 18:09:40 -04:00
Hong Lu 14543fbd52 Added yaml configuration file for NER example. 2019-05-16 18:08:50 -04:00
Miguel González-Fierro 9bd941d2f8
Merge pull request #61 from microsoft/abhiram-gensim-limit
Added option to limit number of word vectors for glove and word2vec
2019-05-16 13:04:41 +01:00
Abhiram E ce6d783adf Separated the asserts in tests 2019-05-15 10:51:56 -04:00
Miguel González-Fierro 7aa740606d
Merge pull request #59 from microsoft/issue_template
issue template
2019-05-15 15:17:51 +01:00
Abhiram E 52d720e9bf Added option to limit number of word vectors for glove and word2vec 2019-05-15 00:22:37 -04:00
miguelgfierro a5144f2626 issue template 2019-05-14 12:21:40 +01:00
Said Bleik 33da65e0a3
Merge pull request #58 from microsoft/janhavi-update-snliNB
[Fix] SNLI notebook and preprocess.py
2019-05-13 21:45:01 -04:00
Janhavi Mahajan 1ed2c4dc0a feat(bug fix) updated snli notebook with to_lowercase_all() instead of to_lowercase() that expects a column name list. Fixed None object returning in to_lowercase when column name list is not passed 2019-05-13 18:14:31 -04:00
Said Bleik e9c17a961e update BERTSequenceClassifier and notebook 2019-05-13 15:18:21 -04:00