Граф коммитов

389 Коммитов

Автор SHA1 Сообщение Дата
hlums aeb9486a6a Updated wikigold utils to be consistent with other datasets. 2019-06-23 21:57:04 +00:00
hlums 7c35f670d2 Added probabilities output to BERT token classifier. 2019-06-23 21:55:51 +00:00
hlums 044af9efef Updated ner token preprocessing for Chinese text. 2019-06-23 21:52:25 +00:00
Said Bleik 9c3a95159c add sequential loader test 2019-06-21 17:37:03 -04:00
Said Bleik ed4e09b9b3 edits to loader test 2019-06-21 16:33:37 -04:00
Said Bleik 0bae4ff59c add csv loader test 2019-06-21 16:18:48 -04:00
Abhiram E ed26f8fcc6 Integrated Mlflow with AzureMl Gensen deep dive notebook 2019-06-20 20:02:54 -04:00
miguelgfierro c9d587317e track github metrics 2019-06-18 20:25:27 +01:00
Ubuntu 9a72310b9d Added fixtures to ner test code. 2019-06-14 23:03:50 +00:00
Ubuntu cf05b5bba3 Added space before period in test data. 2019-06-13 20:50:03 +00:00
Ubuntu 5438d76596 Added test code for NER utils. 2019-06-13 18:25:27 +00:00
Hong Lu 6d671b6221 Started adding test code for NER. 2019-06-12 15:33:12 -04:00
Casey Hong e5b12c6f32 resolve merge conflicts 2019-06-11 11:45:30 -04:00
Chaoyu Guan f4f3591668 add explain-NLP-model part for issue #62 2019-06-08 12:51:44 +00:00
abeswara 008bfa2c57 Glove loader - Code changes and unit tests.
1. Added methods to download, extract and load glove vectors.
2. Added units tests to test the public methods.

Other changes
 1. Made download and extract methods private.
 2. Refactored Word2vec unit tests to exclude private methods.
2019-06-06 10:16:46 -07:00
abeswara c9006c8b65 Word2vec loader - Code changes and unit tests.
1. Refactored word2vec loader to perform existing file checks before downloading or extracting.

2. Added units tests to load, download and extract functions.
2019-06-06 10:13:10 -07:00
Said Bleik ba716d109a
Merge pull request #70 from microsoft/datasets
Datasets
2019-05-28 13:39:39 -04:00
Abhiram E 36d7411bec Fix to limit the memory usage when using fasttext embedding loaders. Code changes to use the simpler version 2019-05-28 12:04:57 -04:00
miguelgfierro aee2197db4 add bigger tolerance 2019-05-28 13:58:05 +00:00
miguelgfierro 3c1708a21d readme update 📝 2019-05-24 18:32:28 +00:00
miguelgfierro f0936bd9b1 added papermill 2019-05-24 16:54:17 +00:00
miguelgfierro 0f2fcd4f83 added new notebooks 2019-05-24 15:00:18 +00:00
miguelgfierro f03f712cfa added data integration tests with notebooks 2019-05-24 14:26:36 +00:00
miguelgfierro 03b3b387a6 refactoring tests 2019-05-24 14:06:22 +01:00
Casey Hong a1da16f391 use fixture directly 2019-05-22 12:42:01 -04:00
Casey Hong 1cd36ccff7 fix snli noblank bug and add preprocessing tests 2019-05-21 23:00:56 -04:00
Abhiram E ce6d783adf Separated the asserts in tests 2019-05-15 10:51:56 -04:00
Abhiram E 52d720e9bf Added option to limit number of word vectors for glove and word2vec 2019-05-15 00:22:37 -04:00
Abhiram E b955e53d0d Added smoke tests to verify extracted sizes of pretrained vectors 2019-05-09 14:58:11 -04:00
Abhiram E 2502d91e1b FastText loader - Code changes and unit tests.
1. Added methods to download, extract and load glove vectors.
2. Added units test to test the public method.

Other changes
 1. Refactored files to add return types to docstrings.
 2. Minor changes to path variables.
2019-05-09 14:58:11 -04:00
abeswara 8025b4449d Glove loader - Code changes and unit tests.
1. Added methods to download, extract and load glove vectors.
2. Added units tests to test the public methods.

Other changes
 1. Made download and extract methods private.
 2. Refactored Word2vec unit tests to exclude private methods.
2019-05-09 14:58:10 -04:00
abeswara 8203b0150d Word2vec loader - Code changes and unit tests.
1. Refactored word2vec loader to perform existing file checks before downloading or extracting.

2. Added units tests to load, download and extract functions.
2019-05-09 14:58:10 -04:00
abeswara 8408d7cce2 Word2vec loader - Code changes and unit tests.
1. Refactored word2vec loader to perform existing file checks before downloading or extracting.

2. Added units tests to load, download and extract functions.
2019-05-09 14:58:10 -04:00
miguelgfierro 3c3ce8c14a got timer from recommenders 2019-05-09 17:25:44 +01:00
abeswara 56affe3f84 Added smoke test to verify MSRPC installer download 2019-05-02 12:06:52 -04:00
Abhiram E b9fce4ae61 Notebooks and Tests
1. Added Jupyter Notebook for MSR-PC dataset quickstart task
2. Added unit tests for downloading the dataset and loading pandas df
3. Changes to MSRPC to take in path to the dataset if it already exists.
2019-04-24 15:05:00 -04:00
miguelgfierro fcd6eef796 tests init 2019-04-16 19:57:14 +01:00
Richin Jain 804ec943fd Removing unused code that comes from template 2019-04-16 13:05:25 -04:00
Richin Jain 2c5b8e587e Intial commit to put the receipe template in 2019-04-05 13:55:58 -04:00