Граф коммитов

26 Коммитов

Автор SHA1 Сообщение Дата
Casey Hong e5b12c6f32 resolve merge conflicts 2019-06-11 11:45:30 -04:00
abeswara 008bfa2c57 Glove loader - Code changes and unit tests.
1. Added methods to download, extract and load glove vectors.
2. Added units tests to test the public methods.

Other changes
 1. Made download and extract methods private.
 2. Refactored Word2vec unit tests to exclude private methods.
2019-06-06 10:16:46 -07:00
abeswara c9006c8b65 Word2vec loader - Code changes and unit tests.
1. Refactored word2vec loader to perform existing file checks before downloading or extracting.

2. Added units tests to load, download and extract functions.
2019-06-06 10:13:10 -07:00
Said Bleik ba716d109a
Merge pull request #70 from microsoft/datasets
Datasets
2019-05-28 13:39:39 -04:00
Abhiram E 36d7411bec Fix to limit the memory usage when using fasttext embedding loaders. Code changes to use the simpler version 2019-05-28 12:04:57 -04:00
miguelgfierro aee2197db4 add bigger tolerance 2019-05-28 13:58:05 +00:00
miguelgfierro 3c1708a21d readme update 📝 2019-05-24 18:32:28 +00:00
miguelgfierro f0936bd9b1 added papermill 2019-05-24 16:54:17 +00:00
miguelgfierro 0f2fcd4f83 added new notebooks 2019-05-24 15:00:18 +00:00
miguelgfierro f03f712cfa added data integration tests with notebooks 2019-05-24 14:26:36 +00:00
miguelgfierro 03b3b387a6 refactoring tests 2019-05-24 14:06:22 +01:00
Casey Hong a1da16f391 use fixture directly 2019-05-22 12:42:01 -04:00
Casey Hong 1cd36ccff7 fix snli noblank bug and add preprocessing tests 2019-05-21 23:00:56 -04:00
Abhiram E ce6d783adf Separated the asserts in tests 2019-05-15 10:51:56 -04:00
Abhiram E 52d720e9bf Added option to limit number of word vectors for glove and word2vec 2019-05-15 00:22:37 -04:00
Abhiram E b955e53d0d Added smoke tests to verify extracted sizes of pretrained vectors 2019-05-09 14:58:11 -04:00
Abhiram E 2502d91e1b FastText loader - Code changes and unit tests.
1. Added methods to download, extract and load glove vectors.
2. Added units test to test the public method.

Other changes
 1. Refactored files to add return types to docstrings.
 2. Minor changes to path variables.
2019-05-09 14:58:11 -04:00
abeswara 8025b4449d Glove loader - Code changes and unit tests.
1. Added methods to download, extract and load glove vectors.
2. Added units tests to test the public methods.

Other changes
 1. Made download and extract methods private.
 2. Refactored Word2vec unit tests to exclude private methods.
2019-05-09 14:58:10 -04:00
abeswara 8203b0150d Word2vec loader - Code changes and unit tests.
1. Refactored word2vec loader to perform existing file checks before downloading or extracting.

2. Added units tests to load, download and extract functions.
2019-05-09 14:58:10 -04:00
abeswara 8408d7cce2 Word2vec loader - Code changes and unit tests.
1. Refactored word2vec loader to perform existing file checks before downloading or extracting.

2. Added units tests to load, download and extract functions.
2019-05-09 14:58:10 -04:00
miguelgfierro 3c3ce8c14a got timer from recommenders 2019-05-09 17:25:44 +01:00
abeswara 56affe3f84 Added smoke test to verify MSRPC installer download 2019-05-02 12:06:52 -04:00
Abhiram E b9fce4ae61 Notebooks and Tests
1. Added Jupyter Notebook for MSR-PC dataset quickstart task
2. Added unit tests for downloading the dataset and loading pandas df
3. Changes to MSRPC to take in path to the dataset if it already exists.
2019-04-24 15:05:00 -04:00
miguelgfierro fcd6eef796 tests init 2019-04-16 19:57:14 +01:00
Richin Jain 804ec943fd Removing unused code that comes from template 2019-04-16 13:05:25 -04:00
Richin Jain 2c5b8e587e Intial commit to put the receipe template in 2019-04-05 13:55:58 -04:00