Граф коммитов

2241 Коммитов

Автор SHA1 Сообщение Дата
abeswara 56affe3f84 Added smoke test to verify MSRPC installer download 2019-05-02 12:06:52 -04:00
abeswara 84ac44cbc0 Resolved code review comments 2019-05-02 12:06:52 -04:00
janhavi13 ae2ad10f11
Merge pull request #32 from Microsoft/janhavi-code-review-comments
Resolved code review comments
2019-05-02 12:06:06 -04:00
Hong Lu d5ee6d46cb Initial check in of bert utility functions. 2019-05-02 10:50:30 -04:00
Janhavi Mahajan 62d4ede04e delete snli_preprocess 2019-05-01 19:51:51 -04:00
Janhavi Mahajan 1b811e22f0 fixed readme links 2019-05-01 19:50:13 -04:00
Janhavi Mahajan 54bc05c917 feat(docs) update prep_data readme for SNLI 2019-05-01 19:48:29 -04:00
Janhavi Mahajan 13622549b0 feat(prep-data notebook) Notebook that shows preprocessing for SNLI using different utils in NLP 2019-05-01 19:40:29 -04:00
Said Bleik 10adf59777 update env, yahoo_answers, & classification eval 2019-05-01 22:49:41 +00:00
Janhavi Mahajan 338e606c5e feat(code review comments) refactoring based on Miguel's comments 2019-05-01 18:40:44 -04:00
Casey Hong 3ec60ab690 edit readme 2019-05-01 18:35:02 -04:00
Casey Hong 810beb6f2c organize stsbenchmark under new folder structure 2019-05-01 18:35:02 -04:00
Abhiram E a278a3f220
Merge pull request #29 from Microsoft/abhiram-new-folder-structure
Moved to the new folder structure
2019-05-01 14:05:28 -04:00
abeswara 54abdc88ca Moved to the new folder structure 2019-05-01 13:48:09 -04:00
Casey Hong 25a176b2cc rm_stopwords suffix 2019-04-30 15:05:17 -04:00
Said Bleik 45c8d98a53 clean folders 2019-04-30 16:50:21 +00:00
Said Bleik 757e7d063d
Merge pull request #28 from Microsoft/maidap-sentence-similarity
Sentence similarity dataset
2019-04-30 12:26:04 -04:00
Said Bleik 739651267d delete example templates 2019-04-30 16:11:39 +00:00
Said Bleik f2467d5286 folder structure & example utils 2019-04-30 15:51:47 +00:00
Casey Hong 8469457355 trigger builds for PRs opened against the maidap branch 2019-04-30 11:23:50 -04:00
Said Bleik bbaa591e9f revert - push to staging 2019-04-29 20:01:45 +00:00
Casey Hong baffaba839 fix links 2019-04-29 14:59:52 -04:00
Casey Hong 1bd9a132d6 add links 2019-04-29 14:59:52 -04:00
Casey Hong dc4eac5aee refactor for consistency between snli <=> sts notebooks, add gensen-specific preprocessing for snli 2019-04-29 14:59:52 -04:00
Casey Hong 1aa60a3a00 begin snli-sts consistency refactoring 2019-04-29 14:59:52 -04:00
Casey Hong 3c0ef18050 Delete top-level examples folder 2019-04-26 14:16:11 -04:00
Janhavi Mahajan 173bac7128 feat(code refactor) add output to jupyter notebooks 2019-04-26 12:11:55 -04:00
Janhavi Mahajan 1498bfb853 feat(code refactoring) moving code around as per the new structure decided. 2019-04-26 12:11:55 -04:00
Janhavi Mahajan 2c0114f955 feat(notebook bugs) renamed all utils from util_nlp to util_ss 2019-04-26 12:11:55 -04:00
Janhavi Mahajan 0395f0b308 feat(code cleanup, update env.yaml with nltk package) 2019-04-26 12:11:55 -04:00
Janhavi Mahajan ba2ad0cbfa feat(code reformat) deleted snli from util_nlp 2019-04-26 12:11:55 -04:00
Janhavi Mahajan f0070819ea feat(code reformat) Formatting code based on new folder structure 2019-04-26 12:11:55 -04:00
Janhavi Mahajan eb7625041d update env.yaml with pip dependencies 2019-04-26 12:11:55 -04:00
Janhavi Mahajan 4aadf66654 feat(code reformat) moved nltk utils to preprocess.py 2019-04-26 12:11:55 -04:00
Janhavi Mahajan faa26b3c54 feat(doc strings) fixed doc string format 2019-04-26 12:11:55 -04:00
Janhavi Mahajan 88e5a3d724 feat(code format) formatted file with black 2019-04-26 12:11:55 -04:00
Janhavi Mahajan c969085424 feat(code format) added doc strings, rewrite clean_snli function 2019-04-26 12:11:55 -04:00
Janhavi Mahajan 44db348fe5 feat(data prep) save dataframe to csv and renamed folder from nltk to nltk_utils 2019-04-26 12:11:55 -04:00
Janhavi Mahajan 6e46eade15 feat(data_prep) SNLI notebook showcasing data prep, Corrected nltk util for column_name 2019-04-26 12:11:55 -04:00
Janhavi Mahajan 3964c04a7c feat(data prep) NLTK tokenizer util file and notebook, deleted some redundant files, updated snli util with cleaner data prep functions 2019-04-26 12:11:55 -04:00
Janhavi Mahajan f7b487cfbd feat(data_prep) Added SNLI dataset prep utility 2019-04-26 12:11:55 -04:00
Abhiram E 4dfec0667f
Merge pull request #17 from Microsoft/abhiram-msrpc
Data loader for MSR PC
2019-04-25 17:16:02 -04:00
Abhiram E 04b8715ea9 Moved from absolute path to relative 2019-04-25 17:07:45 -04:00
Abhiram E 84443d478c Refactored STS notebooks, updated utils_nlp files with the latest code from utils_ss and deleted utils_ss 2019-04-24 17:16:06 -04:00
Abhiram E ffb38ea42b Refactored code according to new structure, moved files and modified imports 2019-04-24 15:33:41 -04:00
Abhiram E d4db5a1860 Resolving code review comments.
1. Refactored and renamed msrpc_load notebook.
2. Removed redundant parameter to load_pandas_df function
2019-04-24 15:05:53 -04:00
Abhiram E f66ee268c0 Refactoring changes to MSRPC 2019-04-24 15:05:52 -04:00
Abhiram E 92416de9bf Fixed umerged paths in gitignore 2019-04-24 15:05:52 -04:00
Abhiram E b9fce4ae61 Notebooks and Tests
1. Added Jupyter Notebook for MSR-PC dataset quickstart task
2. Added unit tests for downloading the dataset and loading pandas df
3. Changes to MSRPC to take in path to the dataset if it already exists.
2019-04-24 15:05:00 -04:00
Abhiram E ac0abdfd61 Data loader for MSR PC
1. Added data downloader for MSR PC
2. Added support to clean data and load specified datasets as a
pandas dataframe.
3. Updates to environment.yml for newly added packages.
2019-04-24 15:03:41 -04:00