abeswara
56affe3f84
Added smoke test to verify MSRPC installer download
2019-05-02 12:06:52 -04:00
abeswara
84ac44cbc0
Resolved code review comments
2019-05-02 12:06:52 -04:00
janhavi13
ae2ad10f11
Merge pull request #32 from Microsoft/janhavi-code-review-comments
...
Resolved code review comments
2019-05-02 12:06:06 -04:00
Hong Lu
d5ee6d46cb
Initial check in of bert utility functions.
2019-05-02 10:50:30 -04:00
Janhavi Mahajan
62d4ede04e
delete snli_preprocess
2019-05-01 19:51:51 -04:00
Janhavi Mahajan
1b811e22f0
fixed readme links
2019-05-01 19:50:13 -04:00
Janhavi Mahajan
54bc05c917
feat(docs) update prep_data readme for SNLI
2019-05-01 19:48:29 -04:00
Janhavi Mahajan
13622549b0
feat(prep-data notebook) Notebook that shows preprocessing for SNLI using different utils in NLP
2019-05-01 19:40:29 -04:00
Said Bleik
10adf59777
update env, yahoo_answers, & classification eval
2019-05-01 22:49:41 +00:00
Janhavi Mahajan
338e606c5e
feat(code review comments) refactoring based on Miguel's comments
2019-05-01 18:40:44 -04:00
Casey Hong
3ec60ab690
edit readme
2019-05-01 18:35:02 -04:00
Casey Hong
810beb6f2c
organize stsbenchmark under new folder structure
2019-05-01 18:35:02 -04:00
Abhiram E
a278a3f220
Merge pull request #29 from Microsoft/abhiram-new-folder-structure
...
Moved to the new folder structure
2019-05-01 14:05:28 -04:00
abeswara
54abdc88ca
Moved to the new folder structure
2019-05-01 13:48:09 -04:00
Casey Hong
25a176b2cc
rm_stopwords suffix
2019-04-30 15:05:17 -04:00
Said Bleik
45c8d98a53
clean folders
2019-04-30 16:50:21 +00:00
Said Bleik
757e7d063d
Merge pull request #28 from Microsoft/maidap-sentence-similarity
...
Sentence similarity dataset
2019-04-30 12:26:04 -04:00
Said Bleik
739651267d
delete example templates
2019-04-30 16:11:39 +00:00
Said Bleik
f2467d5286
folder structure & example utils
2019-04-30 15:51:47 +00:00
Casey Hong
8469457355
trigger builds for PRs opened against the maidap branch
2019-04-30 11:23:50 -04:00
Said Bleik
bbaa591e9f
revert - push to staging
2019-04-29 20:01:45 +00:00
Casey Hong
baffaba839
fix links
2019-04-29 14:59:52 -04:00
Casey Hong
1bd9a132d6
add links
2019-04-29 14:59:52 -04:00
Casey Hong
dc4eac5aee
refactor for consistency between snli <=> sts notebooks, add gensen-specific preprocessing for snli
2019-04-29 14:59:52 -04:00
Casey Hong
1aa60a3a00
begin snli-sts consistency refactoring
2019-04-29 14:59:52 -04:00
Casey Hong
3c0ef18050
Delete top-level examples folder
2019-04-26 14:16:11 -04:00
Janhavi Mahajan
173bac7128
feat(code refactor) add output to jupyter notebooks
2019-04-26 12:11:55 -04:00
Janhavi Mahajan
1498bfb853
feat(code refactoring) moving code around as per the new structure decided.
2019-04-26 12:11:55 -04:00
Janhavi Mahajan
2c0114f955
feat(notebook bugs) renamed all utils from util_nlp to util_ss
2019-04-26 12:11:55 -04:00
Janhavi Mahajan
0395f0b308
feat(code cleanup, update env.yaml with nltk package)
2019-04-26 12:11:55 -04:00
Janhavi Mahajan
ba2ad0cbfa
feat(code reformat) deleted snli from util_nlp
2019-04-26 12:11:55 -04:00
Janhavi Mahajan
f0070819ea
feat(code reformat) Formatting code based on new folder structure
2019-04-26 12:11:55 -04:00
Janhavi Mahajan
eb7625041d
update env.yaml with pip dependencies
2019-04-26 12:11:55 -04:00
Janhavi Mahajan
4aadf66654
feat(code reformat) moved nltk utils to preprocess.py
2019-04-26 12:11:55 -04:00
Janhavi Mahajan
faa26b3c54
feat(doc strings) fixed doc string format
2019-04-26 12:11:55 -04:00
Janhavi Mahajan
88e5a3d724
feat(code format) formatted file with black
2019-04-26 12:11:55 -04:00
Janhavi Mahajan
c969085424
feat(code format) added doc strings, rewrite clean_snli function
2019-04-26 12:11:55 -04:00
Janhavi Mahajan
44db348fe5
feat(data prep) save dataframe to csv and renamed folder from nltk to nltk_utils
2019-04-26 12:11:55 -04:00
Janhavi Mahajan
6e46eade15
feat(data_prep) SNLI notebook showcasing data prep, Corrected nltk util for column_name
2019-04-26 12:11:55 -04:00
Janhavi Mahajan
3964c04a7c
feat(data prep) NLTK tokenizer util file and notebook, deleted some redundant files, updated snli util with cleaner data prep functions
2019-04-26 12:11:55 -04:00
Janhavi Mahajan
f7b487cfbd
feat(data_prep) Added SNLI dataset prep utility
2019-04-26 12:11:55 -04:00
Abhiram E
4dfec0667f
Merge pull request #17 from Microsoft/abhiram-msrpc
...
Data loader for MSR PC
2019-04-25 17:16:02 -04:00
Abhiram E
04b8715ea9
Moved from absolute path to relative
2019-04-25 17:07:45 -04:00
Abhiram E
84443d478c
Refactored STS notebooks, updated utils_nlp files with the latest code from utils_ss and deleted utils_ss
2019-04-24 17:16:06 -04:00
Abhiram E
ffb38ea42b
Refactored code according to new structure, moved files and modified imports
2019-04-24 15:33:41 -04:00
Abhiram E
d4db5a1860
Resolving code review comments.
...
1. Refactored and renamed msrpc_load notebook.
2. Removed redundant parameter to load_pandas_df function
2019-04-24 15:05:53 -04:00
Abhiram E
f66ee268c0
Refactoring changes to MSRPC
2019-04-24 15:05:52 -04:00
Abhiram E
92416de9bf
Fixed umerged paths in gitignore
2019-04-24 15:05:52 -04:00
Abhiram E
b9fce4ae61
Notebooks and Tests
...
1. Added Jupyter Notebook for MSR-PC dataset quickstart task
2. Added unit tests for downloading the dataset and loading pandas df
3. Changes to MSRPC to take in path to the dataset if it already exists.
2019-04-24 15:05:00 -04:00
Abhiram E
ac0abdfd61
Data loader for MSR PC
...
1. Added data downloader for MSR PC
2. Added support to clean data and load specified datasets as a
pandas dataframe.
3. Updates to environment.yml for newly added packages.
2019-04-24 15:03:41 -04:00