diff --git a/utils_nlp/dataset/README.md b/utils_nlp/dataset/README.md index bf05557..bfbb040 100644 --- a/utils_nlp/dataset/README.md +++ b/utils_nlp/dataset/README.md @@ -21,7 +21,7 @@ df = load_pandas_df(DATA_FOLDER, file_split ="train", nrows = 1000) |[The STSbenchmark dataset](http://ixa2.si.ehu.es/stswiki/index.php/STSbenchmark)|[stsbenchmark.py](./stsbenchmark.py)| |[The Stanford Question Answering Dataset (SQuAD)](https://rajpurkar.github.io/SQuAD-explorer/)|[squad.py](./squad.py)| |[CNN/Daily Mail(CNN/DM) Dataset](https://github.com/harvardnlp/sent-summary)|[cnndm.py](./cnndm.py)| -|[Preprocessed CNN/Daily Mail(CNN/DM) Dataset by (https://arxiv.org/pdf/1903.10318.pdf)](https://github.com/nlpyang/BertSum)|[cnndm.py](./cnndm.py)| +|[Preprocessed CNN/Daily Mail(CNN/DM) Dataset for Extractive Summarization](https://github.com/nlpyang/BertSum)|[cnndm.py](./cnndm.py)| ## Dataset References Please see [Dataset References](../../DatasetReferences.md) for notice and information regarding datasets used.