Update README.md
This commit is contained in:
Родитель
557ed16b05
Коммит
c7caf71a37
|
@ -200,7 +200,7 @@ You can find that the original input sentence is restored from the vocabulary id
|
|||
```
|
||||
```<output file>``` stores a list of vocabulary and emission log probabilities. The vocabulary id corresponds to the line number in this file.
|
||||
|
||||
## Refine special meta tokens
|
||||
## Redefine special meta tokens
|
||||
By default, SentencePiece uses Unknown (<unk>), BOS (<s>) and EOS (</s>) tokens which have the ids of 0, 1, and 2 respectively. We can redefine these mappings in training phase as follows.
|
||||
|
||||
```
|
||||
|
|
Загрузка…
Ссылка в новой задаче