Skip to content

Commit

Permalink
Merge pull request #100 from VProv/master
Browse files Browse the repository at this point in the history
Add note about BPE-Dropout during training
  • Loading branch information
rsennrich authored Feb 15, 2021
2 parents 234923e + fa326d4 commit 823c880
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -93,6 +93,8 @@ On top of the basic BPE implementation, this repository supports:
use the argument `--dropout 0.1` for `subword-nmt apply-bpe` to randomly drop out possible merges.
Doing this on the training corpus can improve quality of the final system; at test time, use BPE without dropout.
In order to obtain reproducible results, argument `--seed` can be used to set the random seed.

**Note:** In the original paper, the authors used BPE-Dropout on each new batch separately. You can copy the training corpus several times to get similar behavior to obtain multiple segmentations for the same sentence.

- support for glossaries:
use the argument `--glossaries` for `subword-nmt apply-bpe` to provide a list of words and/or regular expressions
Expand Down

0 comments on commit 823c880

Please sign in to comment.