This repository has been archived by the owner on Jan 5, 2023. It is now read-only.
Release v1.2.0
Release Notes
- You can now use `$HOME` and `$USER` in your configuration files.
- Fixed an overflow error that would cause NMT with more than 255 tokens to fail.
- METEOR worker process is now correctly killed after validations.
- Multiple runs of an experiment are now suffixed with a unique random string instead of an incremental integer, to avoid race conditions in cluster setups.
- Replaced `utils.nn.get_network_topology()` with a new `Topology` class that parses the `direction` string of the model in a smarter way.
- If `CUDA_VISIBLE_DEVICES` is set, the `GPUManager` will always honor it.
- Dropped creation of temporary/advisory lock files under `/tmp` for GPU reservation.
- Time measurements during training are now structured into batch overhead, training and evaluation timings.
- Datasets
  - Added `TextDataset` for standalone text file reading.
  - Added `OneHotDataset`, a variant of `TextDataset` where the sequences are not prefixed/suffixed with `<bos>` and `<eos>` respectively.
  - Added experimental `MultiParallelDataset` that merges an arbitrary number of parallel datasets together.
- nmtpy translate
  - `.nodbl` and `.nounk` suffixes are now added to output files for `--avoid-double` and `--avoid-unk` arguments respectively.
  - A sufficiently model-agnostic `beam_search()` is now separated out into its own file `nmtpytorch/search.py`.
  - `max_len` default is increased to 200.
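
To illustrate the `$HOME` and `$USER` support in configuration files mentioned in these notes, here is a minimal sketch of how such placeholders could be expanded when a config is loaded. It is not the project's actual implementation: the `[train]` section, the `save_path` key and the `load_expanded()` helper are hypothetical names chosen for the example, and the expansion relies on the standard library's `string.Template`.

```python
import configparser
import os
from string import Template

# Hypothetical configuration text; section and key names are illustrative only.
EXAMPLE_CONFIG = """
[train]
save_path = $HOME/experiments/$USER/models
"""


def load_expanded(text, home=None, user=None):
    """Parse INI-style text and expand $HOME / $USER in every value.

    Assumption: only these two placeholders are substituted; any other
    $NAME reference is left untouched by safe_substitute().
    """
    home = os.environ.get("HOME", "") if home is None else home
    user = os.environ.get("USER", "") if user is None else user

    # Disable configparser's own '%' interpolation so values are read verbatim.
    parser = configparser.ConfigParser(interpolation=None)
    parser.read_string(text)

    expanded = {}
    for section in parser.sections():
        expanded[section] = {
            key: Template(value).safe_substitute(HOME=home, USER=user)
            for key, value in parser.items(section)
        }
    return expanded


if __name__ == "__main__":
    config = load_expanded(EXAMPLE_CONFIG, home="/home/alice", user="alice")
    print(config["train"]["save_path"])
```

Passing `home` and `user` explicitly keeps the helper deterministic for testing. When they are omitted, the values fall back to the corresponding environment variables.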