This repository has been archived by the owner on Aug 3, 2021. It is now read-only.

OpenSeq2Seq v0.4

Pre-release

Pre-release

Kipok released this 29 Jun 21:06

· 1063 commits to master since this release

New models:

ConvS2S model for translation.
Wav2Letter model for speech recognition.
CIFAR-10 dataset support.
CNNEncoder that can be used to construct arbitrary (almost) CNN models. Based on that, integrated AlexNet and cifar10-nv.

New features:

Support for "iter_size" (accumulating gradients for "iter_size" steps without update).
"objects" benchmarking to evaluation and inference modes.
cuDNN compatible cells support for GNMT.
8-padding for transformer.
Improved config overwriting by train/eval/infer params (will not replace whole dicts, but update incrementally).
Audio normalization before preprocessing for speech2text models.
More summaries/parameters for different models.

Bug fixes:

Regularization in mixed precision mode (loss scaling was not applied, disabling regularizer).
Overwriting bool values from command line.
Multi-GPU evaluation in towers mode.
Multi-GPU inference for speech2text.
"reflect" padding changed to use zeros for audio preprocessing.
Unicode support for Python 2.

Important config/API changes:

Unified static/dynamic loss scaling into a single parameter.
Made RNN cells accept arbitrary parameters.
Exposed training step into maybe_print_logs and evaluate functions.

Other changes:

Improved unit tests and documentation.

Assets 2