[Feature Request] Decoder-only Marian models #989

alvations · 2023-04-27T10:42:37Z

Feature description

Given the GPT / PALM / BLOOM popularity, having marian benchmarks for decoder-only models would be good.

It is not that I think it will give better ChrF (maybe it will) but comparing marian-nmt models to other libraries models makes it a little tough to explain about "hyperparameters equivalence" and quirks in layer implementation.

Example

I'vent seen anyone coding from scratch in C++, but OpenNMT's CTranslate went the "converter" route https://github.com/OpenNMT/CTranslate2/blob/master/python/ctranslate2/converters/transformers.py

alvations added the enhancement label Apr 27, 2023

alvations changed the title ~~Decoder-only Marian models~~ [Feature Request] Decoder-only Marian models Apr 27, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature Request] Decoder-only Marian models #989

[Feature Request] Decoder-only Marian models #989

alvations commented Apr 27, 2023 •

edited

Loading

[Feature Request] Decoder-only Marian models #989

[Feature Request] Decoder-only Marian models #989

Comments

alvations commented Apr 27, 2023 • edited Loading

Feature description

Example

alvations commented Apr 27, 2023 •

edited

Loading