
Improve EncoderDecoderModel docs #16135

Open
patrickvonplaten opened this issue Mar 14, 2022 · 19 comments · May be fixed by #34323

@patrickvonplaten
Contributor

First good issue

There have been quite a few issues/questions about how to use the Encoder-Decoder model, e.g. #4483 and #15479. The main reason for this is that the model docs are quite outdated, and we could use a nice how-to guide.

So I think we have two action items here:

  1. Improve https://huggingface.co/docs/transformers/v4.17.0/en/model_doc/encoder-decoder#encoder-decoder-models a.k.a.: https://github.com/huggingface/transformers/blob/master/docs/source/model_doc/encoder-decoder.mdx

We should mention here:
a) How to create a model? We should show how to use from_encoder_decoder_pretrained(...) and then how to save the model.
b) How to fine-tune this model? We should mention that this model can then be fine-tuned just like any other encoder-decoder model (Bart, T5, ...).
c) Put a big warning that the config values have to be correctly set, and show how to set them; e.g. read #15479.

This should be EncoderDecoderModel-specific text and be very concise and short; something along the lines of the sketch below could work.
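
A minimal sketch, covering (a), (b), and (c) in one go; the BERT checkpoints and the toy summarization strings are placeholders, not prescribed choices:

```python
from transformers import BertTokenizer, EncoderDecoderModel

# (a) Warm-start a seq2seq model from two pretrained checkpoints;
# "bert-base-uncased" on both sides is just one possible pairing.
model = EncoderDecoderModel.from_encoder_decoder_pretrained(
    "bert-base-uncased", "bert-base-uncased"
)
tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")

# (c) These config values are NOT set automatically; training and
# generation error out or misbehave without them.
model.config.decoder_start_token_id = tokenizer.cls_token_id
model.config.pad_token_id = tokenizer.pad_token_id

# (a) Save and reload like any other model.
model.save_pretrained("bert2bert")
model = EncoderDecoderModel.from_pretrained("bert2bert")

# (b) Fine-tune exactly like any other encoder-decoder model:
# passing `labels` returns the language-modeling loss.
inputs = tokenizer("a long source document to summarize", return_tensors="pt")
labels = tokenizer("a short summary", return_tensors="pt").input_ids
loss = model(
    input_ids=inputs.input_ids,
    attention_mask=inputs.attention_mask,
    labels=labels,
).loss
loss.backward()
```

The only EncoderDecoderModel-specific step is setting the token-id config values by hand; everything after that is the standard seq2seq training loop.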

In a second step, we should then write a how-to guide that includes many more details.

More than happy to help someone tackle this first good issue!

@patrickvonplaten changed the title from "EncoderDecoderModel docs" to "Improve EncoderDecoderModel docs" on Mar 14, 2022
@silvererudite
Contributor

Hi! I would love to contribute to this.

@patrickvonplaten
Contributor Author

Awesome! Would you like to open a PR and give it a try? :-) I think it would be great if we could put some example code on how to create an EncoderDecoderModel in this model doc: https://github.com/huggingface/transformers/blob/master/docs/source/model_doc/encoder-decoder.mdx which will then be displayed here: https://huggingface.co/docs/transformers/v4.17.0/en/model_doc/encoder-decoder#encoder-decoder-models :-)

Let me know if you have any questions! Happy to help :-)

@silvererudite
Contributor

Yes, definitely! I'll open a PR shortly and ask for help when I'm stuck. Thanks a lot!

@Threepointone4
Contributor

Hi @patrickvonplaten, I would love to contribute to this.

@Threepointone4
Contributor

@patrickvonplaten , I have created the fork and added some docs.

So I think we have two action items here:

  1. Improve https://huggingface.co/docs/transformers/v4.17.0/en/model_doc/encoder-decoder#encoder-decoder-models a.k.a.: https://github.com/huggingface/transformers/blob/master/docs/source/model_doc/encoder-decoder.mdx

We should mention here:
a) How to create a model? We should show how to use from_encoder_decoder_pretrained(...) and then how to save the model.
b) How to fine-tune this model? We should mention that this model can then be fine-tuned just like any other encoder-decoder model (Bart, T5, ...).

I have added some documentation; let me know what you think about it.

c) Put a big warning that the config values have to be correctly set, and show how to set them; e.g. read #15479.

I didn't get a chance to go through this; I will try to cover it this week.
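
From a first look, I think it boils down to something like this rough sketch (the BERT checkpoints are an assumption on my side; #15479 hit this at generation time):

```python
from transformers import BertTokenizer, EncoderDecoderModel

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = EncoderDecoderModel.from_encoder_decoder_pretrained(
    "bert-base-uncased", "bert-base-uncased"
)

# Without these, generate() cannot know which token starts decoding,
# which one pads a batch, and which one ends a sequence.
model.config.decoder_start_token_id = tokenizer.cls_token_id
model.config.pad_token_id = tokenizer.pad_token_id
model.config.eos_token_id = tokenizer.sep_token_id

inputs = tokenizer("some input text", return_tensors="pt")
outputs = model.generate(inputs.input_ids, attention_mask=inputs.attention_mask)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

(Before fine-tuning, the generated text is of course nonsense; the point is only that generation runs once the config is set.)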

In a second step, we should then write a how-to guide that includes many more details.

I have added a Colab notebook with a detailed explanation of the encoder-decoder model and how to train it. Does that help with this?

@patrickvonplaten
Contributor Author

Hey @Threepointone4, that's great!

Could you maybe open a PR for:

We should mention here:
a) How to create a model? We should show how to use from_encoder_decoder_pretrained(...) and then how to save the model.
b) How to fine-tune this model? We should mention that this model can then be fine-tuned just like any other encoder-decoder model (Bart, T5, ...).

? :-)

@Threepointone4
Contributor

@patrickvonplaten I have created the PR and made the changes based on my understanding. Please let me know if further changes are required.

@Winterflower

Hello all, I'm very much a beginner in this space, so please excuse the potentially stupid question. I have been experimenting with rolling out my own encoder-decoder combinations for use with the VisionEncoderDecoder class, as described in the docs here:

The VisionEncoderDecoderModel can be used to initialize an image-to-text model with any pretrained Transformer-based vision model as the encoder ...

but I keep running into this error message:

The attention mask and the pad token id were not set. As a consequence, you may observe unexpected behavior. Please pass your input's attention_mask to obtain reliable results.
Setting pad_token_id to eos_token_id:2 for open-end generation.

Based on reading the docs, I am not entirely sure whether I need to specifically fine-tune an encoder-decoder combination on the image-to-text downstream task (and the error message above is due to that), or whether I can just use pre-trained configurations without fine-tuning.
Perhaps I could open a PR with some docs suggestions?
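
For reference, this is roughly my setup with the token ids set explicitly; ViT + GPT-2 is just the pairing I happened to try, and setting pad_token_id by hand does silence the warning:

```python
import torch
from transformers import AutoTokenizer, VisionEncoderDecoderModel

# Any pretrained vision encoder + text decoder pairing should work;
# ViT + GPT-2 is simply the combination I experimented with.
model = VisionEncoderDecoderModel.from_encoder_decoder_pretrained(
    "google/vit-base-patch16-224-in21k", "gpt2"
)
tokenizer = AutoTokenizer.from_pretrained("gpt2")

# GPT-2 defines no pad token, hence the warning; pointing pad_token_id
# at eos_token_id (the fallback the warning mentions) makes it explicit.
model.config.decoder_start_token_id = tokenizer.bos_token_id
model.config.pad_token_id = tokenizer.eos_token_id

pixel_values = torch.randn(1, 3, 224, 224)  # stand-in for a preprocessed image
generated_ids = model.generate(pixel_values)
print(tokenizer.decode(generated_ids[0], skip_special_tokens=True))
```

Even with the warning gone the output is gibberish, which makes me suspect the randomly initialized cross-attention weights are the real reason fine-tuning on the downstream task is needed.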

@patrickvonplaten
Contributor Author

Hey @Winterflower,

Could you please try to use the forum instead for such questions: https://discuss.huggingface.co/ ? :-) Thank you!

@anishlukk123

Is this issue still open? I would like to take it up and help solve this problem.

@ghost

ghost commented May 19, 2023

I would like to contribute

@SHUBHAPRIYA95

Hi, if this issue is still open I would love to contribute.

@rajveer43
Contributor

Hi @patrickvonplaten, is this still open? I want to work on it!

@patrickvonplaten
Contributor Author

Sure, maybe you can browse https://huggingface.co/docs/transformers/v4.31.0/en/model_doc/encoder-decoder#overview and check if there is anything we can improve.

@rajveer43
Contributor

Thanks, will check.

@riiyaa24
Contributor

Hello, I would like to contribute to this issue. Could you assign it to me?

@mhdirnjbr

Hello @patrickvonplaten!

This is my very first time contributing to an open source project, inspired by my participation in the Hugging Face event in Paris and the insightful conversations I had with the project maintainers.

As a final-year graduate student in Math and AI, I am eager to explore opportunities to collaborate on this issue. I would greatly appreciate it if you could provide more information on how I can get involved.

Thank you in advance.

@lappemic

It feels like this issue was already addressed by PR #17815; should it be closed?

@Ryukijano
Contributor

I would love to contribute to this!

Ryukijano added a commit to Ryukijano/transformers that referenced this issue Oct 22, 2024
Fixes huggingface#16135

Improve the `EncoderDecoderModel` documentation.

* Add example code to create an `EncoderDecoderModel` using `from_encoder_decoder_pretrained`.
* Add instructions on how to save the model.
* Add instructions on how to fine-tune the model.
* Add a warning about correctly setting configuration values.

---

For more details, open the [Copilot Workspace session](https://copilot-workspace.githubnext.com/huggingface/transformers/issues/16135?shareId=XXXX-XXXX-XXXX-XXXX).
@Ryukijano linked a pull request on Oct 22, 2024 that will close this issue