About the released code #45

CJ416 · 2024-08-25T17:13:53Z

Hello, haohe. I really appreciate your work, and thank you for open-sourcing it.
While studying the training code, I could not find where GPT-2 is trained. In the original paper, embeddings of different modalities are fed into GPT-2, but the released code seems to use CLAP embeddings and FiLM directly to fuse the LOA. Or have I missed some details? I hope you can clear up my confusion!

haoheliu · 2024-08-28T08:27:57Z

Hi @CJ416, you can find the GPT-2 training code in this file: audioldm_train/modules/audiomae/sequence_gen/sequence_input.py

You might need to modify the YAML config file to use the GPT-2 output as the LDM condition.
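
Before editing the config, it can help to see which modules it currently instantiates. The following is a minimal sketch, not code from the repository: it assumes the YAML has already been loaded into nested dicts (for example with `yaml.safe_load`) and that modules are declared under `target` keys. The helper `iter_targets`, the sample keys, and the `example.*` class paths are all hypothetical placeholders.

```python
from typing import Any, Iterator


def iter_targets(node: Any, path: str = "") -> Iterator[tuple[str, str]]:
    """Yield (dotted_path, class_path) for every string `target` key in a nested config."""
    if isinstance(node, dict):
        for key, value in node.items():
            child = f"{path}.{key}" if path else str(key)
            if key == "target" and isinstance(value, str):
                yield child, value
            else:
                yield from iter_targets(value, child)
    elif isinstance(node, list):
        for i, value in enumerate(node):
            yield from iter_targets(value, f"{path}[{i}]")


# Hypothetical config fragment; the real key names and class paths
# must be taken from the YAML file you are actually using.
example_config = {
    "model": {
        "target": "example.LatentDiffusion",
        "params": {
            "cond_stage_config": {
                "hypothetical_cond": {
                    "target": "example.SequenceModel",
                },
            },
        },
    },
}

targets = list(iter_targets(example_config))
for path, target in targets:
    print(f"{path} -> {target}")
```

Listing the entries this way shows where a conditioning module is declared, which is the place to check when switching the LDM condition.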

CJ416 · 2024-08-28T08:36:29Z

Wow, thanks for the reply!
