Paper issues #15

RicherMans · 2024-12-20T03:02:31Z

Hey guys,
so I recently came across your paper and found some problems, that I'd like to discuss.

For nearly all results in the paper that are claimed to be "zeroshot", the authors clearly trained on that dataset, thus is not truely zeroshot. For example, this table:

Shows superiority in (ZS) evaluation against the baselines. However, CLARA's training set contains of CREMA-D, RAVDESS etc., while (some) of the baselines didn't use this "trick".
Can you clarify why it is believed that this is zero-shot performance?

What are the test datasets ? i.e., in example Table VI:
Since you have trained on so much speech data, why is there no zero-shot evaluation for MSW or even some english datasets?
Are pretrained checkpoints available? The links seem broken in the README.

Kind regards,
Heinrich

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Paper issues #15

Paper issues #15

RicherMans commented Dec 20, 2024

Paper issues #15

Paper issues #15

Comments

RicherMans commented Dec 20, 2024