Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

MSRVTT results reproduction #27

Open
NimrodShabtay opened this issue May 30, 2024 · 1 comment
Open

MSRVTT results reproduction #27

NimrodShabtay opened this issue May 30, 2024 · 1 comment

Comments

@NimrodShabtay
Copy link

Hi,
Thank you for sharing your interesting work!
I want to try and reproduce the MSRVTT dataset.

I followed the instructions and used mistral_best.pth checkpoint, and I ran mistral_evaluation.sh

Then I ran evaluate_zero_shot.sh using GPT-4o to get the results for score and accuracy.
I got very low results (acc ~0.18) on 100K samples (didn't try the other 100K).

I wonder if you can help me to reproduce the results as reported in the paper / this repo.

Thanks in advance,
Nimrod

@hb-jw
Copy link

hb-jw commented Jul 26, 2024

Hello, I've also been replicating related benchmarks recently, and these benchmarks are mostly based on GPT-assistant, which seems quite costly. I'd like to ask, approximately how much does each of your evaluations cost?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants