MSRVTT results reproduction #27

NimrodShabtay · 2024-05-30T17:31:29Z

Hi,
Thank you for sharing your interesting work!
I want to try and reproduce the MSRVTT dataset.

I followed the instructions and used mistral_best.pth checkpoint, and I ran mistral_evaluation.sh

Then I ran evaluate_zero_shot.sh using GPT-4o to get the results for score and accuracy.
I got very low results (acc ~0.18) on 100K samples (didn't try the other 100K).

I wonder if you can help me to reproduce the results as reported in the paper / this repo.

Thanks in advance,
Nimrod

The text was updated successfully, but these errors were encountered:

hb-jw · 2024-07-26T13:50:30Z

Hello, I've also been replicating related benchmarks recently, and these benchmarks are mostly based on GPT-assistant, which seems quite costly. I'd like to ask, approximately how much does each of your evaluations cost?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MSRVTT results reproduction #27

MSRVTT results reproduction #27

NimrodShabtay commented May 30, 2024

hb-jw commented Jul 26, 2024

MSRVTT results reproduction #27

MSRVTT results reproduction #27

Comments

NimrodShabtay commented May 30, 2024

hb-jw commented Jul 26, 2024