Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

when I transcribe japanese video, sometime whole script file repeat same dialoge #67

Open
kiron111 opened this issue Mar 25, 2023 · 8 comments

Comments

@kiron111
Copy link

for example:
1
00:00:00,000 --> 00:00:02,000
I'm not sure if I'm going to be able to get through this.

2
00:00:02,000 --> 00:00:04,000
I'm not sure if I'm going to be able to get through this.

3
00:00:04,000 --> 00:00:06,000
I'm not sure if I'm going to be able to get through this.

4
00:00:06,000 --> 00:00:08,000
I'm not sure if I'm going to be able to get through this.

5
00:00:08,000 --> 00:00:10,000
I'm not sure if I'm going to be able to get through this.

6
00:00:10,000 --> 00:00:12,000
I'm not sure if I'm going to be able to get through this.

Sometime, it happen at the beginning, sometime at the end (maybe last for 10 minutes), sometime the whole file repeat same dialogue. (no matter I use medium or large model)

When I use the whisperer, using the same model, everything is okay.

So, I think maybe the optimization is over, skipping some important step.

@tigros
Copy link

tigros commented Mar 25, 2023

that's probably the --max-context 0 parameter which i saw somewhere and added it to whisperer.

@kiron111
Copy link
Author

I'm looking forwards for the next update 👍

@emcodem
Copy link

emcodem commented Mar 29, 2023

#26

@Taounit
Copy link

Taounit commented Apr 9, 2023

After making some tests I am pretty confident the problem comes from the translation module (Google API ?) and not whisper, as the repeat patterns occur in translation mode and stop when the transcription-only mode is used.

@albino1
Copy link

albino1 commented Apr 9, 2023

Unfortunately, this happens all the time without any translation at all. Check this github, the main whisper.cpp one, and even the official OpenAI one, there's probably a hundred posts about it across them.

@jiyuguan
Copy link

I’m writing this comment to report a problem with the whisper app. I found that when it transcribes voice to text, it repeats the same sentence until the end of the video. It only transcribes the first part of the voice accurately. I tried to cut the part that was not transcribed properly, and it worked. But then the same problem happened again in the second half of the edited video. I hope you can fix this issue soon.

@Highlander1536
Copy link

It can get really bad with live audio translating, anyone know a way to lower the repeating at least?

@emcodem
Copy link

emcodem commented Apr 27, 2023

@Highlander1536 sure, i have limited it with a dirty hack: #26 - what i do is detect if repeated text has been decoded and if yes, reset the context history.
Also, to repeat above stuff: -mc 0 seems to always solve it (but lowers quality, maybe --prompt can help here?)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

7 participants