fix: ASR hallucination on ending silence #115

JacobLinCool · 2024-12-26T05:26:27Z

work around on resolving #78

This pull request includes several changes to improve audio processing and transcription functionalities. The key changes involve trimming audio data and cleaning up transcription results.

Audio processing improvements:

src/lib/components/session/ParticipantView.svelte: Modified the float32ArrayToWav function call to remove the last 8000 samples (0.5 seconds) from the audio data before converting it to WAV format. This change was made in two places within the file. [1] [2]

Transcription cleanup:

src/lib/stt/gemini.ts: Updated the transcription function to remove unwanted characters (嗶, …) from the end of the transcription result using a regular expression.

Copilot reviewed 1 out of 2 changed files in this pull request and generated no comments.

Files not reviewed (1)

src/lib/components/session/ParticipantView.svelte: Language not supported

fix: ASR hallucination on ending silence

a3873df

JacobLinCool self-assigned this Dec 26, 2024

Copilot bot review requested due to automatic review settings December 26, 2024 05:26

Copilot AI reviewed Dec 26, 2024

View reviewed changes

JacobLinCool merged commit d25be06 into main Dec 26, 2024
4 checks passed

JacobLinCool deleted the fix-asr-bbb branch December 26, 2024 05:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: ASR hallucination on ending silence #115

fix: ASR hallucination on ending silence #115

JacobLinCool commented Dec 26, 2024

fix: ASR hallucination on ending silence #115

fix: ASR hallucination on ending silence #115

Conversation

JacobLinCool commented Dec 26, 2024

Choose a reason for hiding this comment