Add support for word-level audio transcription timestamp granularity #733

agcom · 2024-05-05T13:40:23Z

Describe the change
Add support for word-level audio transcription timestamp granularity.

Provide OpenAI documentation link

Describe your solution
Added AudioRequest.TimestampGranularities and AudioResponse.Words fields.
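For readers who want to see the response shape, here is a minimal sketch of decoding word-level timestamps. It uses only the standard library; the `Word` and `Transcription` types and the `ParseTranscription` helper are hypothetical stand-ins, not the library's actual `AudioResponse`, and the sample payload is made up. Only the JSON keys (`text`, `words`, `word`, `start`, `end`) follow the OpenAI `verbose_json` format.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// Word mirrors one entry of the "words" array in a verbose_json transcription.
// The type name is hypothetical; only the JSON keys come from the OpenAI API.
type Word struct {
	Word  string  `json:"word"`
	Start float64 `json:"start"` // seconds from the start of the audio
	End   float64 `json:"end"`   // seconds from the start of the audio
}

// Transcription is a hypothetical, trimmed-down stand-in for AudioResponse.
type Transcription struct {
	Text  string `json:"text"`
	Words []Word `json:"words"`
}

// ParseTranscription decodes a verbose_json transcription body.
func ParseTranscription(body []byte) (Transcription, error) {
	var tr Transcription
	err := json.Unmarshal(body, &tr)
	return tr, err
}

func main() {
	// Made-up sample payload, for illustration only.
	body := []byte(`{"text":"hello world","words":[` +
		`{"word":"hello","start":0.0,"end":0.4},` +
		`{"word":"world","start":0.5,"end":0.9}]}`)

	tr, err := ParseTranscription(body)
	if err != nil {
		panic(err)
	}
	for _, w := range tr.Words {
		fmt.Printf("%-6s %.2fs - %.2fs\n", w.Word, w.Start, w.End)
	}
}
```

The `words` array is only returned when the request asks for the `word` granularity and uses the `verbose_json` response format.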

Tests
Filled AudioRequest.TimestampGranularities field in the existing tests' audio requests.
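As a rough illustration of what such a check can look like, the sketch below builds a multipart form with one `timestamp_granularities[]` field per requested granularity and reads it back. It relies only on the standard library; `BuildGranularityForm` and `ReadGranularities` are hypothetical helpers, and the library's own form builder and tests may be structured differently.

```go
package main

import (
	"bytes"
	"fmt"
	"mime/multipart"
)

// BuildGranularityForm writes one "timestamp_granularities[]" field per
// requested granularity, plus the verbose_json response format they require.
// It returns the encoded body and the multipart boundary.
func BuildGranularityForm(granularities []string) (*bytes.Buffer, string, error) {
	var buf bytes.Buffer
	w := multipart.NewWriter(&buf)
	if err := w.WriteField("response_format", "verbose_json"); err != nil {
		return nil, "", err
	}
	for _, g := range granularities {
		if err := w.WriteField("timestamp_granularities[]", g); err != nil {
			return nil, "", err
		}
	}
	// Close writes the trailing boundary; the body is incomplete without it.
	if err := w.Close(); err != nil {
		return nil, "", err
	}
	return &buf, w.Boundary(), nil
}

// ReadGranularities parses the form back and returns the repeated field's
// values in the order they were written.
func ReadGranularities(body *bytes.Buffer, boundary string) ([]string, error) {
	form, err := multipart.NewReader(body, boundary).ReadForm(1 << 20)
	if err != nil {
		return nil, err
	}
	defer form.RemoveAll()
	return form.Value["timestamp_granularities[]"], nil
}

func main() {
	body, boundary, err := BuildGranularityForm([]string{"word", "segment"})
	if err != nil {
		panic(err)
	}
	values, err := ReadGranularities(body, boundary)
	if err != nil {
		panic(err)
	}
	fmt.Println(values)
}
```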

Additional context

Related to the two-months-abandoned "feat(whisper): add request parameter TimeStampGranularities and respo…" #673, the self-closed "Addded Word-Level Timestamp Granularity Support" #680, and the self-closed "Add support for word-level and segment-level timestamp granularities" #674.
Also removed some obsolete comments on AudioRequest fields.

codecov · 2024-05-05T13:41:24Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 98.68%. Comparing base (774fc9d) to head (456ceba).
Report is 10 commits behind head on master.

Additional details and impacted files

@@            Coverage Diff             @@
##           master     #733      +/-   ##
==========================================
+ Coverage   98.46%   98.68%   +0.22%     
==========================================
  Files          24       24              
  Lines        1364     1140     -224     
==========================================
- Hits         1343     1125     -218     
+ Misses         15        9       -6     
  Partials        6        6


agcom · 2024-05-05T13:42:18Z

I would have proposed separating transcription and translation request/response types, but that would have been a breaking change.

sashabaranov · 2024-05-06T10:05:20Z

Thank you for working on this!

sashabaranov · 2024-05-06T10:07:35Z

@agcom the PR looks good to me in its current state!

> I would have proposed separating transcription and translation request/response types, but that would have been a breaking change.

We can discuss it in a separate PR; I'm also totally open to looking at a sketch of what this might look like 🙌🏻

agcom · 2024-05-06T13:35:14Z

@sashabaranov alright then, it's ready for review. I just wanted to test it on our development server before marking it ready (I did, and it works fine).

> We can discuss it in a separate PR; I'm also totally open to looking at a sketch of what this might look like 🙌🏻

May god bless me with more code refactoring tasks so that I can work on this 😄.

agcom added 2 commits May 5, 2024 16:47

Add support for audio transcription timestamp_granularities word

54414fd

Fixup multiple timestamp granularities

456ceba

agcom marked this pull request as ready for review May 6, 2024 13:31

sashabaranov approved these changes May 7, 2024

sashabaranov merged commit 3334a9c into sashabaranov:master May 7, 2024
3 checks passed
