Package: audio.whisper
Type: Package
Title: Transcribe Audio Files using the "Whisper" Automatic Speech Recognition Model
Version: 0.4.2
Maintainer: Jan Wijffels <jwijffels@bnosac.be>
Authors@R: c(
    person('Jan', 'Wijffels', role = c('aut', 'cre', 'cph'), email = 'jwijffels@bnosac.be', comment = "R wrapper"),
    person('David', 'Reid', role = c('ctb', 'cph'), comment = "Files in src/dr_libs folder"),
    person('Georgi', 'Gerganov', role = c('ctb', 'cph'), comment = "Files in src/whisper_cpp and files in inst folder"),
    person('BNOSAC', role = 'cph', comment = "R wrapper"))
Description: The "Whisper" models are trained for speech recognition and translation tasks.
    They can transcribe speech audio into text in the language in which it is spoken (automatic speech recognition)
    and translate it into English (speech translation).
    The package is an "Rcpp" wrapper around the standalone C++ implementation provided at <https://github.com/ggerganov/whisper.cpp>.
    There are 10 pretrained models available, differing in size and language capabilities.
    "Whisper" is explained in the paper 'Robust Speech Recognition via Large-Scale Weak Supervision' by Radford et al. (2022), available at <arXiv:2212.04356>.
License: MIT + file LICENSE
URL: https://github.com/bnosac/audio.whisper
Encoding: UTF-8
Depends:
    R (>= 2.10)
Imports:
    Rcpp (>= 0.11.5),
    utils
Suggests:
    tinytest,
    audio,
    data.table (>= 1.12.4),
    audio.vadwebrtc (>= 0.2.0)
LinkingTo: Rcpp
SystemRequirements: GNU make
RoxygenNote: 7.2.3
Remotes: bnosac/audio.vadwebrtc