go-speech

go-speech is a proof of concept I'm writing to play with neural networks and digital signal processing. Current goal is to do simple speech recognition for words like "yes" or "no".

Working parts are:

Wav recorder to generate your own corpus
detect words in Wav file
split words in 20ms samples
train network with Fast Fourrier transform data

Usage

go-speech uses dep for dependencies management:

go get github.com/golang/dep/cmd/dep

Install source code:

go get github.com/bjorand/go-speech
cd $GOPATH/src/bjorand/go-speech

Install dependencies:

dep ensure -v

Record a Wav file where your repeat the same word "hello" many times separated by short silences:

go run cmd/recorder/recorder.go -out hellos.wav

Press Ctrl-c to stop and save recording.

You can also record a separate and unique occurrence of "hello" to test our network while training runs.

go run cmd/recorder/recorder.go -out hello-test.wav

For testing purpose during training, record a new file containing a different word like "crazy":

go run cmd/recorder/recorder.go -out crazy.wav

Train neural network and save results:

go run cmd/train/train.go -word hello -in hellos.wav -test hello-test.wav -wrong crazy.wav

TODO

cmd/recorder: start recording when a word is detected.
filter signal (21 to 9000hz cover human voice)

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
cmd		cmd
docs/jupyter-notebooks		docs/jupyter-notebooks
drivers/audio		drivers/audio
speechrec		speechrec
.gitignore		.gitignore
Gopkg.lock		Gopkg.lock
Gopkg.toml		Gopkg.toml
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

go-speech

Usage

TODO

About

Releases

Packages

Languages

License

bjorand/go-speech

Folders and files

Latest commit

History

Repository files navigation

go-speech

Usage

TODO

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages