Fake speech recognition

Deep learning CNN model, recognize between synthetic and human speech.

Target

In the world of the fake news we want to know if the current voice is real human or TTS (text to speech) machine. Our model is not good enough, but it's proof of concept that we can train model to recognize fake speech. The training process exists in Training folder (model weights include).

Datasets

We decided to to train the model on speech command datasets because we need the same dataset for synthetic & human voices, we found the next datasets:

So our training data based on 30 words: bed, bird, cat, dog, down, eight, five, four, go, happy, house, left, marvin, nine, no, off, on, one, right, seven, sheila, six, stop, three, tree, two, up, wow, yes, zero

We want to recognize the TTS machines of google & ibm watson, so we generated records of all the 30 words by all their english voices.

Api

We built api server in python using flask, you can find it in Api folder.

Route: /recognize
Method: POST
Param: record <URL OF SPEECH RECORD>
Return: JSON
  {
    sucess: true,
    message: 'success',
    data: {
        "result": <BOT | HUMAN>,
        "spectrogram": <SPECTROGRAM OF THE SPEECH IMAGE>
    }
  }

For currect spectrogram image url you need to add environment variable

export FLASK_APP_URL=<FULL APP URL>

Contributors

Credits

@dawidkopczyk about the article & code
[JohannesBuchner] for synthetic dataset https://www.kaggle.com/jbuchner/synthetic-speech-commands-dataset
Warden P. Speech Commands: A public dataset for single-word speech recognition, 2017. Available from http://download.tensorflow.org/data/speech_commands_v0.01.tar.gz

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
Api		Api
Training		Training
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fake speech recognition

Target

Datasets

Api

Contributors

Credits

About

Releases

Packages

Languages

yehuya/Fake-speech-recognition

Folders and files

Latest commit

History

Repository files navigation

Fake speech recognition

Target

Datasets

Api

Contributors

Credits

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages