Speech Emotion Recognition using Adversarial auto-encoders

For low-level acoustic features, Authors extract a set of 1582 features using the openSMILE toolkit. The set consists of an assembly of spectral prosody and energy based features. Authors use five folder cross validation scheme, but this implementation is used one leave speaker cross validation scheme for speaker-independent manner.

Datasets

Interactive Emotional Dyadic Motion Capture (IEMOCAP) database is required to run this code.

Dependencies

openSMILE for low-level acoustic features extraction
Tensorflow for Adversarial Auto-encoders
scikit-learn for classification and performance evaluation

References

S. Sahu, R. Gupta, G. Sivaraman and C. Espy-Wilson, "Adversarial Auto-encoders for Speech Based Emotion Recognition," Interspeech, 2017.

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
datasets/00		datasets/00
exp/result		exp/result
model		model
README.md		README.md
main.py		main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Speech Emotion Recognition using Adversarial auto-encoders

Datasets

Dependencies

References

About

Releases

Packages

Languages

eesungkim/Speech_Emotion_Recognition_AAE

Folders and files

Latest commit

History

Repository files navigation

Speech Emotion Recognition using Adversarial auto-encoders

Datasets

Dependencies

References

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages