Skip to content

Latest commit

 

History

History
46 lines (38 loc) · 1.73 KB

README.md

File metadata and controls

46 lines (38 loc) · 1.73 KB

A Deep Network for Arousal-Valence Emotion Prediction with Acoustic-Visual Cues

This repository contains the source codes for our submissions to OMG Emotion Challenge 2018. Method descriptions can be found here.

Team Member: Songyou Peng, Le Zhang, Yutong Ban, Meng Fang, Stefan Winkler

Requirements

Preprocessing

Every video should be pre-processed as follows:

  • Extract frames and apply MTCNN to align faces
  • Extract WAV files and calculate STFT

Citation

If you use the code (only for research), please consider citing our paper:

@inproceedings{peng2018omg,
 author =  {Peng, Songyou and Zhang, Le and Ban, Yutong and Fang, Meng and Winkler, Stefan},
 title = {{A Deep Network for Arousal-Valence Emotion Prediction with Acoustic-Visual Cues}},
 year = {2018},
 booktitle = {arxiv},
}

Contact Songyou Peng ✉️ for questions, comments and reporting bugs.