Skip to content

adriandavidauer/VVAD

Repository files navigation

VVAD-LRS3

Library to provide models trained on the VVAD-LRS3 Dataset. The library also contains preprocessing pipelines. Applications are Speaker detection in scenarios, where multiple people are in the robot's field of view and stare detection for proactive approaches.

Prerequisites

vvadlrs3 depends on dlib which needs build tools to be installed over pip. Here is described what is needed.

On Ubuntu, you need to install the following:

sudo apt-get install build-essential cmake libopenblas-dev liblapack-dev libx11-dev libgtk-3-dev

Install

pip install vvadlrs3

Data

The models are trained on the VVAD-LRS3 Dataset


Some samples visualized. Samples with green borders are positive samples, samples with red borders are negative samples