-
Encoder input shape: (1, 224, 224, 3) in the format B, H, W, C
- B - batch size
- H - image height
- W - image width
- C - number of channels. Expected color order is BGR.
-
Decoder input shape: (1, 16, 512) in the format B, T, C
- B - batch size
- T - duration of input clip
- C - dimension of embedding
-
Encoder output shape: (1, 1, 1, 512)
representing embedding of processed frame -
Decoder output shape: (1, 9)
each row is a logits vector of performed actions.
The list of recognized actions is
LABELS = [
'Safe driving',
'Texting left',
'Texting right',
'Talking phone left',
'Talking phone right',
'Operating radio',
'Drinking eating',
'Reaching behind',
'Hair and makeup',
]
Automatically downloads the onnx and prototxt files on the first run. It is necessary to be connected to the Internet while downloading.
For the sample image,
$ python3 driver-action-recognition-adas.py
If you want to specify the input video, put the file path after the --input
option.
You can use --savepath
option to save the resulting video file.
$ python3 driver-action-recognition-adas.py --input VIDEO_PATH --savepath SAVE_VIDEO_PATH
- OpenVINO - Open Model Zoo repository - driver-action-recognition-adas-0002
- OpenVINO - driver-action-recognition-adas-0002
OpenVINO
ONNX opset=11
driver-action-recognition-adas-0002-encoder.onnx.prototxt
driver-action-recognition-adas-0002-decoder.onnx.prototxt