Transformer Attention-based Motion Prediction

In the field of 3D motion prediction, attention-based deep learning approaches have gained popularity in recent years. One recent advancement is the Spatio-Temporal (ST) transformer, an architecture that incorporates spatial and temporal information through a joint representation. The ST transformer has quickly become a state-of-the-art solution for motion prediction. In this work, we implement the ST transformer and provide qualitative and quantitative analyses of its performance across different motions, angle representations, and teacher forcing ratios.

Read the full report here.
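
Conceptually, the ST transformer decouples attention over joints within a frame (spatial) from attention over frames for each joint (temporal), operating on a shared per-joint embedding. The PyTorch sketch below illustrates that decoupled attention under our own simplifying assumptions (module names, dimensions and the residual combination are illustrative; causal masking and feed-forward layers are omitted):

import torch
import torch.nn as nn

class SpatioTemporalAttention(nn.Module):
    """Illustrative sketch of decoupled spatial/temporal self-attention
    over pose embeddings of shape (batch, frames, joints, embed_dim)."""

    def __init__(self, embed_dim=64, num_heads=4):
        super().__init__()
        self.spatial = nn.MultiheadAttention(embed_dim, num_heads, batch_first=True)
        self.temporal = nn.MultiheadAttention(embed_dim, num_heads, batch_first=True)

    def forward(self, x):
        B, T, J, D = x.shape
        # Spatial attention: within each frame, joints attend to each other.
        s = x.reshape(B * T, J, D)
        s, _ = self.spatial(s, s, s)
        # Temporal attention: each joint attends across all frames.
        t = x.transpose(1, 2).reshape(B * J, T, D)
        t, _ = self.temporal(t, t, t)
        t = t.reshape(B, J, T, D).transpose(1, 2)
        # Combine both attention summaries with a residual connection.
        return x + s.reshape(B, T, J, D) + t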

Model Architectures Used

  1. Transformer
  2. RNN
  3. Seq2Seq
  4. Tied Seq2Seq
  5. Transformer-LSTM hybrid

Sample Motions

Fairmotion from Facebook AI Research (FAIR) provides easy-to-use interfaces and tools for working with motion capture data.

[Animation: sample motion]
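
As a minimal example of the fairmotion interface, a motion capture clip can be loaded and inspected in a few lines. This uses the bvh loader, one of several loaders fairmotion provides; the file path is a placeholder:

from fairmotion.data import bvh

# Load a motion capture clip from a BVH file (path is illustrative).
motion = bvh.load(filepath="PATH/TO/motion.bvh")
print(motion.num_frames())  # number of frames in the clip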

Motion Prediction

Human motion prediction is the problem of forecasting future body poses given an observed pose sequence. It requires encoding both the spatial and temporal aspects of human motion and generating future sequences conditioned on them. We formulate it as a sequence modeling task -- the input is 120 poses (frames), i.e., 2s of motion at 60Hz, and the output is 24 poses (400ms).

[Animation: walking motion]
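
Concretely, each example is a window of 144 consecutive frames split into a 120-frame source and a 24-frame target. A minimal numpy sketch of this windowing (joint count and per-joint dimensionality are illustrative, not the dataset's exact values):

import numpy as np

SRC_LEN, TGT_LEN = 120, 24         # 2s observed, 400ms predicted at 60Hz
NUM_JOINTS, ANGLE_DIM = 24, 3      # e.g. axis-angle per joint (illustrative)

# One window of motion: (frames, joints * angle_dim) flattened pose vectors.
window = np.random.randn(SRC_LEN + TGT_LEN, NUM_JOINTS * ANGLE_DIM)
source, target = window[:SRC_LEN], window[SRC_LEN:]
print(source.shape, target.shape)  # (120, 72) (24, 72)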

Data

The AMASS DIP dataset contains 42 hours of recorded human motion data. The benchmark uses 3,304 test samples (5% of the dataset). See the Preprocess section for instructions on downloading and loading the dataset.

Metrics

We report the Mean (Euler) Angle Error of predicted poses at 80, 160, 320 and 400 ms. The Mean Angle Error is the Euclidean distance between predicted and reference Euler angles, averaged over all joints and summed over the first N frames. See metrics.euler_diff for the code that computes this metric.
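
A simplified numpy sketch of this computation (the authoritative version is metrics.euler_diff; the array shapes and names here are our assumptions):

import numpy as np

def mean_euler_angle_error(pred, ref):
    """pred, ref: (batch, frames, joints, 3) Euler angles in radians."""
    # Euclidean distance between predicted and reference angles per joint.
    dist = np.linalg.norm(pred - ref, axis=-1)  # (batch, frames, joints)
    # Average over batch and joints, then accumulate over the first N frames.
    per_frame = dist.mean(axis=(0, 2))          # (frames,)
    return np.cumsum(per_frame)                 # error summed up to frame N

# At 60Hz, 80/160/320/400 ms correspond to roughly frames 5, 10, 19 and 24.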

Preprocess

AMASS data can be downloaded from http://dip.is.tue.mpg.de/, where sequences are stored as .pkl files. We use the amass_dip loader to load the raw data into Motion objects, extract sequence windows, represent them as a list of (source, target) tuples in matrix form, and split them into training, validation and test sets. The preprocessing steps and dataset splits follow Structured Prediction Helps 3D Human Motion Modelling, Aksan et al. (source code). The following angle representations can be used -- axis-angle (aa), quaternion (quat) and rotation matrix (rotmat).

python fairmotion/tasks/motion_prediction/preprocess.py \
    --input-dir <PATH TO RAW DATA> \
    --output-dir <PREPROCESSED OUTPUT PATH> \
    --split-dir ./fairmotion/tasks/motion_prediction/data/ \
    --rep aa
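
The three --rep options encode the same rotation in different forms; a quick scipy illustration of the equivalence (independent of the fairmotion loaders):

from scipy.spatial.transform import Rotation

# One joint rotation, expressed in all three supported representations.
r = Rotation.from_rotvec([0.1, 0.2, 0.3])  # axis-angle (aa): 3 values
print(r.as_rotvec())   # aa     -> shape (3,)
print(r.as_quat())     # quat   -> shape (4,); scipy uses x-y-z-w order
print(r.as_matrix())   # rotmat -> shape (3, 3); 9 values when flattened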

Train

To build models for the motion prediction task, we provide several architectures -- RNN, seq2seq, tied seq2seq, transformer and transformer-LSTM hybrid models. The training script trains a model for the specified number of epochs, saving checkpoints every --save-model-frequency epochs along with the best-performing model on the validation set.

python fairmotion/tasks/motion_prediction/training.py \
    --save-model-path <PATH TO SAVE MODELS> \
    --preprocessed-path <PREPROCESSED DATA PATH> \
    --epochs 100
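
Because the report analyzes teacher forcing ratios, the sketch below shows how such a ratio typically controls autoregressive decoding during training. It is our own illustration, not the training script's internals; decoder stands in for any of the models above:

import random
import torch

def decode(decoder, memory, last_observed, targets, teacher_forcing_ratio=0.5):
    """last_observed: (batch, 1, dim) final source pose;
    targets: (batch, T, dim) ground-truth future poses."""
    inp, outputs = last_observed, []
    for t in range(targets.size(1)):
        pred = decoder(inp, memory)  # predict the pose at step t
        outputs.append(pred)
        # With probability teacher_forcing_ratio, feed ground truth next.
        inp = targets[:, t:t+1] if random.random() < teacher_forcing_ratio else pred
    return torch.cat(outputs, dim=1)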

Test

The test script loads the best saved model from --save-model-path and evaluates its performance on the test set. --save-output-path can be used to generate and save output sequences for a subset of the test set.

python fairmotion/tasks/motion_prediction/test.py \
    --save-model-path <PATH TO MODEL> \
    --preprocessed-path <PREPROCESSED DATA PATH> \
    --save-output-path <PATH TO SAVE PREDICTED MOTIONS>