For Wolfram Summer School 2023, I worked on estimating 3D human pose, in the form of skeletons and 3D meshes, from video and images using neural networks.
This project aims to leverage computer vision techniques to accurately estimate the 3D positions of 17 keypoints on the human body and to visualize the estimated pose in a 3D display.
This project implemented a two-step approach to 3D human pose estimation. The CenterNet model from the Wolfram Neural Net Repository was used to estimate the 2D coordinates of 17 keypoints on the human body, and the MiDaS monocular depth estimation model, imported via ONNX, supplied per-pixel depth information. Combining the 2D coordinates with the estimated depths established the body-joint linkages and allowed the pose to be visualized in 3D. The pipeline was tested on a variety of single-person images and videos, and it produced results and visualizations comparable to state-of-the-art methods.
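To make the flow concrete, here is a minimal Wolfram Language sketch of the pipeline. The NetModel name, the MiDaS ONNX file name, the 256×256 input size, and the assumption that the pose net directly returns 17 {x, y} pixel coordinates are illustrative placeholders rather than the project's exact code; in practice the CenterNet output may need heat-map decoding (as in the repository's example code), and the imported net's input shape and output format may differ from what is assumed here.

```wolfram
(* Sketch under the assumptions stated above; not the project's exact code *)
img = Import["person.jpg"];          (* hypothetical single-person image *)
{w, h} = ImageDimensions[img];

(* Step 1: 2D keypoints from the CenterNet pose model *)
poseNet = NetModel["CenterNet Pose Estimation Nets Trained on MS-COCO Data"];
keypoints2D = poseNet[img];          (* assumed shape: {17, 2}, pixel coordinates *)

(* Step 2: relative depth from MiDaS, imported via ONNX *)
midas = Import["midas_v21_small.onnx"];
depthMap = midas[NetEncoder[{"Image", {256, 256}}][img]];  (* assumed 256x256 matrix *)

(* Assign each keypoint the depth at its rescaled pixel position;
   assumes the keypoints share the depth map's top-left origin *)
depthAt[{x_, y_}] := depthMap[[
   Clip[Round[256 y/h], {1, 256}],
   Clip[Round[256 x/w], {1, 256}]]];
keypoints3D = Append[#, depthAt[#]] & /@ keypoints2D;

(* Standard COCO 17-keypoint skeleton, as 1-indexed joint pairs *)
links = {{16, 14}, {14, 12}, {17, 15}, {15, 13}, {12, 13}, {6, 12},
   {7, 13}, {6, 7}, {6, 8}, {7, 9}, {8, 10}, {9, 11}, {2, 3}, {1, 2},
   {1, 3}, {2, 4}, {3, 5}, {4, 6}, {5, 7}};

(* Step 3: visualize joints and linkages as a 3D skeleton *)
Graphics3D[
 {PointSize[Large], Red, Point[keypoints3D],
  Thick, Blue, Line[keypoints3D[[#]] & /@ links]},
 Boxed -> False]
```

In this sketch the relative depth is used directly as the z coordinate; that is a deliberate simplification that is enough to separate the joints in depth for visualization, even though the values are not metric.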
Future directions for this project include:
- Multi-person support: Extend the pipeline to estimate 3D keypoints for multiple people in an image while keeping each person separated as a distinct skeleton.
- Occlusion handling: Explore an occlusion-robust model capable of estimating keypoints even when body parts are partially hidden.
- Video optimization: Investigate ways to optimize the 3D estimation pipeline for video data.
- Mesh regression: Regress a standard human mesh from the estimated 3D keypoints.
Here are some interesting results on images.
More details can be found in this Wolfram Community post.