GitHub - mint-lab/3dv_tutorial: An Invitation to 3D Vision: A Tutorial for Everyone

An Invitation to 3D Vision: A Tutorial for Everyone

An Invitation to 3D Vision is an introductory tutorial on 3D computer vision (a.k.a. geometric vision or visual geometry or multiple-view geometry). It aims to make beginners understand basic theories on 3D vision and implement its applications using OpenCV. In addition to tutorial slides, example codes are provided in the purpose of education. They include simple but interesting and practical applications. The example codes are written as short as possible (mostly less than 100 lines) to be clear and easy to understand.

To clone this repository (codes and slides): git clone https://github.com/mint-lab/3dv_tutorial.git
To fork this repository to your Github: Click here
To download codes and slides as a ZIP file: Click here
📝 How to run example codes in Python
📝 How to run example codes in C++

What does its name come from?

The main title, An Invitation to 3D Vision, came from a legendary book by Yi Ma, Stefano Soatto, Jana Kosecka, and Shankar S. Sastry. We wish that our tutorial will be the first gentle invitation card for beginners to 3D vision and its applications.
The subtitle, for everyone, was inspired from Prof. Kim's online lecture (in Korean). Our tutorial is also intended not only for students and researchers in academia, but also for hobbyists and developers in industries. We tried to describe important and typical problems and their solutions in OpenCV. We hope readers understand it easily without serious mathematical background.

Lecture Slides

Example Codes

Section 1. Introduction [slides]
Section 2. Single-view Geometry [slides]
- Getting Started with 2D
  - 3D rotation conversion [python]
- Pinhole Camera Model
  - Object localization [python] [cpp]
  - Image formation [python] [cpp]
- Geometric Distortion Models
  - Geometric distortion visualization [python]
  - Geometric distortion correction [python] [cpp] [result video]
- Camera Calibration
  - Camera calibration [python] [cpp]
- Absolute Camera Pose Estimation (a.k.a. perspective-n-point; PnP)
  - Pose estimation (chessboard) [python] [cpp] [result video]
  - Pose estimation (book) [python] [cpp]
  - Pose estimation (book) with camera calibration [python] [cpp]
  - Pose estimation (book) with camera calibration without initial $K$ [python] [cpp] [result video]
Section 3. Two-view Geometry [slides]
- Planar Homography
  - Perspective distortion correction [python] [cpp]
  - Planar image stitching [python] [cpp]
  - 2D video stabilization [python] [cpp] [result video]
- Epipolar Geometry
  - Epipolar line visualization [python]
- Relative Camera Pose Estimation
  - Fundamental matrix estimation [python]
  - Monocular visual odometry (epipolar version) [python] [cpp] [result video]
- Triangulation
  - Triangulation [python] [cpp]
Section 4. Solving Problems [slides]
- Solving Linear Equations in 3D Vision
  - Affine transformation estimation [python]
  - Planar homography estimation [python]
    - Appendix) Image warping using homography [python]
  - Fundamental matrix estimation [python]
  - Triangulation [python]
- Solving Nonlinear Equations in 3D Vision
  - Absolute camera pose estimation [python]
  - Camera calibration [python]
Section 5. Finding Correspondence [slides]
- Feature Points and Descriptors
  - Harris corner [python]
  - SuperPoint [Github]
- Feature Matching and Tracking
  - Feature matching comparison [python]
  - Feature tracking with KLT tracker [python]
- Outlier Rejection
  - Line fitting with RANSAC [python] [cpp]
  - Line fitting with M-estimator [python] [cpp]
  - Planar homography estimation with RANSAC [python]
Section 6. Multiple-view Geometry [slides]
- Bundle Adjustment [python] [cpp]
- Structure-from-Motion (SfM)
  - Global SfM [python] [cpp]
  - Incremental SfM [python] [cpp]
Section 7. Visual SLAM and Odometry [slides]

License

Beerware

Authors

Acknowledgement

The authors thank the following contributors and projects.

Jae-Yeong Lee: He motivated many examples.
Giseop Kim: He contributed the initial version of SfM codes based on Toy-SfM and cvsba.
Richard Blais: His book cover and video in the OpenCV tutorial were used to demonstrate camera pose estimation and augmented reality.
Russell Hewett: His two hill images were used to demonstrate image stitching.
Kang Li: His shaking CCTV video was used to demonstrate video stabilization.
The KITTI Vision Benchmark Suite: The KITTI odometry dataset #07 was used to demonstrate visual odometry and SLAM.

Name		Name	Last commit message	Last commit date
Latest commit History 98 Commits
data		data
examples		examples
slides		slides
.gitignore		.gitignore
CMakeLists.txt		CMakeLists.txt
HOWTO_RUN_CPP.md		HOWTO_RUN_CPP.md
HOWTO_RUN_PYTHON.md		HOWTO_RUN_PYTHON.md
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

An Invitation to 3D Vision: A Tutorial for Everyone

What does its name come from?

Lecture Slides

Example Codes

License

Authors

Acknowledgement

About

Releases 1

Packages

Contributors 2

Languages

License

mint-lab/3dv_tutorial

Folders and files

Latest commit

History

Repository files navigation

An Invitation to 3D Vision: A Tutorial for Everyone

What does its name come from?

Lecture Slides

Example Codes

License

Authors

Acknowledgement

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 1

Packages 0

Contributors 2

Languages

Packages