Georgia Tech Structure from Motion (GTSFM) Library

Platform	Build Status
Ubuntu 20.04.3

What is GTSFM?

GTSFM is an end-to-end structure-from-motion (SfM) pipeline based on GTSAM. GTSFM was designed from the ground up to natively support parallel computation using Dask.

License

The majority of our code is governed by an MIT license and is suitable for commercial use. However, certain implementations featured in our repo (SuperPoint, SuperGlue) are governed by a non-commercial license and may not be used commercially.

Installation

GTSFM requires no compilation, as Python wheels are provided for GTSAM.

To install GTSFM, first create a conda environment.

Linux

On Linux, with CUDA support:

conda env create -f environment_linux.yml
conda activate gtsfm-v1 # you may need "source activate gtsfm-v1" depending upon your bash and conda set-up

The Python 3.8 gtsam wheel for Linux is available here.

Mac

On Mac OSX, there is no CUDA support, so run:

conda env create -f environment_mac.yml
conda activate gtsfm-v1

Download the Python 3.8 gtsam wheel for Mac here, and install it with:

pip install ~/Downloads/gtsam-4.1.1-py3-none-any.whl

Completing Installation

Now, install gtsfm as a module:

pip install -e .

Verify that python -c "import gtsfm; import gtsam; print('hello world')" runs without errors, and you are good to go!
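If the one-liner above fails, it can help to see which of the two packages is missing. Here is a minimal sketch using only the standard library. The helper name `missing_modules` is ours and is not part of GTSFM. Note that `find_spec` only locates a package and does not import it, so a broken install could still pass this check.

```python
import importlib.util


def missing_modules(names):
    """Return the top-level module names that cannot be located for import."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # The two packages imported by the sanity check above.
    missing = missing_modules(["gtsfm", "gtsam"])
    if missing:
        print("Not importable in this environment: " + ", ".join(missing))
    else:
        print("gtsfm and gtsam can both be located")
```

Run it inside the activated conda environment so that it inspects the same interpreter you will use for reconstruction.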

Usage Guide (Running 3d Reconstruction)

Before running reconstruction, if you intend to use modules with pre-trained weights, such as SuperPoint, SuperGlue, or PatchmatchNet, please first run:

./download_model_weights.sh

To run SfM on a dataset that consists only of an image directory with EXIF data, where the image file names end in "jpg", please create the following file structure:

└── {DATASET_NAME}
       ├── images
               ├── image1.jpg
               ├── image2.jpg
               ├── image3.jpg

and run

python gtsfm/runner/run_scene_optimizer_olssonloader.py --config_name {CONFIG_NAME} --dataset_root {DATASET_ROOT} --image_extension jpg --num_workers {NUM_WORKERS}
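Before launching a run, you may want to confirm that your dataset matches the layout above. The following is a rough sketch, not part of GTSFM: the function name `list_dataset_images` is hypothetical, and it only checks for an `images` subdirectory and files with the given extension.

```python
from pathlib import Path


def list_dataset_images(dataset_root, image_extension="jpg"):
    """Return sorted image paths under <dataset_root>/images with the given extension."""
    images_dir = Path(dataset_root) / "images"
    if not images_dir.is_dir():
        raise FileNotFoundError(f"expected an 'images' directory at {images_dir}")
    # The pattern is case-sensitive on most Linux file systems,
    # so "image1.JPG" would not match an extension of "jpg".
    return sorted(images_dir.glob(f"*.{image_extension}"))
```

An empty result suggests that the extension passed here (and to --image_extension) does not match the files on disk.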

For example, if you had 4 cores available and wanted to use the Deep Front-End (recommended), you should run:

python gtsfm/runner/run_scene_optimizer_olssonloader.py --dataset_root /path/to/{DATASET_NAME} --image_extension jpg --config_name deep_front_end.yaml --num_workers 4

(or however many workers you desire).
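If you launch runs from a script, the command above can be assembled programmatically. This is only a sketch: `build_run_command` is a hypothetical helper, it uses just the flags shown in this README, and defaulting the worker count to the number of CPU cores is our assumption rather than a project recommendation.

```python
import os
import shlex


def build_run_command(dataset_root, config_name="deep_front_end.yaml",
                      image_extension="jpg", num_workers=None):
    """Assemble the argument list for the Olsson-loader runner shown above."""
    if num_workers is None:
        # os.cpu_count() can return None on some platforms; fall back to 1.
        num_workers = os.cpu_count() or 1
    return [
        "python",
        "gtsfm/runner/run_scene_optimizer_olssonloader.py",
        "--dataset_root", str(dataset_root),
        "--image_extension", image_extension,
        "--config_name", config_name,
        "--num_workers", str(num_workers),
    ]


if __name__ == "__main__":
    cmd = build_run_command("/path/to/dataset", num_workers=4)
    # shlex.join (Python 3.8+) renders the list as a copy-pasteable shell line.
    print(shlex.join(cmd))
```

The returned list can be passed to `subprocess.run(cmd, check=True)` from the repository root, which avoids shell-quoting problems with paths that contain spaces.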

Currently, we require EXIF data embedded in your images. Alternatively, you can provide ground-truth intrinsics in the expected format for an Olsson dataset, COLMAP-exported text data, etc.

If you would like to compare GTSFM output with COLMAP output, please run:

python gtsfm/runner/run_scene_optimizer_colmap_loader.py --config_name {CONFIG_NAME} --images_dir {IMAGES_DIR} --colmap_files_dirpath {COLMAP_FILES_DIRPATH} --image_extension jpg --num_workers {NUM_WORKERS} --max_frame_lookahead {MAX_FRAME_LOOKAHEAD}

where COLMAP_FILES_DIRPATH is a directory where .txt files such as cameras.txt, images.txt, etc. have been saved.
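A quick pre-flight check of that directory might look like the sketch below. It is not part of GTSFM: `missing_colmap_files` is a hypothetical helper, and it checks only for the two files named above, even though the loader may expect other exported files as well.

```python
from pathlib import Path

# Only the two files named in this README; other exports may also be required.
EXPECTED_FILES = ("cameras.txt", "images.txt")


def missing_colmap_files(colmap_files_dirpath, expected=EXPECTED_FILES):
    """Return the expected file names that are absent from the directory."""
    directory = Path(colmap_files_dirpath)
    return [name for name in expected if not (directory / name).is_file()]
```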

To visualize the result using Open3D, run:

python gtsfm/visualization/view_scene.py --rendering_library open3d --point_rendering_mode point

For users who work with the same dataset repeatedly, we provide functionality to cache front-end results, which makes subsequent GTSFM runs much faster. For more information, please refer to gtsfm/frontend/cacher/README.md.

Repository Structure

GTSFM is designed in an extremely modular way. Each module can be swapped out with a new one, as long as it implements the API of the module's abstract base class. The code is organized as follows:

gtsfm: source code, organized as:
- averaging
  - rotation: rotation averaging implementations (Shonan, Chordal, etc)
  - translation: translation averaging implementations (1d-SFM, etc)
- bundle: bundle adjustment implementations
- common: basic classes used throughout GTSFM, such as Keypoints, Image, SfmTrack2d, etc
- data_association: 3d point triangulation (DLT) w/ or w/o RANSAC, from 2d point-tracks
- densify
- frontend: SfM front-end code, including:
  - detector: keypoint detector implementations (DoG, etc)
  - descriptor: feature descriptor implementations (SIFT, SuperPoint, etc)
  - matcher: descriptor matching implementations (SuperGlue, etc)
  - verifier: 2d-correspondence verifier implementations (Degensac, OA-Net, etc)
  - cacher: cache implementations for different stages of the front-end
- loader: image data loaders
- utils: utility functions, such as serialization routines and pose comparisons
tests: unit tests on every function and module
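The swap-in design described above can be illustrated with a toy example. Everything in this sketch is hypothetical: the class names, the `detect` signature, and the list-of-lists image format are simplifications chosen for illustration and do not reproduce GTSFM's actual base classes.

```python
import abc
from typing import List, Tuple

Keypoint = Tuple[float, float]  # (x, y) pixel coordinates; simplified stand-in


class ToyDetectorBase(abc.ABC):
    """Hypothetical abstract base class; not GTSFM's actual interface."""

    @abc.abstractmethod
    def detect(self, image: List[List[float]]) -> List[Keypoint]:
        """Return keypoints for a grayscale image given as rows of intensities."""


class BrightestPixelDetector(ToyDetectorBase):
    """Toy implementation: reports the single brightest pixel as a keypoint."""

    def detect(self, image):
        best = max(
            ((value, x, y) for y, row in enumerate(image) for x, value in enumerate(row)),
            key=lambda item: item[0],
        )
        return [(float(best[1]), float(best[2]))]


class GridDetector(ToyDetectorBase):
    """Toy implementation: reports every `step`-th pixel on a regular grid."""

    def __init__(self, step: int = 2):
        self.step = step

    def detect(self, image):
        return [
            (float(x), float(y))
            for y in range(0, len(image), self.step)
            for x in range(0, len(image[0]), self.step)
        ]


def count_keypoints(detector: ToyDetectorBase, image) -> int:
    """Caller depends only on the base-class API, so detectors are interchangeable."""
    return len(detector.detect(image))
```

Because `count_keypoints` is written against the base class only, either detector can be passed in without changing the calling code, which is the property that makes modules swappable.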

Contributing

Contributions are always welcome! Please read our contribution guidelines (CONTRIBUTING.md) before submitting changes to this project.

Citing this work

Open-source Python implementation:

@misc{GTSFM,
    author = {Ayush Baid and Fan Jiang and Akshay Krishnan and John Lambert and Aditya Singh and
       Aishwarya Venkataramanan and Sushmita Warrier and Jing Wu and Xiaolong Wu and Frank Dellaert},
    title = { {GTSFM}: Georgia Tech Structure from Motion},
    howpublished={\url{https://github.com/borglab/gtsfm}},
    year = {2021}
}

Note: authors are listed in alphabetical order.

Compiling Additional Verifiers

On Linux, we have made pycolmap's LORANSAC available on PyPI. However, on Mac, pycolmap must be built from scratch. See the instructions here.
