Vector3dvector and other vector Eigen bindings speed up #657

yxlao · 2018-11-02T10:22:03Z

Improves speed of open3d.Vector3dVector, Vector3iVector, Vector2iVector, Matrix4dVector by 40-200x, resolving issue #403 and pybind/pybind11#1481.

Special thanks to Wenzel's feedback. According to Wenzel, the slowness is due to "casting millions of small vectors, which requires a proportional amount of Python API calls".

Comparison

# in the build dir, inside virtualenv
➜  pip install pytest
➜  make install-pip-package -j && pytest ../src/UnitTest -s

Before

open3d.Vector3dVector: (2000000, 3)
open3d -> numpy: 1.005225s
numpy -> open3d: 0.000018s

open3d.Vector3iVector: (2000000, 3)
open3d -> numpy: 0.983502s
numpy -> open3d: 0.001925s

open3d.Vector2iVector: (2000000, 3)
open3d -> numpy: 0.944884s
numpy -> open3d: 0.000886s

After

open3d.Vector3dVector: (2000000, 3)
open3d -> numpy: 0.024798s
numpy -> open3d: 0.000017s

open3d.Vector3iVector: (2000000, 3)
open3d -> numpy: 0.010451s
numpy -> open3d: 0.001906s

open3d.Vector2iVector: (2000000, 3)
open3d -> numpy: 0.005553s
numpy -> open3d: 0.000865s

Discussions

This solution is not ideal yet:

We pay the copy penalty. The way to avoid this is to replace the underlying storage of say vector<Eigen::Vector3d> to one blob of buffer. However, this requires significant rework of the code base.
2) We pay the penalty accessing numpy array index individually. We can do more aggressive optimizations (e.g. more direct memory mapping) if we we handle double and int types separately (i.e. handle Vector3dVector and Vector3iVector separately) instead of using a generic function. assert that the incoming array is contiguous. Preliminary tests shows about 20% - 30% speed up.
Edit:2) is addressed in the improved direct mapping approach

Future works

Some of the templated functions can be further merged. Please let me know if you have suggestions.

This change is

qianyizh · 2018-11-02T16:02:52Z

This tries to address #403

yxlao · 2018-11-02T23:34:27Z

Update: squeezed in another optimization ed122ac with direct memory mapping, 30% more speed up:

2e6 points:

open3d.Vector3dVector: (2000000, 3)
open3d -> numpy: 0.017295s
numpy -> open3d: 0.000016s

open3d.Vector3iVector: (2000000, 3)
open3d -> numpy: 0.009439s
numpy -> open3d: 0.002394s

open3d.Vector2iVector: (2000000, 3)
open3d -> numpy: 0.004427s
numpy -> open3d: 0.001198s

2e5 points (as used in #403):

open3d.Vector3dVector: (200000, 3)
open3d -> numpy: 0.001392s
numpy -> open3d: 0.000009s

open3d.Vector3iVector: (200000, 3)
open3d -> numpy: 0.000263s
numpy -> open3d: 0.000012s

open3d.Vector2iVector: (200000, 3)
open3d -> numpy: 0.000198s
numpy -> open3d: 0.000013s

syncle

Reviewed 2 of 4 files at r1, 2 of 2 files at r2.
Reviewable status: complete! all files reviewed, all discussions resolved (waiting on @takanokage and @qianyizh)

syncle

This is great PR. It is good to have unit test for this as well. As a sanity check, could you also check other tutorial examples that uses VectorXXVectors?

Reviewable status: complete! all files reviewed, all discussions resolved (waiting on @takanokage and @qianyizh)

yxlao

@syncle For vector and matrices, we have

Vector3dVector
Vector3iVector
Vector2iVector
Matrix4dVector

The first 3 has been optimized. The fourth Matrix4dVector could be optimized in the same way, however, it is only used for converting camera parameters, so performance shouldn't be an issue for now.

Reviewable status: complete! all files reviewed, all discussions resolved (waiting on @takanokage and @qianyizh)

yxlao added 2 commits November 2, 2018 03:00

add python eigen unit tests and benchmarks

c793c99

custom constructor to speed up vector of eigen

d14f468

yxlao requested review from takanokage, syncle and qianyizh November 2, 2018 10:22

qianyizh mentioned this pull request Nov 2, 2018

Vector3dVector function slow #403

Closed

yxlao added 2 commits November 2, 2018 13:47

remove speed up for eigen matrix

6c83086

direct mapping and py::array::c_style

ed122ac

yxlao force-pushed the vector3dvector branch from 7e53157 to ed122ac Compare November 2, 2018 23:31

syncle approved these changes Nov 3, 2018

View reviewed changes

yxlao commented Nov 3, 2018

View reviewed changes

qianyizh merged commit 8d90623 into isl-org:master Nov 5, 2018

yxlao deleted the vector3dvector branch November 5, 2018 17:55

This was referenced Nov 7, 2018

Visual 3d point directly from numpy array #664

Closed

Speed issue with binding from Numpy array to std::vector<Eigen::Vector3d>. pybind/pybind11#1481

Closed

yxlao mentioned this pull request Jul 19, 2019

open3d.vector3dvector() is very slow #1045

Closed

andyjmwang mentioned this pull request Aug 14, 2019

Speed issue for function with Numpy array inputs pybind/pybind11#1880

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Vector3dvector and other vector Eigen bindings speed up #657

Vector3dvector and other vector Eigen bindings speed up #657

yxlao commented Nov 2, 2018 •

edited

Loading

qianyizh commented Nov 2, 2018

yxlao commented Nov 2, 2018 •

edited

Loading

syncle left a comment

syncle left a comment

yxlao left a comment

Vector3dvector and other vector Eigen bindings speed up #657

Vector3dvector and other vector Eigen bindings speed up #657

Conversation

yxlao commented Nov 2, 2018 • edited Loading

Comparison

Discussions

Future works

qianyizh commented Nov 2, 2018

yxlao commented Nov 2, 2018 • edited Loading

syncle left a comment

Choose a reason for hiding this comment

syncle left a comment

Choose a reason for hiding this comment

yxlao left a comment

Choose a reason for hiding this comment

yxlao commented Nov 2, 2018 •

edited

Loading

yxlao commented Nov 2, 2018 •

edited

Loading