feat: allow tree_r_last to be built on gpu #1138

cryptonemo · 2020-06-01T20:30:27Z

feat: attempt to improve gpu tree_c layer retrieval

cryptonemo · 2020-06-01T20:31:20Z

Work in progress. Still uses CPU compat mode for tree builders. Todo: verify poseidon standard/strengthened on tests. Param generation bump required?

porcuquine · 2020-06-01T20:51:37Z

Yes, this will require bumping the parameter version.

cryptonemo · 2020-06-02T16:58:29Z

@porcuquine Can you confirm if poseidon constraints (tests) should have to be updated with this change (updated neptune, etc)?

porcuquine · 2020-06-02T17:17:00Z

Yes, and they should go down.

cryptonemo · 2020-06-03T19:06:45Z

Note: This comes out of draft status when neptune v1 is released.

dignifiedquire · 2020-06-04T15:50:41Z

storage-proofs/porep/src/stacked/vanilla/proof.rs

+                    let mut layer_data = Vec::with_capacity(layers);
+                    for _ in 0..layers {
+                        layer_data.push(Vec::new());
+                    }


could be written as

let mut layer_data = vec![vec![]; layers];

the inner Vec should probably be allocated with with_capacity if you can determine their expected length

The inner vec is entirely replaced

dignifiedquire · 2020-06-04T15:54:34Z

storage-proofs/porep/src/stacked/vanilla/proof.rs

+                                layer_data[k - 1] = fr_elements;
+                            }
+                        });
+                    });


could be

for layer_elements in layer_data.iter_mut() { // ... layer_elements.extend(elements.into_iter().map(Into::into)); }

which should avoid some allocations

I'll take a look at this, thanks

I see what you're saying now (along with the above comment). Thanks!

dignifiedquire · 2020-06-04T15:56:59Z

storage-proofs/porep/src/stacked/vanilla/proof.rs

-                        )?;
+                                for fr in fr_elements {
+                                    buf.extend(fr_into_bytes(&fr));
+                                }


let buf: Vec<u8> = fr_elements.iter().flat_map(fr_into_bytes).collect();

I think I want to avoid this. This looks eerily like the construction that was originally in place that bottlenecked hard.

dignifiedquire

some small code improvements, but overall looks good to me

porcuquine · 2020-06-04T16:21:46Z

storage-proofs/porep/src/stacked/vanilla/proof.rs


-                        let flat_tree_data: Vec<_> = tree_data
+                                for fr in fr_elements {
+                                    buf.extend(fr_into_bytes(&fr));


This will still allocate a buffer into which to write the conversion. I think you should add a new version of this function which writes directly into buf.

For this instance (of tree data), it's taking about a half second on a 32GiB run. The base data flattening is now taking about 2-3 seconds.

The buf being extended has been pre-allocated above that. I'm not sure if you're saying to avoid that allocation, or the one that extend would do had it not been pre-allocated.

I was talking about the allocation inside fr_into_bytes.

👍 Got it, thanks. Given the overall speed improvements, we can wait on that a bit since this is no longer bottlenecking.

feat: attempt to improve gpu tree_c layer retrieval fix: bump parameter version fix: properly delete tree c in the split config case fix: update tests to match new (lower) constraints

feat: update debug logging

feat: add adjustable column_write_batch_size setting

cryptonemo mentioned this pull request Jun 2, 2020

Include layer index before node when creating label preimage. #1139

Merged

cryptonemo force-pushed the gpu-tree-r branch from f0c687c to bfab2c1 Compare June 2, 2020 12:19

cryptonemo added the cryptocomputelab CryptoComputeLab work label Jun 2, 2020

cryptonemo force-pushed the gpu-tree-r branch 6 times, most recently from aa19d48 to 758e7d2 Compare June 3, 2020 18:19

cryptonemo force-pushed the gpu-tree-r branch 2 times, most recently from 986ca9f to 5900d31 Compare June 4, 2020 12:31

cryptonemo marked this pull request as ready for review June 4, 2020 14:05

cryptonemo requested review from dignifiedquire and porcuquine as code owners June 4, 2020 14:05

dignifiedquire reviewed Jun 4, 2020

View reviewed changes

dignifiedquire previously approved these changes Jun 4, 2020

View reviewed changes

porcuquine reviewed Jun 4, 2020

View reviewed changes

cryptonemo dismissed dignifiedquire’s stale review via 014213e June 4, 2020 20:10

cryptonemo force-pushed the gpu-tree-r branch from 97c016d to 014213e Compare June 4, 2020 20:10

cryptonemo added 4 commits June 5, 2020 15:37

feat: allow tree_r_last to be built on gpu

d09683c

feat: attempt to improve gpu tree_c layer retrieval fix: bump parameter version fix: properly delete tree c in the split config case fix: update tests to match new (lower) constraints

feat: sanity check all seal input path types

fbe019d

feat: enable GPU tree builder

5b1e93f

feat: update debug logging

fix: update test constraint

37f324d

cryptonemo added 3 commits June 5, 2020 15:37

feat: improve persisting tree_c performance

b7a1c1a

feat: add adjustable column_write_batch_size setting

fix: point to proper neptune version 1.0.0 crate

6457314

fix: re-factor and apply review feedback

b4d827f

cryptonemo force-pushed the gpu-tree-r branch from 014213e to b4d827f Compare June 5, 2020 19:47

cryptonemo requested a review from porcuquine June 5, 2020 19:57

porcuquine approved these changes Jun 5, 2020

View reviewed changes

cryptonemo merged commit 573d24f into master Jun 5, 2020

cryptonemo deleted the gpu-tree-r branch June 8, 2020 14:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: allow tree_r_last to be built on gpu #1138

feat: allow tree_r_last to be built on gpu #1138

cryptonemo commented Jun 1, 2020

cryptonemo commented Jun 1, 2020

porcuquine commented Jun 1, 2020

cryptonemo commented Jun 2, 2020

porcuquine commented Jun 2, 2020

cryptonemo commented Jun 3, 2020

dignifiedquire Jun 4, 2020

dignifiedquire Jun 4, 2020

cryptonemo Jun 4, 2020

dignifiedquire Jun 4, 2020

cryptonemo Jun 4, 2020

cryptonemo Jun 4, 2020

dignifiedquire Jun 4, 2020

cryptonemo Jun 4, 2020

dignifiedquire left a comment

porcuquine Jun 4, 2020

cryptonemo Jun 4, 2020

cryptonemo Jun 4, 2020

porcuquine Jun 4, 2020

cryptonemo Jun 4, 2020

feat: allow tree_r_last to be built on gpu #1138

feat: allow tree_r_last to be built on gpu #1138

Conversation

cryptonemo commented Jun 1, 2020

cryptonemo commented Jun 1, 2020

porcuquine commented Jun 1, 2020

cryptonemo commented Jun 2, 2020

porcuquine commented Jun 2, 2020

cryptonemo commented Jun 3, 2020

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

dignifiedquire left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment