
prototype: pytorch base learner #4660

Closed

Conversation

@rajan-chari (Member) commented Nov 8, 2023

Current todo

  • --passes
  • fix memory leak during densification
  • VW reporting functions
    • set_output_example_prediction(output_example_prediction)
    • set_print_update(print_update)
    • set_cleanup_example(cleanup)
  • Save prediction in example
    • ec.partial_prediction, contraction
    • ec.pred.scalar
  • batch optimizer step after gradient accumulation (see the first sketch after this list)
  • Grow the LinearLayer as the input size grows; use an initial size with a double-and-copy strategy to reduce compute (see the second sketch after this list)
  • n base_learners
  • Smoke test - unit test
  • save/load
    • model
    • feature_dict
  • Test - learning (cb)
  • Full coverage - unit test
  • Fixed random seed for reproducible results
  • Experimental
  • AdamW

  • Use example.weight in training
  • Check for test_only in learn()?
  • apply l1?
  • num_features calc
  • apply gd finalize_prediction? nan/max/min
  • interactions disabled
  • Triage:
    • set_multipredict(nullptr)
    • set_update(nullptr)
    • set_save_load(nullptr)
    • set_end_pass(nullptr)
    • set_merge_with_all(nullptr)
    • set_add_with_all(nullptr)
    • set_subtract_with_all(nullptr)
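
For the batched-optimizer-step item above, a minimal libtorch sketch of the intended pattern: accumulate gradients across a mini-batch, then take a single optimizer step. All names (model, optimizer, inputs, labels) are illustrative assumptions, not this PR's actual code.

  #include <torch/torch.h>
  #include <vector>

  // Accumulate gradients over a mini-batch of examples, then update once.
  void learn_minibatch(torch::nn::Linear& model, torch::optim::SGD& optimizer,
      const std::vector<torch::Tensor>& inputs, const std::vector<torch::Tensor>& labels)
  {
    optimizer.zero_grad();
    for (size_t i = 0; i < inputs.size(); i++)
    {
      auto loss = torch::mse_loss(model->forward(inputs[i]), labels[i]);
      loss.backward();  // backward() adds into .grad, so gradients accumulate
    }
    optimizer.step();  // one batched update for the whole mini-batch
  }

And a sketch of the double-and-copy growth strategy for the LinearLayer item: when the feature space outgrows the layer, double its input width and copy the old weights in, so the number of resizes is logarithmic in the final width rather than one per new feature. Again, names are illustrative.

  #include <torch/torch.h>

  // Return a layer wide enough for needed_in inputs, preserving trained weights.
  torch::nn::Linear grow_linear(torch::nn::Linear old_layer, int64_t needed_in)
  {
    int64_t old_in = old_layer->weight.size(1);
    if (needed_in <= old_in) { return old_layer; }
    int64_t new_in = old_in;
    while (new_in < needed_in) { new_in *= 2; }  // double rather than grow per feature
    torch::nn::Linear new_layer(new_in, old_layer->weight.size(0));
    torch::NoGradGuard no_grad;
    new_layer->weight.narrow(1, 0, old_in).copy_(old_layer->weight);  // copy old weights
    new_layer->bias.copy_(old_layer->bias);  // new columns keep their default init
    return new_layer;
  }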

Current bugs

  • Parsing -0.00000
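    (Context, as an assumption about this bug rather than a confirmed diagnosis: "-0.00000" parses to IEEE negative zero, which compares equal to 0.0f but has its sign bit set, so sign-sensitive code paths can behave oddly.)

      #include <cmath>
      #include <cstdio>
      #include <string>

      int main()
      {
        float v = std::stof("-0.00000");
        std::printf("%d %d\n", v == 0.0f, std::signbit(v));  // prints "1 1": equal to zero, yet negative
      }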

Post-MVP wish list

  • accumulate error during the mini-batch instead of gradients
  • Support arbitrary models created by TorchScript/other mechanisms
  • VW binary input format
  • statically link the libtorch DLLs
  • GPU support
  • Different optimizers (e.g., AdamW) for the simple N-layer network
  • Profile allocations and ensure no allocations in steady state
  • Reduce binary size

Debugging Notes:
When building the debug configuration on Windows with system dependencies, use the debug version of the libtorch libraries. A release build of your app is ABI-incompatible with a debug build of libtorch (and vice versa), leading to very odd errors. This is why std C++ types should not be used in library interfaces!
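
A minimal sketch of that last point (illustrative, not code from this PR): keep std types behind the exported surface and pass only C-compatible types across the DLL boundary, since MSVC's debug and release STL layouts differ.

  // Bad: std::string's layout depends on the STL build flavor, so a release
  // caller linked against a debug DLL (or vice versa) corrupts memory.
  // __declspec(dllexport) std::string get_model_name();

  // Better: plain C types have a stable ABI across debug/release builds.
  extern "C" __declspec(dllexport) const char* get_model_name();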

@rajan-chari marked this pull request as draft November 8, 2023 22:27
@lalo (Collaborator) commented Nov 14, 2023

int num_layers = 3;
int hidden_layer_size = 20;
int mini_batch_size = 10;
new_options.add(make_option("dnn", use_dnn).keep().necessary().help("Fully connected deep neural network base learner."))
Member commented:

should we mark them as experimental()?

@rajan-chari (Member Author) replied:

Yep.
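
(A sketch of the agreed change, using the experimental() option flag the comment refers to:)

new_options.add(make_option("dnn", use_dnn).keep().necessary().experimental().help("Fully connected deep neural network base learner."))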

@rajan-chari (Member Author) commented, replying to a review note:

> update https://github.com/VowpalWabbit/vowpal_wabbit/blob/master/ThirdPartyNotices.txt if you are planning on merging

Got it.

@rajan-chari reopened this Nov 25, 2023
@rajan-chari (Member Author) commented, replying to the same review note:

> update https://github.com/VowpalWabbit/vowpal_wabbit/blob/master/ThirdPartyNotices.txt if you are planning on merging

Makes sense.

@olgavrou (Collaborator) commented:

To be re-opened in the future

@olgavrou closed this Mar 14, 2024