
Delta neural networks inference #2129

Closed
casper2002casper opened this issue Nov 30, 2022 · 1 comment


casper2002casper commented Nov 30, 2022

Describe the potential feature

In scenarios where only part of the input changes between evaluations, delta neural networks can speed up inference by updating only the nodes affected by the change. This principle has, for example, revolutionized chess engines (https://www.chessprogramming.org/NNUE) and has recently been implemented for CNNs (https://github.com/facebookresearch/DeltaCNN).
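
As a rough illustration (not taken from the issue or either linked project), here is a minimal NumPy sketch of the NNUE-style idea: when a single input feature flips, the first layer's pre-activation can be updated with one weight column instead of a full matrix-vector product. All names and sizes below are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
in_dim, hidden_dim = 768, 256                      # illustrative sizes only
W1 = rng.standard_normal((hidden_dim, in_dim))
b1 = rng.standard_normal(hidden_dim)

x = rng.integers(0, 2, in_dim).astype(np.float64)  # e.g. a binary board encoding
acc = W1 @ x + b1                                  # one full first-layer evaluation

# One input feature flips; update the accumulator with a single weight column.
i = 42
delta = 1.0 - 2.0 * x[i]    # +1 if the feature turns on, -1 if it turns off
x[i] += delta
acc += delta * W1[:, i]     # O(hidden_dim) work instead of O(hidden_dim * in_dim)

assert np.allclose(acc, W1 @ x + b1)
```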

Motivation

Could offer speedups in situations where the input is only partially updated between evaluations, such as chess engines or combinatorial search methods.
Recent progress in CUDA sparse matrices makes this interesting to evaluate.

Possible Implementation

Store hidden-layer data from the previous evaluation
Calculate the input-layer delta against the previous evaluation and store it in a sparse matrix
Update the hidden-layer data via sparse layer propagation (a sketch of these steps follows below)

https://github.com/facebookresearch/DeltaCNN
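
A minimal Python/NumPy sketch of the three steps above for a small dense MLP, under stated assumptions: the `DeltaMLP` name is hypothetical, and indexing the affected weight columns stands in for the CUDA sparse matrix-vector product a real implementation would use.

```python
import numpy as np

class DeltaMLP:
    """Hypothetical two-layer MLP that caches per-layer state so a partially
    changed input can be re-evaluated by propagating only the delta."""

    def __init__(self, W1, b1, W2, b2):
        self.W1, self.b1, self.W2, self.b2 = W1, b1, W2, b2
        self.x = None    # previous input
        self.z1 = None   # cached hidden-layer pre-activation (step 1: stored state)

    def full_forward(self, x):
        self.x = x.copy()
        self.z1 = self.W1 @ x + self.b1
        return self.W2 @ np.maximum(self.z1, 0) + self.b2

    def delta_forward(self, x_new):
        # Step 2: input-layer delta against the previous evaluation; its
        # nonzero support plays the role of the sparse matrix.
        d = x_new - self.x
        nz = np.flatnonzero(d)
        # Step 3: propagate the delta through the linear layer by touching
        # only the affected columns of W1 (equivalent to a sparse mat-vec).
        self.z1 = self.z1 + self.W1[:, nz] @ d[nz]
        self.x = x_new.copy()
        # The nonlinearity breaks exact sparsity, so the (small) remaining
        # layers are recomputed from the updated cached pre-activation.
        return self.W2 @ np.maximum(self.z1, 0) + self.b2

# Usage: the delta path matches a full re-evaluation when only two inputs change.
rng = np.random.default_rng(0)
net = DeltaMLP(rng.standard_normal((128, 512)), rng.standard_normal(128),
               rng.standard_normal((1, 128)), rng.standard_normal(1))
x = rng.standard_normal(512)
net.full_forward(x)
x2 = x.copy()
x2[[3, 7]] += 1.0
assert np.allclose(net.delta_forward(x2), net.full_forward(x2))
```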

casper2002casper (Author) commented

Might be better as a separate module.
