Concurrent loop through each mini-batch during training #178

Merged
3 commits merged from concurrent-training into main on Jul 18, 2024

Conversation

@rouson (Contributor) commented on Jul 18, 2024

This PR lays the groundwork for parallelizing or offloading a significant part of the training algorithm to a GPU.

This commit allocates a pair_cost array for each mini-batch so that each input/output pair's contribution to the cost-function sum can be stored in a separate element rather than kept as a sequential running tally as the loop through the mini-batch progresses. This change lays the groundwork for making the loop concurrent, which in turn lays the foundation for using a Fortran 2023 concurrent reduction. The calculation is currently redundant with a running sum so that a match between the two can be verified in an assertion. The redundancy and the assertion will be removed in a future commit.
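
For illustration, here is a minimal sketch of the pattern described above (hypothetical names, not the repository's actual code): each pair's cost lands in its own `pair_cost` element, a redundant running tally is kept so an assertion can verify the two agree, and a comment shows the intended Fortran 2023 `do concurrent` reduction once the redundancy is removed.

```fortran
program pair_cost_sketch
  implicit none
  integer, parameter :: mini_batch_size = 4
  real :: pair_cost(mini_batch_size), running_sum, cost
  integer :: pair

  running_sum = 0.
  do pair = 1, mini_batch_size
    pair_cost(pair) = cost_of(pair)              ! each pair's contribution stored separately
    running_sum = running_sum + pair_cost(pair)  ! redundant tally retained for the assertion
  end do
  cost = sum(pair_cost)

  ! assertion: the two calculations must agree (to be removed in a future commit)
  if (abs(cost - running_sum) > 10*epsilon(cost)) error stop "cost mismatch"

  ! the eventual Fortran 2023 form, once the redundancy is removed:
  ! do concurrent (pair = 1:mini_batch_size) reduce(+:cost)
  !   cost = cost + cost_of(pair)
  ! end do

  print *, "mini-batch cost =", cost
contains
  pure function cost_of(pair) result(c)
    ! stand-in for evaluating one input/output pair's cost contribution
    integer, intent(in) :: pair
    real :: c
    c = 1./real(pair)  ! placeholder value
  end function
end program pair_cost_sketch
```

Because every iteration writes to a distinct `pair_cost` element, the loop body has no loop-carried dependence other than the sum itself, which is exactly what a `reduce(+:...)` locality specifier handles.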
@rouson merged commit 5425151 into main on Jul 18, 2024
6 checks passed
@rouson deleted the concurrent-training branch on July 18, 2024 at 05:47