Features/179 repeat #674
Conversation
…to avoid balancing
GPU cluster tests are currently disabled on this Pull Request.
Codecov Report
@@ Coverage Diff @@
## master #674 +/- ##
==========================================
+ Coverage 97.47% 97.53% +0.05%
==========================================
Files 87 87
Lines 17231 17643 +412
==========================================
+ Hits 16796 17208 +412
Misses 435 435
Continue to review full report at Codecov.
looks pretty good. good job!
Thank you!
ok to test
very very minor changes and only a couple questions. good work!
@coquelin77 Thank you!
Description
Implementation of the function repeat() based on np.repeat(). For process-local operations, I use torch.repeat_interleave.
Docs numpy: https://numpy.org/doc/stable/reference/generated/numpy.repeat.html
Docs pytorch: https://pytorch.org/docs/stable/generated/torch.repeat_interleave.html
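For reference, a minimal toy comparison of the two functions linked above (values chosen purely for illustration):

```python
import numpy as np
import torch

a = np.array([[1, 2], [3, 4]])
t = torch.tensor(a)

# numpy reference behaviour: without an axis, the input is flattened first
np.repeat(a, 2)               # array([1, 1, 2, 2, 3, 3, 4, 4])
np.repeat(a, [1, 2], axis=0)  # row 0 once, row 1 twice

# torch counterpart used for the process-local work
torch.repeat_interleave(t, 2)                            # tensor([1, 1, 2, 2, 3, 3, 4, 4])
torch.repeat_interleave(t, torch.tensor([1, 2]), dim=0)  # same row repetition as above
```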
Strategy
In the following, only the distributed case (<=> a.split != None) is explained in more detail, as the algorithm otherwise mainly results in a direct call of the torch function.

axis is None
Implies a.split = 0, as the (total) result would otherwise be in the wrong order. To ensure the correct distribution of repeats and syntactical consistency with numpy (repeats must be 1-dimensional), repeats has to be reshaped to the global shape of a, redistributed along axis 0 and flattened afterwards. The last step is necessary for compatibility with torch (1-dimensionality required, as in numpy).
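A rough single-process sketch of this preprocessing in plain torch; the actual reshape and redistribution of repeats happens through heat's communication layer in the PR, and the values here are made up:

```python
import torch

# hypothetical global data: a has shape (4, 2) and is split along axis 0;
# repeats has one entry per element of a and is reshaped to a's global shape
# so that it can be redistributed the same way as a
a_global = torch.arange(8).reshape(4, 2)
repeats_global = torch.tensor([2, 1, 1, 3, 1, 1, 2, 1]).reshape(4, 2)

# process-local chunks after redistribution (here: rows 0-1 on "process 0")
local_a = a_global[:2]
local_repeats = repeats_global[:2].flatten()  # flattened again: torch wants 1-d repeats

local_result = torch.repeat_interleave(local_a, local_repeats)
# tensor([0, 0, 1, 2, 3, 3, 3]) -- the local piece of the global result
```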
axis is not None
Depending on whether a.split == axis, repeats has to be either split along axis 0 (case True) or gathered on all processes (case False). The resulting redistribution of data is needed to ensure correct local, and therefore global, results. A toy illustration of both cases is sketched below.

Issue/s resolved: #179
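As a toy illustration of the two cases above, using plain torch on the hypothetical process-local chunks (shapes and values are made up):

```python
import torch

# a has global shape (4, 2) and is split along axis 0 over two processes
local_a_p0 = torch.tensor([[1, 2], [3, 4]])  # rows 0-1 on process 0
local_a_p1 = torch.tensor([[5, 6], [7, 8]])  # rows 2-3 on process 1

# case a.split == axis (== 0): repeats is split along axis 0 as well,
# so each process only works with the entries matching its local rows
row_repeats = torch.tensor([1, 2, 1, 3])  # one entry per global row
out_p0 = torch.repeat_interleave(local_a_p0, row_repeats[:2], dim=0)
out_p1 = torch.repeat_interleave(local_a_p1, row_repeats[2:], dim=0)

# case a.split != axis (e.g. axis == 1): every process needs the full
# repeats vector, since each holds all columns of its local rows
col_repeats = torch.tensor([2, 1])  # one entry per column
out_p0 = torch.repeat_interleave(local_a_p0, col_repeats, dim=1)
```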
Changes proposed:
manipulations.repeat()
Type of change
Due Diligence
Does this change modify the behaviour of other functions? If so, which?
no