
Features/372 data layout #423

Merged
merged 45 commits into master from features/372-data_layout on Dec 10, 2019
Conversation

@ClaudiaComito (Contributor) commented Dec 2, 2019

Description

Addresses #372. By default, the memory layout of torch tensors is row-major (C-style): the last dimension varies fastest, so in 2 dimensions the data are stored row by row. Some algorithms may run faster on a column-major (Fortran-style) memory layout, where the first dimension varies fastest, i.e. the data are stored column by column in 2 dimensions.
In NumPy, the user can define the memory layout of an array via the keyword argument order:

https://docs.scipy.org/doc/numpy/reference/generated/numpy.array.html

This PR introduces the keyword argument order for heat factories, and with it the possibility of a column-major memory layout for heat tensors. Currently the default is order="C", with the option of specifying order="F". Because the heat factories use torch.clone() internally, and clone() does not preserve the memory layout of the tensor for the time being, I'm leaving the implementation of the order="K" option for later; this will probably be fixed in one of the next PyTorch releases.
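
As a quick illustration of the two layouts (the calls below use the standard numpy API; the stride values in the comments are the expected ones for a 3 x 4 float64 array, not captured output):

    import numpy as np

    a = np.zeros((3, 4), order="C")   # row-major (C-style)
    b = np.zeros((3, 4), order="F")   # column-major (Fortran-style)
    print(a.strides)                  # (32, 8): moving along axis 0 steps over a whole row (4 * 8 bytes)
    print(b.strides)                  # (8, 24): moving along axis 0 steps over a single element (8 bytes)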

I'm also introducing the DNDarray properties stride (torch-like) and strides (numpy-like, output in bytes).
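
On the heat side, the analogous call would look roughly like this (a sketch based on the description above; the default float32 dtype and the exact spelling of the stride/strides properties are assumptions):

    import heat as ht

    x = ht.zeros((3, 4), order="F")   # column-major layout via the new kwarg
    print(x.stride)                   # torch-like, in elements: expected (1, 3)
    print(x.strides)                  # numpy-like, in bytes: expected (4, 12) for float32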

Fixes: #372, #403, #404

Changes proposed:

  • new function sanitize_memory_layout() in module memory (a minimal sketch of the idea follows this list)
  • new kwarg order for all factories potentially giving rise to tensors with ndim > 1
  • new DNDarray properties stride (torch-like output) and strides (numpy-like output, in bytes)
  • new function assertTrue_memory_layout() in module basic_test
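
For reference, a minimal sketch of the idea behind the column-major conversion at the torch level (an illustration only, not necessarily how sanitize_memory_layout() is implemented in this PR):

    import torch

    def to_column_major(x: torch.Tensor) -> torch.Tensor:
        # Permute the dimensions, make the permuted view contiguous (this
        # physically reorders the storage), then permute back so the logical
        # shape is unchanged while the strides become Fortran-style.
        dims = tuple(reversed(range(x.dim())))
        return x.permute(dims).contiguous().permute(dims)

    x = torch.zeros(3, 4)    # row-major: x.stride() == (4, 1)
    f = to_column_major(x)   # column-major: f.stride() == (1, 3)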

Type of change

Select relevant options.

[ ] Bug fix (non-breaking change which fixes an issue)
[x] New feature (non-breaking change which adds functionality)
[ ] Breaking change (fix or feature that would cause existing functionality to not work as expected)
[ ] Documentation update

Are all split configurations tested and accounted for?
[x] yes [ ] no
Does this change require a documentation update outside of the changes proposed?
[x] yes [ ] no
Does this change modify the behaviour of other functions?
[ ] yes [x] no
Are there code practices which require justification?
[ ] yes [x] no

Commit messages (truncated excerpts):
…nderlying torch tensor and no longer gets the wrong info from the numpyfied DNDarray
…ng layout (e.g. from tensor creation with ndmin > tensor.shape).
…rides already matching requested order), removed "NotImplementedError" for order K.
codecov bot commented Dec 6, 2019

Codecov Report

Merging #423 into master will increase coverage by 0.02%.
The diff coverage is 100%.

Impacted file tree graph

@@            Coverage Diff             @@
##           master     #423      +/-   ##
==========================================
+ Coverage   98.07%   98.09%   +0.02%     
==========================================
  Files          55       55              
  Lines       11414    11548     +134     
==========================================
+ Hits        11194    11328     +134     
  Misses        220      220
Impacted Files Coverage Δ
heat/core/tests/test_dndarray.py 100% <100%> (ø) ⬆️
heat/core/tests/test_suites/test_basic_test.py 100% <100%> (ø) ⬆️
heat/core/tests/test_memory.py 100% <100%> (ø) ⬆️
heat/core/factories.py 100% <100%> (ø) ⬆️
heat/core/memory.py 100% <100%> (ø) ⬆️
heat/core/tests/test_suites/basic_test.py 100% <100%> (ø) ⬆️
heat/core/dndarray.py 95.71% <100%> (+0.05%) ⬆️

Continue to review full report at Codecov.

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 4efe156...46fbc2e.

tuple of ints: bytes to step in each dimension when traversing a tensor.
numpy-like usage: self.strides
"""
stride = np.array(self._DNDarray__array.stride())
Member:
can we use torch here instead of numpy? numpy doesn't like GPUs

Member:
There is no need to return an array (NumPy or PyTorch) at all, as stride() already returns a tuple.

Contributor Author (ClaudiaComito):
@krajsek this is the numpy-like version of stride (output in bytes), I need to be able to multiply it by the element size.

@coquelin77 if you expect problems I'll use list comprehension instead of numpy. Must we be wary of every numpy call then, or only when dealing with large arrays?

Member:
o.k., I see

> @krajsek this is the numpy-like version of stride (output in bytes), I need to be able to multiply it by the element size.
> @coquelin77 if you expect problems I'll use list comprehension instead of numpy. Must we be wary of every numpy call then, or only when dealing with large arrays?

Member:
Yes, we should always be wary of using numpy; we should aim to use primarily pytorch functions. Also, if I don't misunderstand, wrapping this in a numpy array doesn't really get us anything, since it's already a torch tensor.

Contributor Author (ClaudiaComito):
Hi Daniel.
This:
self._DNDarray__array.stride()
is a tuple. In order to provide numpy-like information (output in bytes), this tuple has to be multiplied by the element size of the tensor (scalar, bytes). I need to change the tuple into something that can be multiplied by a scalar. I'll use list comprehension and be done with it.
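
For reference, the numpy-free computation discussed here can look like this (a standalone sketch on a plain torch tensor; inside the DNDarray property the tensor would be the underlying self._DNDarray__array):

    import torch

    t = torch.zeros(3, 4)                                 # float32, row-major
    stride = t.stride()                                    # element steps: (4, 1)
    itemsize = t.element_size()                            # 4 bytes per float32 element
    strides_bytes = tuple(s * itemsize for s in stride)    # (16, 4): numpy-like strides in bytes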

@@ -22,3 +24,44 @@ def copy(a):
return dndarray.DNDarray(
a._DNDarray__array.clone(), a.shape, a.dtype, a.split, a.device, a.comm
Member:
can you add a.order to this?

Contributor Author (ClaudiaComito):
Hm, but order is not a tensor attribute. I'm not sure I see the need for this. The way to check the memory layout would be to call a.stride() (or a.strides if you want to be numpy-like).

krajsek previously approved these changes Dec 9, 2019

@krajsek (Member) left a comment:

I went through the code and it looks good, but I did not test it myself.


@coquelin77 coquelin77 merged commit 2b03e77 into master Dec 10, 2019
@coquelin77 coquelin77 deleted the features/372-data_layout branch December 10, 2019 08:23
Development

Successfully merging this pull request may close these issues.

Allow Fortran-style memory layout
3 participants