[core] Convert operator supports `u2, u3, u6` types #23490

praasz · 2024-03-16T08:00:57Z

Details:

Add new types u2, u3, u6 to Convert operator.
Expand element::Iterator to support NF4 and BitProxy to support NF4 conversion.
Update tensor to calculate correctly byte size for u3, u6 types.
Fix NF4 <-> conversion to always use byte pack/unpack and quantization when convert to/from floating point. In future the conversion for NF4 will be limited to f32 -> NF4.

Tickets:

praasz · 2024-03-16T08:05:26Z

Blocked by PR #23279 .

Improve binary size reduction

mlukasze · 2024-03-19T06:29:42Z

Blocked by PR #23279 .

blocker has been merged

src/core/src/op/convert.cpp

…23490) ### Details: - Add new types `u2, u3, u6` to Convert operator. - Expand `element::Iterator` to support NF4 and `BitProxy` to support NF4 conversion. - Update tensor to calculate correctly byte size for `u3, u6` types. - Fix NF4 <-> conversion to always use byte pack/unpack and quantization when convert to/from floating point. In future the conversion for NF4 will be limited to f32 -> NF4. ### Tickets: - [CVS-127000](https://jira.devtools.intel.com/browse/CVS-127000) - [CVS-128024](https://jira.devtools.intel.com/browse/CVS-128024) --------- Co-authored-by: Michal Lukaszewski <michal.lukaszewski@intel.com>

praasz requested review from AlexKoff88 and t-jankowski March 16, 2024 08:00

praasz added this to the 2024.1 milestone Mar 16, 2024

praasz added 7 commits March 18, 2024 21:19

Add fp32 -> nf4 convert test

77cd9ef

Use element iterator in Convert

11c3e35

Update tensor byte size calculation

55372c2

Add u2, u3, u6 type to Convert

232f9f2

Add helpers to create iterator from void pointer

e86a545

Improve binary size reduction

Fix capture list in AllocatedTensor ctor

0ee0ef6

Correct NF4 <-> floating point deduction

23a0e32

praasz force-pushed the feature/add-u2-u3-u6-to-convert branch from 0afc7c0 to 23a0e32 Compare March 18, 2024 21:26

github-actions bot removed category: CPU OpenVINO CPU plugin category: build OpenVINO cmake script / infra category: CPP API OpenVINO CPP API bindings labels Mar 18, 2024

praasz marked this pull request as ready for review March 18, 2024 21:40

praasz requested review from a team as code owners March 18, 2024 21:40

praasz requested review from vurusovs and mitruska March 19, 2024 05:38

Restore removed include

5ae1dd7

mlukasze added the Code Freeze label Mar 20, 2024

praasz added 2 commits March 25, 2024 06:29

Merge branch 'master' into feature/add-u2-u3-u6-to-convert

d470bfc

Fix cast of input tensor data

e989dbd

mitruska approved these changes Mar 25, 2024

View reviewed changes

src/core/src/op/convert.cpp Show resolved Hide resolved

praasz added this pull request to the merge queue Mar 26, 2024

github-merge-queue bot removed this pull request from the merge queue due to failed status checks Mar 26, 2024

praasz added this pull request to the merge queue Mar 26, 2024

github-merge-queue bot removed this pull request from the merge queue due to failed status checks Mar 26, 2024

praasz added this pull request to the merge queue Mar 26, 2024

akladiev removed this pull request from the merge queue due to the queue being cleared Mar 26, 2024

Merge branch 'master' into feature/add-u2-u3-u6-to-convert

2c6cf1d

praasz enabled auto-merge March 26, 2024 15:21

praasz and others added 10 commits March 26, 2024 21:59

Merge branch 'master' into feature/add-u2-u3-u6-to-convert

6c88412

Merge branch 'master' into feature/add-u2-u3-u6-to-convert

3ffd8f6

Merge branch 'master' into feature/add-u2-u3-u6-to-convert

3c15fba

Merge branch 'master' into feature/add-u2-u3-u6-to-convert

b8e4e3c

Merge branch 'master' into feature/add-u2-u3-u6-to-convert

f34fcb8

Merge branch 'master' into feature/add-u2-u3-u6-to-convert

19a60eb

Merge branch 'master' into feature/add-u2-u3-u6-to-convert

0221b65

Merge branch 'master' into feature/add-u2-u3-u6-to-convert

d50d8f2

Merge branch 'master' into feature/add-u2-u3-u6-to-convert

e9d789e

Merge branch 'master' into feature/add-u2-u3-u6-to-convert

b601124

praasz force-pushed the feature/add-u2-u3-u6-to-convert branch from 05691fd to b601124 Compare March 29, 2024 12:54

praasz added 2 commits March 29, 2024 18:29

Merge branch 'master' into feature/add-u2-u3-u6-to-convert

f26bc10

Merge branch 'master' into feature/add-u2-u3-u6-to-convert

4264d13

praasz added this pull request to the merge queue Mar 31, 2024

Merged via the queue into openvinotoolkit:master with commit bb20710 Mar 31, 2024
108 checks passed

praasz deleted the feature/add-u2-u3-u6-to-convert branch March 31, 2024 08:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[core] Convert operator supports `u2, u3, u6` types #23490

[core] Convert operator supports `u2, u3, u6` types #23490

praasz commented Mar 16, 2024 •

edited

Loading

praasz commented Mar 16, 2024 •

edited

Loading

mlukasze commented Mar 19, 2024

[core] Convert operator supports u2, u3, u6 types #23490

[core] Convert operator supports u2, u3, u6 types #23490

Conversation

praasz commented Mar 16, 2024 • edited Loading

Details:

Tickets:

praasz commented Mar 16, 2024 • edited Loading

mlukasze commented Mar 19, 2024

[core] Convert operator supports `u2, u3, u6` types #23490

[core] Convert operator supports `u2, u3, u6` types #23490

praasz commented Mar 16, 2024 •

edited

Loading

praasz commented Mar 16, 2024 •

edited

Loading