-
Notifications
You must be signed in to change notification settings - Fork 246
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[PTQ][OV] FP8 implementation #2283
[PTQ][OV] FP8 implementation #2283
Conversation
Codecov Report
Additional details and impacted files@@ Coverage Diff @@
## develop #2283 +/- ##
===========================================
- Coverage 90.74% 90.60% -0.15%
===========================================
Files 489 489
Lines 43975 44196 +221
===========================================
+ Hits 39906 40044 +138
- Misses 4069 4152 +83
... and 2 files with indirect coverage changes
Flags with carried forward coverage won't be shown. Click here to find out more.
|
c048a85
to
097aaee
Compare
@KodiaqQ Are you planning to add a new |
Good catch, thanks. I'll add it. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Some general comments:
- FBC and BC support FakeConvert operations?
- What test plan do you propose for FP8 quantization?
This reverts commit 65c25a6.
nncf/quantization/fake_quantize.py
Outdated
:return: Parameters of the FakeConvert layer. | ||
""" | ||
|
||
destination_type_maximum = {"HF8": 448, "BF8": 57344} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I suggest to align names with https://github.com/openvinotoolkit/openvino/blob/74f6b6454d2b503b7a412b80281bd54bfd328c43/src/core/reference/include/openvino/reference/fake_convert.hpp#L253C29-L253C37: "f8e5m2", "f8e4m3"
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It looks like this PR requires additional validation. I would suggest to run ptq conformance test and DLB calibrate test on small model scope.
@alexsu52, please, review. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, @KodiaqQ please merge it after passing additional validation.
@@ -154,6 +169,7 @@ def get_start_nodes_for_activation_path_tracing(nncf_graph: NNCFGraph) -> List[N | |||
|
|||
:param nncf_graph: NNCFGraph to get the start nodes. | |||
:return: List of NNCFNodes to use as start nodes for activation path tracing. | |||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
typo
Conformance validation run - 229 (manual). |
Changes
Mode
parameterReason for changes
Related tickets
Tests
tests/openvino/native/quantization/test_graphs.py
tests/openvino/native/test_model_transformer.py
On top of openvinotoolkit/openvino#21034 - Merged