
gpu: quantization refactoring prerequisites #2512

Merged 5 commits into main on Jan 26, 2025

Conversation

@dzarukin (Contributor) commented Jan 24, 2025

This PR updates the gemm_with_post_ops implementation with a new way of loading scales, as they will be affected by the upcoming refactor. The main issue is that data_type::undef becomes a valid data type for scales, meaning the scales were not specified, which would trigger a number of issues in the existing implementation.

This PR guards against that situation.

Thanks to @rjoursler for a patch to deal with undefined data types.

@dzarukin dzarukin requested a review from a team as a code owner January 24, 2025 22:45
@github-actions github-actions bot added the platform:gpu-intel Codeowner: @oneapi-src/onednn-gpu-intel label Jan 24, 2025
@dzarukin (Contributor, Author) commented:

make test
disable test_device_cpu

src/gpu/intel/ocl/gemm/gemm_with_post_ops.cl — review thread resolved (outdated)
src/gpu/intel/ocl/ocl_math_utils.h — review thread resolved
@dzarukin dzarukin force-pushed the dzarukin/refactor_quant_prereq branch from ca82718 to d7c52ef on January 24, 2025 23:49
@dzarukin dzarukin merged commit 4ffd446 into main Jan 26, 2025
4 checks passed
@dzarukin dzarukin deleted the dzarukin/refactor_quant_prereq branch January 26, 2025 18:44
Labels
platform:gpu-intel Codeowner: @oneapi-src/onednn-gpu-intel
3 participants