The convolution described by the following driver command fails on the MI200 platform:
MIOpenDriver convint8 -n 1 -c 64 --in_d 128 -H 128 -W 128 -k 32 --fil_d 3 -y 3 -x 3 --pad_d 1 -p 1 -q 1 --conv_stride_d 1 -u 1 -v 1 --dilation_d 1 -l 1 -j 1 --spatial_dim 3 -m conv -g 1 -F 1 -t 1 -S 0
Running it results in the following:
Memory access fault by GPU node-6 (Agent handle: 0x1190f40) on address 0x7f3692c9e000. Reason: Unknown.
@atamazov Can you please take a look?
@JehandadKhan The cause is an integer overflow. Log from Navi21:
MIOpen(HIP): Info [ConvolutionForwardImmediate] solver_id = GemmFwdRest, workspace = 7247757312
...
MIOpen(HIP): Info2 [Log] Kernel MIOpenUtilKernels4.cl Compile Time, ms: 155.193
MIOpen(HIP): Info2 [run] kernel_name = transpose_packed_MN2NM, global_work_dim = { 3623878656, 1, 1 }, local_work_dim = { 256, 1, 1 }
The solver calls
float transpose_packed_MN2NM(const Handle& handle, int m, int n, int in_offset, int out_offset, ConstData_t in, Data_t out, miopenDataType_t type);
and passes 3623878656 as out_offset, which exceeds INT_MAX (2147483647 for a 32-bit int).
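To make the overflow concrete, here is a minimal sketch that reproduces the number from the log using the shape in the driver command. It assumes the offset equals the element count of an im2col-style buffer (input channels × filter volume × output volume); that interpretation is my reading of the numbers, not something confirmed in this thread. All names (`kInChannels`, `kIm2ColElements`, `FitsInInt`, and so on) are hypothetical, and the `static_assert` on `int` assumes the usual 32-bit `int`.

```cpp
#include <cassert>
#include <cstdint>
#include <limits>

// Shape taken from the driver command in this issue (-c 64, 3x3x3 filter).
constexpr std::int64_t kInChannels = 64;
constexpr std::int64_t kFilD = 3, kFilH = 3, kFilW = 3;

// With stride 1, pad 1, dilation 1 and a 3x3x3 filter, the output keeps the
// 128x128x128 spatial size of the input.
constexpr std::int64_t kOutD = 128, kOutH = 128, kOutW = 128;

// Assumption: the offset is the element count of an im2col-style buffer.
constexpr std::int64_t kIm2ColElements =
    kInChannels * kFilD * kFilH * kFilW * kOutD * kOutH * kOutW;

// The product matches the value reported in the log ...
static_assert(kIm2ColElements == 3623878656LL, "matches the logged value");
// ... and it does not fit in an int parameter (assuming 32-bit int).
static_assert(kIm2ColElements > std::numeric_limits<int>::max(),
              "offset exceeds INT_MAX");

// Hypothetical helper: true when a 64-bit value survives a conversion to int.
constexpr bool FitsInInt(std::int64_t value)
{
    return value >= std::numeric_limits<int>::min() &&
           value <= std::numeric_limits<int>::max();
}
```

Under the same assumption, the logged workspace size (7247757312) is exactly twice this value.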
The fix could be either narrowing the solver's applicability or fixing the offset math in transpose_packed_MN2NM(); the latter is preferable, of course.
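The two options could look roughly like the sketch below. This is host-side illustration code only, not MIOpen's implementation: `OffsetFitsInt` stands in for an applicability check that rejects configurations whose offsets do not fit in `int`, and `TransposeMN2NMHost` is a hypothetical CPU reference that carries offsets as 64-bit values so the index math cannot wrap. The M×N to N×M layout is assumed from the kernel name.

```cpp
#include <cassert>
#include <cstdint>
#include <limits>
#include <vector>

// Option 1 (narrow applicability), hypothetical predicate: a solver could
// decline to run when an offset cannot be passed through an int parameter.
inline bool OffsetFitsInt(std::uint64_t offset)
{
    return offset <= static_cast<std::uint64_t>(std::numeric_limits<int>::max());
}

// Option 2 (fix the math), hypothetical host reference: offsets and indices
// are 64-bit, so large buffers are addressed without overflow.
// Reads an m x n row-major matrix at in[in_offset...] and writes its
// n x m transpose at out[out_offset...].
inline void TransposeMN2NMHost(const std::vector<std::int8_t>& in,
                               std::vector<std::int8_t>& out,
                               std::uint64_t m,
                               std::uint64_t n,
                               std::uint64_t in_offset,
                               std::uint64_t out_offset)
{
    assert(in.size() >= in_offset + m * n);
    assert(out.size() >= out_offset + m * n);
    for(std::uint64_t i = 0; i < m; ++i)
    {
        for(std::uint64_t j = 0; j < n; ++j)
        {
            out[out_offset + j * m + i] = in[in_offset + i * n + j];
        }
    }
}
```

Option 1 is the smaller change but gives up the solver for large problems; option 2 keeps the solver applicable, which matches the preference stated above.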
How urgent is this issue (please assign a label)?
Please note that:
@JehandadKhan A similar problem occurs in CallGemm().