ROCm 2.7 Unable to find a suitable algorithm for doing forward convolution #869
@vnikiti, could you provide more information:
I have a very similar problem. I am running Ubuntu 18.04 in a KVM guest with PCIe passthrough.
@zjing14 @daniellowell, could you help review the logs?
Here is the problem: the HIP compilation is failing ungracefully on gfx803. @zjing14 and @asroy, please look into this issue. As a workaround, please try setting this blacklist variable:
Setting the blacklist variable solved the problem for me.
@sunway513 I tried running benchmark inside docker and natively. I get the same error.
The blacklist variable did not work for me. I also tried other models, and my GPU crashed and the system rebooted. I am returning this GPU and switching to NVIDIA. I really like AMD and I love open source, but ROCm is still very incomplete. Architecture-wise, we can't use ROCm on other operating systems. Implementation-wise, there are prohibitively many bugs.
This issue should be resolved in ROCm 2.8.
I have run into the same issue after installing ROCm and TensorFlow on Debian 10. This workaround did not work for me either.
apt --installed list | grep -E 'hip|hcc|roc|miopen|amdgpu|hsa'
screenfetch
Having the same issue on my RX 580.
@anubhavj87 @ianzur @vnikiti Hello all, these issues may be unrelated in terms of the specific algorithm, even though they result in the same behavior. Please collect the logs using the environment variable:
Attach the resulting logs to your new post using the button on the GUI labeled:
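One way to capture a run like this for attaching is sketched below. This is only a minimal Python sketch, not something provided by the maintainers: the helper name `run_with_env` is made up for illustration, and the variable name and value to pass are whatever the maintainers ask for (they are not reproduced here). It runs a command with extra environment variables and writes both stdout and stderr to one log file.

```python
import os
import subprocess
import sys


def run_with_env(cmd, extra_env, log_path):
    """Run cmd with extra environment variables set.

    stdout and stderr are both written to log_path so the whole run,
    including error messages, ends up in one attachable file.
    Returns the exit code of the command.
    """
    # Start from the current environment so PATH, HOME, etc. are preserved.
    env = dict(os.environ)
    env.update(extra_env)
    with open(log_path, "w", encoding="utf-8") as log:
        result = subprocess.run(
            cmd,
            env=env,
            stdout=log,
            stderr=subprocess.STDOUT,  # merge stderr into the same file
            check=False,  # a failing run is expected here; do not raise
        )
    return result.returncode
```

For example, passing the benchmark command from this issue as a list (`["python3", "tf_cnn_benchmarks.py", "--num_gpus=1", ...]`), a dict holding the requested variable, and a filename such as `benchmark.log` would produce a single file to attach.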
I had the same problem with my RX580. Some notes:
The MIOpen doc pages mention that they are dropping support for gfx803 (RX580...) after 2.0.
So the workaround works, but I still get messages like:
This issue should be fixed in 2.1.
@daniellowell Thank you!
MIOpen 2.1 has been released with ROCm 2.8. Please re-open the issue if the problem still persists.
After installing ROCm 3.0, I have the same issue again. Please see my previous reply in this thread for details.
I get the following error when running the TensorFlow 1.14 benchmark on an RX 580 4GB with ROCm 2.7.22.
Below is the command I ran to produce this error.
python3 tf_cnn_benchmarks.py --num_gpus=1 --batch_size=32 --model=resnet50 --variable_update=parameter_server
Below is the complete output of this command including the error.