Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Pantheon with amdgpu-pro driver: xorg fails to start #167909

Closed
davidak opened this issue Apr 8, 2022 · 4 comments
Closed

Pantheon with amdgpu-pro driver: xorg fails to start #167909

davidak opened this issue Apr 8, 2022 · 4 comments
Labels
0.kind: bug Something is broken 2.status: wontfix We cannot or will not fix this issue 6.topic: hardware

Comments

@davidak
Copy link
Member

davidak commented Apr 8, 2022

Describe the bug

Steps To Reproduce

Steps to reproduce the behavior:

  1. services.xserver.desktopManager.pantheon.enable = true;
  2. services.xserver.videoDrivers = [ "amdgpu-pro" ];
  3. Fatal server error

Full Xorg log: https://gist.github.com/davidak/fc9fcbc2b23c0b67a678443b79f30ded

Expected behavior

Same as with amdgpu driver.

Screenshots

signal-2022-04-08-205904

Additional context

It works perfectly with amdgpu driver.

Notify maintainers

@corngood asked for log in #151019 (comment).

Can you have a look?

Metadata

Build from unstable: 22.05pre367572.b6966d911da

Still reproducible with 22.05pre369524.5181d5945ed

@davidak davidak added the 0.kind: bug Something is broken label Apr 8, 2022
@davidak
Copy link
Member Author

davidak commented Apr 8, 2022

AMDGPU(0): [drm] Failed to open DRM device for pci:0000:03:00.0: No such file or directory

https://gist.github.com/davidak/fc9fcbc2b23c0b67a678443b79f30ded#file-amdgpu-pro-xorg-pantheon-crash-txt-L74

When i'm booted into NixOS stable i can confirm it is the right PCI BusID.

[root@gaming:~]# lspci
...
03:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Navi 23 [Radeon RX 6600/6600 XT/6600M] (rev c1)

So it's not the issue that is does not find the GPU or use the wrong (i have only 1, not even integrated).

@corngood
Copy link
Contributor

corngood commented Apr 8, 2022

Could you share modinfo amdgpu, uname -a, and dmesg here for reference? I'd like to make sure it's actually loading the dkms amdgpu module, but I'm not 100% sure how to verify that. It'll be a few days before I can test it myself to compare.

@davidak
Copy link
Member Author

davidak commented Apr 8, 2022

[root@gaming:~]# modinfo amdgpu
filename:       /run/booted-system/kernel-modules/lib/modules/5.10.109/kernel/drivers/gpu/drm/amd/amdgpu/amdgpu.ko.xz
version:        5.11.19.21.30
...

Full output: https://gist.github.com/davidak/c776f07586ea640f87a15df5f6cce35f

[root@gaming:~]# uname -a
Linux gaming 5.10.109 #1-NixOS SMP Mon Mar 28 07:57:11 UTC 2022 x86_64 GNU/Linux

dmesg: https://gist.github.com/davidak/def549a347bc29419e4f1f2c5adbd898

i see no issues there

× display-manager.service - X11 Server
     Loaded: loaded (/etc/systemd/system/display-manager.service; linked; vendor preset: enabled)
     Active: failed (Result: exit-code) since Fri 2022-04-08 20:57:56 CEST; 12min ago
    Process: 1338 ExecStartPre=/nix/store/147dr7c7vpsjy8sscpkv555lfxz46pyp-unit-script-display-manager-pre-start/bin/display-manager-pre-start (code=exited, status=0/SU>
    Process: 1343 ExecStart=/nix/store/wlvpba9p8g6mxskbkw41gzf27rqsr15n-unit-script-display-manager-start/bin/display-manager-start (code=exited, status=1/FAILURE)
   Main PID: 1343 (code=exited, status=1/FAILURE)
         IP: 0B in, 0B out
        CPU: 133ms

Apr 08 20:57:56 gaming systemd[1]: display-manager.service: Scheduled restart job, restart counter is at 3.
Apr 08 20:57:56 gaming systemd[1]: Stopped X11 Server.
Apr 08 20:57:56 gaming systemd[1]: display-manager.service: Start request repeated too quickly.
Apr 08 20:57:56 gaming systemd[1]: display-manager.service: Failed with result 'exit-code'.
Apr 08 20:57:56 gaming systemd[1]: Failed to start X11 Server.
Apr 08 20:57:56 gaming systemd[1]: display-manager.service: Triggering OnFailure= dependencies.

display-manager-start:

#!/nix/store/ld19k12mf0pa42ff43qs3sizamp09f03-bash-5.1-p16/bin/bash
set -e
export PATH=/nix/store/91q8d0jhmxn34lgjhank3sg0brgns21k-lightdm-1.30.0/sbin:$PATH
exec /nix/store/91q8d0jhmxn34lgjhank3sg0brgns21k-lightdm-1.30.0/sbin/lightdm
[root@gaming:~]# lspci  -v -s  $(lspci | grep ' VGA ' | cut -d" " -f 1)
03:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Navi 23 [Radeon RX 6600/6600 XT/6600M] (rev c1) (prog-if 00 [VGA controller])
	Subsystem: Sapphire Technology Limited Device e448
	Flags: bus master, fast devsel, latency 0, IRQ 150
	Memory at 4200000000 (64-bit, prefetchable) [size=8G]
	Memory at 4100000000 (64-bit, prefetchable) [size=2M]
	I/O ports at 4000 [size=256]
	Memory at 50200000 (32-bit, non-prefetchable) [size=1M]
	Expansion ROM at 000c0000 [disabled] [size=128K]
	Capabilities: [48] Vendor Specific Information: Len=08 <?>
	Capabilities: [50] Power Management version 3
	Capabilities: [64] Express Legacy Endpoint, MSI 00
	Capabilities: [a0] MSI: Enable+ Count=1/1 Maskable- 64bit+
	Capabilities: [100] Vendor Specific Information: ID=0001 Rev=1 Len=010 <?>
	Capabilities: [150] Advanced Error Reporting
	Capabilities: [200] Physical Resizable BAR
	Capabilities: [240] Power Budgeting <?>
	Capabilities: [270] Secondary PCI Express
	Capabilities: [2a0] Access Control Services
	Capabilities: [2d0] Process Address Space ID (PASID)
	Capabilities: [320] Latency Tolerance Reporting
	Capabilities: [410] Physical Layer 16.0 GT/s <?>
	Capabilities: [440] Lane Margining at the Receiver <?>
	Kernel driver in use: amdgpu
	Kernel modules: amdgpu

shows the driver should be in use

[root@gaming:~]# lsmod | grep amd
amdgpu               6582272  1
iommu_v2               20480  1 amdgpu
ttm                    69632  1 amdgpu
amdkcl                 24576  2 amdgpu,ttm
gpu_sched              36864  1 amdgpu
drm_kms_helper        270336  1 amdgpu
i2c_algo_bit           16384  1 amdgpu
drm                   618496  5 gpu_sched,drm_kms_helper,amdgpu,ttm
backlight              24576  3 video,amdgpu,drm
i2c_core              102400  8 videodev,drm_kms_helper,i2c_algo_bit,ee1004,i2c_smbus,amdgpu,i2c_i801,drm

@tomodachi94
Copy link
Member

amdgpu-pro was removed by #320228. It was unmaintained.

@tomodachi94 tomodachi94 closed this as not planned Won't fix, can't repro, duplicate, stale Dec 12, 2024
@tomodachi94 tomodachi94 added the 2.status: wontfix We cannot or will not fix this issue label Dec 12, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
0.kind: bug Something is broken 2.status: wontfix We cannot or will not fix this issue 6.topic: hardware
Projects
None yet
Development

No branches or pull requests

4 participants