Amdgpu fails to load on boot after switching from NVIDIA

Hi,
I recently switched from NVIDIA’s GTX 1060 to an RX 7800 XT and, to no surprise, my system won’t boot right now. I have gone through everything mentioned on AMDGPU: uninstalled all NVIDIA-related drivers, reinstalled video-linux, set amdgpu.dc to 0 in the grub file, and added amdgpu to MODULES in /etc/mkinitcpio.conf. None of this seems to do anything. Am I missing something? Any suggestions would be appreciated.

journalctl:

Nov 15 21:09:04 pumpkinfield kernel: x86/cpu: SGX disabled by BIOS.
Nov 15 21:09:04 pumpkinfield kernel: blacklist: Problem blacklisting hash (-13)
Nov 15 21:09:04 pumpkinfield kernel: blacklist: Problem blacklisting hash (-13)
Nov 15 21:09:04 pumpkinfield kernel: blacklist: Problem blacklisting hash (-13)
Nov 15 21:09:04 pumpkinfield kernel: blacklist: Problem blacklisting hash (-13)
Nov 15 21:09:04 pumpkinfield kernel: blacklist: Problem blacklisting hash (-13)
Nov 15 21:09:04 pumpkinfield kernel: blacklist: Problem blacklisting hash (-13)
Nov 15 21:09:04 pumpkinfield kernel: [drm:amdgpu_device_init.cold [amdgpu]] *ERROR* sw_init of IP block <gmc_v11_0> failed -22
Nov 15 21:09:04 pumpkinfield kernel: amdgpu 0000:03:00.0: amdgpu: amdgpu_device_ip_init failed
Nov 15 21:09:04 pumpkinfield kernel: amdgpu 0000:03:00.0: amdgpu: Fatal error during GPU init
Nov 15 21:09:05 pumpkinfield kernel: usb 1-5: 3:1: cannot get freq at ep 0x84
Nov 15 21:23:31 pumpkinfield avahi-dnsconfd[514]: read(): EOF
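
To pull the relevant lines out of a log like the one above, a small filter can help. This is only a sketch: amdgpu_errors is a made-up helper name, and it simply keeps lines that mention amdgpu together with an error keyword, so it may miss or over-match on other systems.

```shell
# Hypothetical helper: keep only log lines that mention amdgpu
# and also contain an error-like keyword. Reads log text on stdin,
# so it works on live journal output or on a saved log file.
amdgpu_errors() {
    grep -i 'amdgpu' | grep -Ei 'error|fail|fatal'
}

# Possible use on the affected install (kernel messages of the previous boot;
# adjust the -b offset to pick the boot that failed):
#   journalctl -k -b -1 --no-pager | amdgpu_errors
```

On the log above this would leave the three amdgpu lines (sw_init failed -22, amdgpu_device_ip_init failed, Fatal error during GPU init).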

mhwd -li

> Installed PCI configs:
--------------------------------------------------------------------------------
                  NAME               VERSION          FREEDRIVER           TYPE
--------------------------------------------------------------------------------
           video-linux            2018.05.04                true            PCI


Warning: No installed USB configs!

mhwd-kernel -li

Currently running: 6.5.5-1-MANJARO (linux65)
The following kernels are installed in your system:
   * linux61

MODULES in /etc/mkinitcpio.conf is set to MODULES=(amdgpu radeon); I also tried without radeon.
GRUB_CMDLINE_LINUX_DEFAULT is set to GRUB_CMDLINE_LINUX_DEFAULT="quiet resume=UUID=edad059d-2569-4854-b8ed-7a7d5744e205 udev.log_priority=3 radeon.cik_support=0 amdgpu.cik_support=1 radeon.si_support=0 amdgpu.si_support=1 amdgpu.dc=0"

linux-firmware is at the latest version and my mirrors are up to date. There is no major update pending, and my last OS upgrade went smoothly without any issues (that was before I switched GPUs). I also have Windows 10 installed on a separate SSD and can confirm that the GPU itself works. I am running two monitors, one connected via HDMI and the other via DP; I tried switching them around in different configurations, but to no avail. When I try to launch startx manually in chroot, there’s only a black screen. Before I added amdgpu to MODULES, the system would freeze at dev/nvme0xyz: clean, x/x files, x/x blocks

EDIT: Everything above refers to linux61.

On both linux65 and linux66 I get the following result:
The screen backlight flashes a few times, then it freezes at a black screen (backlight on).

journalctl linux66

-- Boot f8249618dda448cf8fd57f5697473ab8 --
Nov 15 23:35:52 pumpkinfield kernel: x86/cpu: SGX disabled by BIOS.
Nov 15 23:35:53 pumpkinfield kernel: usb 1-5: 3:1: cannot get freq at ep 0x84
Nov 15 23:35:55 pumpkinfield gnome-session-binary[593]: Unrecoverable failure in required component org.gnome.Shell.desktop
Nov 15 23:36:31 pumpkinfield avahi-dnsconfd[459]: read(): EOF

journalctl linux65

-- Boot 6a58140c5e8b4992bac416bf1f6db235 --
Nov 15 23:46:13 pumpkinfield kernel: x86/cpu: SGX disabled by BIOS.
Nov 15 23:46:13 pumpkinfield kernel: usb 1-5: 3:1: cannot get freq at ep 0x84
Nov 15 23:46:15 pumpkinfield gnome-session-binary[591]: Unrecoverable failure in required component org.gnome.Shell.desktop
Nov 15 23:47:11 pumpkinfield avahi-dnsconfd[457]: read(): EOF

I have updated linux-firmware and regenerated the kernel images for each kernel after installation.

    ~  LANG=C errno 22                  ✔ 
EINVAL 22 Invalid argument

It looks to me like one of the arguments is wrong. If no firmware had been found, you would see -2 instead.
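
If the errno tool is not installed, the two codes mentioned here can be told apart with a tiny lookup. A minimal sketch, assuming only these two values matter; kernel_errno is a made-up name and the table is deliberately incomplete.

```shell
# Hypothetical helper: translate the negative return codes discussed
# in this thread into their errno names. Only -2 and -22 are covered;
# anything else is reported as unknown.
kernel_errno() {
    case "${1#-}" in          # strip the leading minus sign, if any
        2)  echo "ENOENT: No such file or directory" ;;
        22) echo "EINVAL: Invalid argument" ;;
        *)  echo "unknown: look it up with 'errno ${1#-}'" ;;
    esac
}

kernel_errno -22   # prints: EINVAL: Invalid argument
```

So "failed -22" in the journal points at a bad argument rather than at a missing firmware file.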

These parameters for amdgpu and radeon are only needed when you have legacy hardware and want to enable experimental support for GCN1/2, but your card is RDNA3. By default they are disabled.

MODULES=()

Remove all of those parameters.
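
As a rough illustration of what "remove" means for the command line, the sketch below filters the legacy switches and amdgpu.dc out of a parameter string. strip_legacy_gpu_params is a hypothetical helper, not an existing tool; it only prints the cleaned string and does not touch /etc/default/grub.

```shell
# Hypothetical helper: drop the legacy GCN1/2 switches and amdgpu.dc
# from a kernel command line string, leaving everything else untouched.
# The argument is intentionally left unquoted in the loop so that it is
# split into individual parameters on whitespace.
strip_legacy_gpu_params() {
    out=""
    for word in $1; do
        case "$word" in
            radeon.cik_support=*|amdgpu.cik_support=*) ;;  # legacy GCN2 switch
            radeon.si_support=*|amdgpu.si_support=*)   ;;  # legacy GCN1 switch
            amdgpu.dc=*)                               ;;  # display core override
            *) out="${out:+$out }$word" ;;                 # keep everything else
        esac
    done
    printf '%s\n' "$out"
}

strip_legacy_gpu_params "quiet resume=UUID=edad059d-2569-4854-b8ed-7a7d5744e205 udev.log_priority=3 radeon.cik_support=0 amdgpu.cik_support=1 radeon.si_support=0 amdgpu.si_support=1 amdgpu.dc=0"
# prints: quiet resume=UUID=edad059d-2569-4854-b8ed-7a7d5744e205 udev.log_priority=3
```

After editing GRUB_CMDLINE_LINUX_DEFAULT by hand, the usual follow-up on Manjaro is sudo update-grub, plus sudo mkinitcpio -P if MODULES was changed.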

Are you sending this message from the past then? Because in the present I have 6.5.11-1-MANJARO.

Post the output of inxi -v7azy.

Apologies, I should have noted that Currently running: 6.5.5-1-MANJARO (linux65) is the kernel on the live USB I made to get into chroot. The installed linux61 is fully up to date, and apparently my new GPU just refuses to work with it, so I tried adding all the parameters mentioned above to solve it. Following your suggestion, I switched to linux66 and deleted those additional parameters, and the system booted properly this time. The issue is solved, but here’s the output of inxi -v7azy anyway.
Thanks for the help

System:
  Kernel: 6.6.1-1-MANJARO arch: x86_64 bits: 64 compiler: gcc v: 13.2.1
    clocksource: tsc available: hpet,acpi_pm
    parameters: BOOT_IMAGE=/boot/vmlinuz-6.6-x86_64
    root=UUID=b0c802c2-00b2-4409-adbd-b4219b043d93 rw quiet
    resume=UUID=edad059d-2569-4854-b8ed-7a7d5744e205 udev.log_priority=3
  Desktop: GNOME v: 45.1 tk: GTK v: 3.24.38 wm: gnome-shell dm: GDM v: 45.0.1
    Distro: Manjaro Linux base: Arch Linux
Machine:
  Type: Desktop Mobo: Micro-Star model: MPG Z390 GAMING PLUS (MS-7B51) v: 1.0
    serial: <superuser required> UEFI: American Megatrends v: 1.E0
    date: 10/20/2022
Battery:
  Message: No system battery data found. Is one present?
Memory:
  System RAM: total: 32 GiB available: 31.29 GiB used: 4.1 GiB (13.1%)
  RAM Report: permissions: Unable to run dmidecode. Root privileges required.
CPU:
  Info: model: Intel Core i5-9600KF bits: 64 type: MCP arch: Coffee Lake
    gen: core 9 level: v3 note: check built: 2018 process: Intel 14nm family: 6
    model-id: 0x9E (158) stepping: 0xD (13) microcode: 0xFA
  Topology: cpus: 1x cores: 6 smt: <unsupported> cache: L1: 384 KiB
    desc: d-6x32 KiB; i-6x32 KiB L2: 1.5 MiB desc: 6x256 KiB L3: 9 MiB
    desc: 1x9 MiB
  Speed (MHz): avg: 800 min/max: 800/4600:4700 scaling: driver: intel_pstate
    governor: powersave cores: 1: 800 2: 800 3: 800 4: 800 5: 800 6: 800
    bogomips: 44412
  Flags: 3dnowprefetch abm acpi adx aes aperfmperf apic arat
    arch_capabilities arch_perfmon art avx avx2 bmi1 bmi2 bts clflush
    clflushopt cmov constant_tsc cpuid cpuid_fault cx16 cx8 de ds_cpl dtes64
    dtherm dts ept ept_ad erms est f16c flexpriority flush_l1d fma fpu
    fsgsbase fxsr ht hwp hwp_act_window hwp_epp hwp_notify ibpb ibrs
    ibrs_enhanced ida intel_pt invpcid lahf_lm lm mca mce md_clear mmx monitor
    movbe mpx msr mtrr nonstop_tsc nopl nx pae pat pbe pcid pclmulqdq pdcm
    pdpe1gb pebs pge pln pni popcnt pse pse36 pts rdrand rdseed rdtscp
    rep_good sdbg sep smap smep ss ssbd sse sse2 sse4_1 sse4_2 ssse3 stibp
    syscall tm tm2 tpr_shadow tsc tsc_adjust tsc_deadline_timer vme vmx vnmi
    vpid x2apic xgetbv1 xsave xsavec xsaveopt xsaves xtopology xtpr
  Vulnerabilities:
  Type: gather_data_sampling mitigation: Microcode
  Type: itlb_multihit status: KVM: VMX disabled
  Type: l1tf status: Not affected
  Type: mds status: Not affected
  Type: meltdown status: Not affected
  Type: mmio_stale_data mitigation: Clear CPU buffers; SMT disabled
  Type: retbleed mitigation: Enhanced IBRS
  Type: spec_rstack_overflow status: Not affected
  Type: spec_store_bypass mitigation: Speculative Store Bypass disabled via
    prctl
  Type: spectre_v1 mitigation: usercopy/swapgs barriers and __user pointer
    sanitization
  Type: spectre_v2 mitigation: Enhanced / Automatic IBRS, IBPB: conditional,
    RSB filling, PBRSB-eIBRS: SW sequence
  Type: srbds mitigation: Microcode
  Type: tsx_async_abort mitigation: TSX disabled
Graphics:
  Device-1: AMD Navi 32 [Radeon RX 7700 XT / 7800 XT] vendor: Gigabyte
    driver: amdgpu v: kernel arch: RDNA-3 code: Navi-3x process: TSMC n5 (5nm)
    built: 2022+ pcie: gen: 4 speed: 16 GT/s lanes: 16 ports: active: HDMI-A-1
    empty: DP-1,DP-2,HDMI-A-2 bus-ID: 03:00.0 chip-ID: 1002:747e class-ID: 0300
  Device-2: Microdia ACR010 USB Webcam driver: snd-usb-audio,uvcvideo
    type: USB rev: 2.0 speed: 480 Mb/s lanes: 1 mode: 2.0 bus-ID: 1-5:5
    chip-ID: 0c45:636a class-ID: 0102 serial: <filter>
  Display: x11 server: X.org v: 1.21.1.9 with: Xwayland v: 23.2.2
    compositor: gnome-shell driver: X: loaded: amdgpu
    unloaded: modesetting,radeon alternate: fbdev,vesa dri: radeonsi
    gpu: amdgpu display-ID: :0 screens: 1
  Screen-1: 0 s-res: 1920x1080 s-size: <missing: xdpyinfo>
  Monitor-1: HDMI-A-1 mapped: HDMI-A-0 model: Samsung C27F591
    serial: <filter> built: 2017 res: 1920x1080 hz: 60 dpi: 82 gamma: 1.2
    size: 598x336mm (23.54x13.23") diag: 686mm (27") ratio: 16:9 modes:
    max: 1920x1080 min: 720x400
  API: EGL v: 1.5 hw: drv: amd radeonsi platforms: device: 0 drv: radeonsi
    device: 1 drv: swrast surfaceless: drv: radeonsi x11: drv: radeonsi
    inactive: gbm,wayland
  API: OpenGL v: 4.6 compat-v: 4.5 vendor: amd mesa v: 23.1.9-manjaro1.1
    glx-v: 1.4 direct-render: yes renderer: AMD Radeon RX 7800 XT (gfx1101 LLVM
    16.0.6 DRM 3.54 6.6.1-1-MANJARO) device-ID: 1002:747e memory: 15.62 GiB
    unified: no
  API: Vulkan v: 1.3.269 layers: 12 device: 0 type: discrete-gpu name: AMD
    Radeon RX 7800 XT (RADV GFX1101) driver: mesa radv v: 23.1.9-manjaro1.1
    device-ID: 1002:747e surfaces: xcb,xlib
Audio:
  Device-1: Intel Cannon Lake PCH cAVS vendor: Micro-Star MSI
    driver: snd_hda_intel v: kernel alternate: snd_soc_skl,snd_sof_pci_intel_cnl
    bus-ID: 00:1f.3 chip-ID: 8086:a348 class-ID: 0403
  Device-2: AMD Navi 31 HDMI/DP Audio driver: snd_hda_intel v: kernel pcie:
    gen: 4 speed: 16 GT/s lanes: 16 bus-ID: 03:00.1 chip-ID: 1002:ab30
    class-ID: 0403
  Device-3: Microdia ACR010 USB Webcam driver: snd-usb-audio,uvcvideo
    type: USB rev: 2.0 speed: 480 Mb/s lanes: 1 mode: 2.0 bus-ID: 1-5:5
    chip-ID: 0c45:636a class-ID: 0102 serial: <filter>
  API: ALSA v: k6.6.1-1-MANJARO status: kernel-api tools: N/A
  Server-1: JACK v: 1.9.22 status: off tools: N/A
  Server-2: PipeWire v: 0.3.84 status: off tools: pw-cli
  Server-3: PulseAudio v: 16.1 status: active with: pulseaudio-alsa
    type: plugin tools: pacat,pactl
Network:
  Device-1: Intel Ethernet I219-V vendor: Micro-Star MSI driver: e1000e
    v: kernel port: N/A bus-ID: 00:1f.6 chip-ID: 8086:15bc class-ID: 0200
  IF: eno1 state: up speed: 100 Mbps duplex: full mac: <filter>
  IP v4: <filter> type: dynamic noprefixroute scope: global
    broadcast: <filter>
  IP v6: <filter> type: noprefixroute scope: link
  WAN IP: <filter>
Bluetooth:
  Device-1: TP-Link UB500 Adapter driver: btusb v: 0.8 type: USB rev: 1.1
    speed: 12 Mb/s lanes: 1 mode: 1.1 bus-ID: 1-4:4 chip-ID: 2357:0604
    class-ID: e001 serial: <filter>
  Report: btmgmt ID: hci0 rfk-id: 0 state: up address: <filter> bt-v: 5.1
    lmp-v: 10 status: discoverable: no pairing: no class-ID: 6c0104
Logical:
  Message: No logical block device data found.
RAID:
  Message: No RAID data found.
Drives:
  Local Storage: total: 1.13 TiB used: 663.71 GiB (57.5%)
  SMART Message: Required tool smartctl not installed. Check --recommends
  ID-1: /dev/nvme0n1 maj-min: 259:0 vendor: Western Digital
    model: WD Blue SN570 1TB size: 931.51 GiB block-size: physical: 512 B
    logical: 512 B speed: 31.6 Gb/s lanes: 4 tech: SSD serial: <filter>
    fw-rev: 234100WD temp: 31.9 C scheme: GPT
  ID-2: /dev/sda maj-min: 8:0 vendor: A-Data model: SU650 size: 223.57 GiB
    block-size: physical: 512 B logical: 512 B speed: 6.0 Gb/s tech: SSD
    serial: <filter> fw-rev: 1B0 scheme: GPT
  Message: No optical or floppy data found.
Partition:
  ID-1: / raw-size: 146.48 GiB size: 143.13 GiB (97.71%)
    used: 23.53 GiB (16.4%) fs: ext4 dev: /dev/nvme0n1p2 maj-min: 259:2
    label: N/A uuid: b0c802c2-00b2-4409-adbd-b4219b043d93
  ID-2: /boot/efi raw-size: 550 MiB size: 548.9 MiB (99.80%)
    used: 25.6 MiB (4.7%) fs: vfat dev: /dev/nvme0n1p1 maj-min: 259:1
    label: NO_LABEL uuid: 0F85-1608
  ID-3: /home raw-size: 768.05 GiB size: 754.92 GiB (98.29%)
    used: 640.16 GiB (84.8%) fs: ext4 dev: /dev/nvme0n1p4 maj-min: 259:4
    label: N/A uuid: 0e77537b-7fd0-4efe-9e10-7d7c1346257b
Swap:
  Kernel: swappiness: 60 (default) cache-pressure: 100 (default) zswap: yes
    compressor: zstd max-pool: 20%
  ID-1: swap-1 type: partition size: 16.44 GiB used: 0 KiB (0.0%)
    priority: -2 dev: /dev/nvme0n1p3 maj-min: 259:3 label: N/A
    uuid: edad059d-2569-4854-b8ed-7a7d5744e205
Unmounted:
  ID-1: /dev/sda1 maj-min: 8:1 size: 223.57 GiB fs: ntfs label: N/A
    uuid: 3A6648F96648B781
USB:
  Hub-1: 1-0:1 info: hi-speed hub with single TT ports: 16 rev: 2.0
    speed: 480 Mb/s (57.2 MiB/s) lanes: 1 mode: 2.0 chip-ID: 1d6b:0002
    class-ID: 0900
  Device-1: 1-1:2 info: Acer Peripherals (now BenQ ) ZOWIE Gaming Mouse
    type: mouse driver: hid-generic,usbhid interfaces: 1 rev: 2.0
    speed: 12 Mb/s (1.4 MiB/s) lanes: 1 mode: 1.1 power: 100mA
    chip-ID: 04a5:8001 class-ID: 0301
  Device-2: 1-2:3 info: Apple Aluminium Keyboard (ANSI) type: keyboard,mouse
    driver: apple,usbhid interfaces: 2 rev: 2.0 speed: 12 Mb/s (1.4 MiB/s)
    lanes: 1 mode: 1.1 power: 300mA chip-ID: 05ac:024f class-ID: 0301
  Device-3: 1-4:4 info: TP-Link UB500 Adapter type: bluetooth driver: btusb
    interfaces: 2 rev: 1.1 speed: 12 Mb/s (1.4 MiB/s) lanes: 1 mode: 1.1
    power: 500mA chip-ID: 2357:0604 class-ID: e001 serial: <filter>
  Device-4: 1-5:5 info: Microdia ACR010 USB Webcam type: video,audio
    driver: snd-usb-audio,uvcvideo interfaces: 4 rev: 2.0
    speed: 480 Mb/s (57.2 MiB/s) lanes: 1 mode: 2.0 power: 500mA
    chip-ID: 0c45:636a class-ID: 0102 serial: <filter>
  Hub-2: 2-0:1 info: super-speed hub ports: 10 rev: 3.1
    speed: 10 Gb/s (1.16 GiB/s) lanes: 1 mode: 3.2 gen-2x1 chip-ID: 1d6b:0003
    class-ID: 0900
Sensors:
  System Temperatures: cpu: 35.0 C pch: 44.0 C mobo: N/A gpu: amdgpu
    temp: 35.0 C mem: 50.0 C
  Fan Speeds (rpm): N/A gpu: amdgpu fan: 1
Info:
  Processes: 263 Uptime: 7m wakeups: 0 Init: systemd v: 254 default: graphical
  tool: systemctl Compilers: gcc: 13.2.1 clang: 16.0.6 Packages: pm: pacman
  pkgs: 1433 libs: 483 tools: gnome-software,pamac Shell: Bash v: 5.2.15
  running-in: gnome-terminal inxi: 3.3.31

OK, all good then. The 7800 XT was only released 2-3 months ago, so I’m not surprised that it needs a fairly recent kernel, but I am surprised that 6.5 doesn’t work. Anyway, 6.6 is very likely to be the next LTS, so you’re good for the long term.
