Crashes under heavy load after switching from NVIDIA to AMD RX 6700 XT

Hi,

I recently switched from an NVIDIA GTX 1060 6GB to a Radeon RX 6700 XT, and since then my PC crashes under heavy load. MangoHud reports 70-80 °C for the CPU and about the same for the GPU (never above 85 °C). I am running a non-overclocked i5-9600KF with the aforementioned RX 6700 XT, powered by a 700 W PSU. The crash is instantaneous (no freeze or drop in performance beforehand), and sometimes it happens within 1 minute of load.
I checked journalctl, but honestly I have a hard time figuring out whether any of this is related to my problem.
journalctl -p err -b -1 output:

lut 28 18:54:10 pumpkinfield kernel: x86/cpu: SGX disabled by BIOS.
lut 28 18:54:10 pumpkinfield kernel: Spectre V2 : WARNING: Unprivileged eBPF is enabled with eIBRS on, data leaks possible via Spectre v2 BHB attacks!
lut 28 18:54:10 pumpkinfield kernel: ACPI BIOS Error (bug): Could not resolve symbol [\_SB.PR00._CPC], AE_NOT_FOUND (20210730/psargs-330)
lut 28 18:54:10 pumpkinfield kernel: ACPI Error: Aborting method \_SB.PR01._CPC due to previous error (AE_NOT_FOUND) (20210730/psparse-529)
lut 28 18:54:10 pumpkinfield kernel: ACPI BIOS Error (bug): Could not resolve symbol [\_SB.PR00._CPC], AE_NOT_FOUND (20210730/psargs-330)
lut 28 18:54:10 pumpkinfield kernel: ACPI Error: Aborting method \_SB.PR02._CPC due to previous error (AE_NOT_FOUND) (20210730/psparse-529)
lut 28 18:54:10 pumpkinfield kernel: ACPI BIOS Error (bug): Could not resolve symbol [\_SB.PR00._CPC], AE_NOT_FOUND (20210730/psargs-330)
lut 28 18:54:10 pumpkinfield kernel: ACPI Error: Aborting method \_SB.PR03._CPC due to previous error (AE_NOT_FOUND) (20210730/psparse-529)
lut 28 18:54:10 pumpkinfield kernel: ACPI BIOS Error (bug): Could not resolve symbol [\_SB.PR00._CPC], AE_NOT_FOUND (20210730/psargs-330)
lut 28 18:54:10 pumpkinfield kernel: ACPI Error: Aborting method \_SB.PR04._CPC due to previous error (AE_NOT_FOUND) (20210730/psparse-529)
lut 28 18:54:10 pumpkinfield kernel: ACPI BIOS Error (bug): Could not resolve symbol [\_SB.PR00._CPC], AE_NOT_FOUND (20210730/psargs-330)
lut 28 18:54:10 pumpkinfield kernel: ACPI Error: Aborting method \_SB.PR05._CPC due to previous error (AE_NOT_FOUND) (20210730/psparse-529)
lut 28 18:54:11 pumpkinfield kernel: NVRM: No NVIDIA GPU found.
lut 28 18:54:12 pumpkinfield kernel: NVRM: No NVIDIA GPU found.
lut 28 18:54:13 pumpkinfield kernel: NVRM: No NVIDIA GPU found.
lut 28 18:54:19 pumpkinfield gdm-password][889]: gkr-pam: unable to locate daemon control file
lut 28 18:54:22 pumpkinfield gdm-launch-environment][504]: GLib-GObject: g_object_unref: assertion 'G_IS_OBJECT (object)' failed
lut 28 18:54:47 pumpkinfield pulseaudio[1148]: GetManagedObjects() failed: org.freedesktop.DBus.Error.NoReply: Did not receive a reply. Possible causes include: the remote application did not send a reply, the > 
lut 28 19:09:45 pumpkinfield pulseaudio[1148]: ALSA woke us up to write new data to the device, but there was actually nothing to write.
lut 28 19:09:45 pumpkinfield pulseaudio[1148]: Most likely this is a bug in the ALSA driver 'snd_usb_audio'. Please report this issue to the ALSA developers.
lut 28 19:09:45 pumpkinfield pulseaudio[1148]: We were woken up with POLLOUT set -- however a subsequent snd_pcm_avail() returned 0 or another value < min_avail.
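
Since most of the entries above come from ACPI and PulseAudio, it may help to narrow the previous boot's kernel log to GPU-related lines only. The sketch below is one way to do that; the function name `filter_gpu_lines` and the pattern list are illustrative assumptions, not an exhaustive set of relevant keywords. Note that an instantaneous crash may leave no log entry at all.

```shell
#!/bin/sh
# Keep only lines that mention a GPU-related kernel component.
# The pattern list is an assumption chosen for illustration.
filter_gpu_lines() {
    # "|| true" keeps the exit status at 0 when nothing matches.
    grep -Ei 'amdgpu|drm|nvrm|nvidia' || true
}

# Example (kernel messages from the previous boot, all priorities):
#   journalctl -k -b -1 | filter_gpu_lines
```

Using `-k` without `-p err` also shows warnings, which the error-only view above would hide.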

mhwd -l && mhwd -li output:

> 0000:03:00.0 (0300:1002:73df) Display controller ATI Technologies Inc:
--------------------------------------------------------------------------------
                  NAME               VERSION          FREEDRIVER           TYPE
--------------------------------------------------------------------------------
           video-linux            2018.05.04                true            PCI
     video-modesetting            2020.01.13                true            PCI
            video-vesa            2017.03.12                true            PCI


> Installed PCI configs:
--------------------------------------------------------------------------------
                  NAME               VERSION          FREEDRIVER           TYPE
--------------------------------------------------------------------------------
           video-linux            2018.05.04                true            PCI


Warning: No installed USB configs!

inxi -G

Graphics:
  Device-1: AMD Navi 22 [Radeon RX 6700/6700 XT/6750 XT / 6800M/6850M XT]
    driver: amdgpu v: kernel
  Display: wayland server: X.org v: 1.21.1.7 with: Xwayland v: 22.1.8
    compositor: gnome-shell v: 43.3 driver: X: loaded: nvidia gpu: amdgpu
    resolution: 1: 1920x1080~60Hz 2: 1920x1080~60Hz
  API: OpenGL v: 4.6 Mesa 22.3.5 renderer: AMD Radeon RX 6700 XT (navi22
    LLVM 15.0.7 DRM 3.42 5.15.94-1-MANJARO)

It is worth mentioning that I removed the NVIDIA drivers and checked that nvidia-utils was gone as well.
Any suggestions would be highly appreciated.

Something looks wrong with your config; mine looks like this (working flawlessly):

Graphics:
  Device-1: AMD Navi 22 [Radeon RX 6700/6700 XT/6750 XT / 6800M/6850M XT]
    driver: amdgpu v: kernel
  Display: x11 server: X.Org v: 21.1.7 with: Xwayland v: 22.1.8 driver: X:
    loaded: amdgpu unloaded: modesetting,radeon dri: radeonsi gpu: amdgpu
    resolution: 2560x1440~60Hz
  API: OpenGL v: 4.6 Mesa 22.3.5 renderer: AMD Radeon RX 6700 XT (navi22
    LLVM 15.0.7 DRM 3.49 6.1.12-1-MANJARO)
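
One visible difference is that the first output reports `driver: X: loaded: nvidia`, and the journal also contains `NVRM: No NVIDIA GPU found.`, which may point to NVIDIA configuration or a kernel module left behind after the driver removal. A minimal sketch for locating such leftovers follows; the helper name `find_nvidia_refs` is hypothetical, and the directories in the usage comment are common locations, assumed rather than confirmed for this system.

```shell
#!/bin/sh
# List files under the given directories that mention "nvidia"
# (case-insensitive). Directories that do not exist are skipped.
find_nvidia_refs() {
    for dir in "$@"; do
        if [ -d "$dir" ]; then
            grep -rli 'nvidia' "$dir"
        fi
    done
    return 0
}

# Example (directories are assumptions about where leftovers may live):
#   find_nvidia_refs /etc/X11 /etc/modprobe.d /etc/modules-load.d
```

Any file it lists is only a candidate; check its contents before removing it.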

I cleared /var/log and now it looks like this:

inxi -G

Graphics:
  Device-1: AMD Navi 22 [Radeon RX 6700/6700 XT/6750 XT / 6800M/6850M XT]
    driver: amdgpu v: kernel
  Display: wayland server: X.org v: 1.21.1.7 with: Xwayland v: 22.1.8
    compositor: gnome-shell v: 43.3 driver: gpu: amdgpu resolution:
    1: 1920x1080~144Hz 2: 1920x1080~60Hz
  API: OpenGL v: 4.6 Mesa 22.3.5 renderer: AMD Radeon RX 6700 XT (navi22
    LLVM 15.0.7 DRM 3.42 5.15.94-1-MANJARO)

Try:

  1. Switching from Wayland to X
  2. Unplugging the secondary display
  3. Installing a newer kernel (6.1)
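
To confirm which session type is actually in use before and after the first suggestion, something like the following could be used. It is a sketch under assumptions: both function names are hypothetical, and `/etc/gdm/custom.conf` is the usual GDM configuration path on Arch-based systems, not one confirmed in this thread.

```shell
#!/bin/sh
# Print the current session type ("wayland", "x11", or "unknown"),
# as reported by the XDG_SESSION_TYPE environment variable.
session_type() {
    printf '%s\n' "${XDG_SESSION_TYPE:-unknown}"
}

# Succeed if the given GDM config file contains an uncommented
# "WaylandEnable=false" line, which tells GDM to use X only.
gdm_wayland_disabled() {
    grep -Eq '^[[:space:]]*WaylandEnable[[:space:]]*=[[:space:]]*false' "$1"
}

# Example:
#   session_type
#   gdm_wayland_disabled /etc/gdm/custom.conf && echo "GDM forces X"
#   uname -r    # running kernel, to compare against suggestion 3
```

On GNOME, the session can usually also be chosen per login from the gear menu on the GDM login screen, without editing any file.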

Switching from Wayland to X helped, thanks a lot.
