On both my notebook and my colleague’s, linux-firmware 20210818.c46b8c3-1 has run without any incidents on RavenRidge so far.
However, my desktop at home with Polaris still suffers from GPU crashes. I couldn’t even switch to a TTY, so I just powered the machine off. This was the second crash since that update and the first since I moved to kernel 5.13.
Here is the journalctl output:
-- Journal begins at Thu 2021-02-04 08:38:19 CET, ends at Fri 2021-09-10 20:58:57 CEST. --
Sep 10 20:46:32 ManjaroGamingPC kernel: [drm:amdgpu_dm_atomic_commit_tail [amdgpu]] *ERROR* Waiting for fences timed out!
Sep 10 20:46:32 ManjaroGamingPC kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, but soft recovered
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: GPU fault detected: 146 0x0352a004 for process kwin_x11 pid 1178 thread kwin_x11:cs0 pid 1197
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x0010366A
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x090A0004
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM fault (0x04, vmid 4, pasid 32771) at page 1062506, write from 'CB4' (0x43423400) (160)
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: GPU fault detected: 146 0x03122004 for process kwin_x11 pid 1178 thread kwin_x11:cs0 pid 1197
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00000000
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x090A0004
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM fault (0x04, vmid 4, pasid 32771) at page 0, write from 'CB4' (0x43423400) (160)
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: GPU fault detected: 146 0x03529004 for process kwin_x11 pid 1178 thread kwin_x11:cs0 pid 1197
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00104061
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x090D0010
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM fault (0x10, vmid 4, pasid 32771) at page 1065057, write from 'CB7' (0x43423700) (208)
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: GPU fault detected: 146 0x03129004 for process kwin_x11 pid 1178 thread kwin_x11:cs0 pid 1197
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00103640
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x09020010
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM fault (0x10, vmid 4, pasid 32771) at page 1062464, write from 'CB2' (0x43423200) (32)
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: GPU fault detected: 146 0x03522004 for process kwin_x11 pid 1178 thread kwin_x11:cs0 pid 1197
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x0010378C
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x09010010
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM fault (0x10, vmid 4, pasid 32771) at page 1062796, write from 'CB3' (0x43423300) (16)
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: GPU fault detected: 146 0x035a6004 for process kwin_x11 pid 1178 thread kwin_x11:cs0 pid 1197
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00104026
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x09020010
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM fault (0x10, vmid 4, pasid 32771) at page 1064998, write from 'CB2' (0x43423200) (32)
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: GPU fault detected: 146 0x031a6004 for process kwin_x11 pid 1178 thread kwin_x11:cs0 pid 1197
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00104051
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x090E0010
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM fault (0x10, vmid 4, pasid 32771) at page 1065041, write from 'CB6' (0x43423600) (224)
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: GPU fault detected: 146 0x031a5004 for process kwin_x11 pid 1178 thread kwin_x11:cs0 pid 1197
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00104183
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x090D0010
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM fault (0x10, vmid 4, pasid 32771) at page 1065347, write from 'CB7' (0x43423700) (208)
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: GPU fault detected: 146 0x031a2004 for process kwin_x11 pid 1178 thread kwin_x11:cs0 pid 1197
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00103605
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x090A0010
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM fault (0x10, vmid 4, pasid 32771) at page 1062405, write from 'CB4' (0x43423400) (160)
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: GPU fault detected: 146 0x035a2004 for process kwin_x11 pid 1178 thread kwin_x11:cs0 pid 1197
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00103728
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x090D0010
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM fault (0x10, vmid 4, pasid 32771) at page 1062696, write from 'CB7' (0x43423700) (208)
Sep 10 20:46:32 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: IH ring buffer overflow (0x0008E2E0, 0x00000F00, 0x0000E2F0)
Sep 10 20:46:42 ManjaroGamingPC kernel: [drm:amdgpu_dm_atomic_commit_tail [amdgpu]] *ERROR* Waiting for fences timed out!
Sep 10 20:46:42 ManjaroGamingPC kernel: gmc_v8_0_process_interrupt: 158 callbacks suppressed
Sep 10 20:46:42 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: GPU fault detected: 146 0x0a20480c for process firefox pid 1903 thread firefox:cs0 pid 1959
Sep 10 20:46:42 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00102944
Sep 10 20:46:42 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0604800C
Sep 10 20:46:42 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM fault (0x0c, vmid 3, pasid 32775) at page 1059140, read from 'TC4' (0x54433400) (72)
Sep 10 20:46:42 ManjaroGamingPC kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, but soft recovered
Sep 10 20:46:52 ManjaroGamingPC kernel: [drm:amdgpu_dm_atomic_commit_tail [amdgpu]] *ERROR* Waiting for fences timed out!
Sep 10 20:46:52 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: GPU fault detected: 147 0x0ff28802 for process firefox pid 1903 thread firefox:cs0 pid 1959
Sep 10 20:46:52 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x000013FE
Sep 10 20:46:52 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x07088002
Sep 10 20:46:52 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM fault (0x02, vmid 3, pasid 32775) at page 5118, write from 'TC6' (0x54433600) (136)
Sep 10 20:46:52 ManjaroGamingPC kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, but soft recovered
Sep 10 20:47:02 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: GPU fault detected: 147 0x08a24802 for process Xorg pid 989 thread Xorg:cs0 pid 1017
Sep 10 20:47:02 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00036714
Sep 10 20:47:02 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x03048002
Sep 10 20:47:02 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: VM fault (0x02, vmid 1, pasid 32769) at page 222996, write from 'TC4' (0x54433400) (72)
Sep 10 20:47:02 ManjaroGamingPC kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, but soft recovered
Sep 10 20:47:12 ManjaroGamingPC kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, signaled seq=856042, emitted seq=856045
Sep 10 20:47:12 ManjaroGamingPC kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process Xorg pid 989 thread Xorg:cs0 pid 1017
Sep 10 20:47:12 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: GPU reset begin!
Sep 10 20:47:16 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: failed to suspend display audio
Sep 10 20:47:17 ManjaroGamingPC kernel: amdgpu: cp is busy, skip halt cp
Sep 10 20:47:17 ManjaroGamingPC kernel: amdgpu: rlc is busy, skip halt rlc
Sep 10 20:47:17 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: BACO reset
Sep 10 20:47:17 ManjaroGamingPC kernel: amdgpu 0000:26:00.0: amdgpu: GPU reset succeeded, trying to resume
Sep 10 20:47:17 ManjaroGamingPC kernel: [drm] PCIE GART of 256M enabled (table at 0x000000F400300000).
Sep 10 20:47:17 ManjaroGamingPC kernel: [drm] VRAM is lost due to GPU reset!
Sep 10 20:47:19 ManjaroGamingPC kernel: [drm:uvd_v6_0_start [amdgpu]] *ERROR* UVD not responding, trying to reset the VCPU!!!
Sep 10 20:47:20 ManjaroGamingPC kernel: [drm:uvd_v6_0_start [amdgpu]] *ERROR* UVD not responding, trying to reset the VCPU!!!
Sep 10 20:47:21 ManjaroGamingPC kernel: [drm:uvd_v6_0_start [amdgpu]] *ERROR* UVD not responding, trying to reset the VCPU!!!
Sep 10 20:47:22 ManjaroGamingPC kernel: [drm:uvd_v6_0_start [amdgpu]] *ERROR* UVD not responding, trying to reset the VCPU!!!
Sep 10 20:47:23 ManjaroGamingPC kernel: [drm:uvd_v6_0_start [amdgpu]] *ERROR* UVD not responding, trying to reset the VCPU!!!
Sep 10 20:47:24 ManjaroGamingPC kernel: [drm:uvd_v6_0_start [amdgpu]] *ERROR* UVD not responding, trying to reset the VCPU!!!
Sep 10 20:47:25 ManjaroGamingPC kernel: [drm:uvd_v6_0_start [amdgpu]] *ERROR* UVD not responding, trying to reset the VCPU!!!
Sep 10 20:47:26 ManjaroGamingPC kernel: [drm:uvd_v6_0_start [amdgpu]] *ERROR* UVD not responding, trying to reset the VCPU!!!