I often encounter a freeze of the graphical interface, but the system itself is not frozen.
More precisely : I still can use every program, and the mouse cursor is still movable, it even continues to adapt to the inputs (text pointer, mouse pointer, link pointer, etc). But everything else is frozen. I have to reboot every time it happens.
It also happens on live ISOs. It happened on EndeavourOS Live ISO, (using Xfce), and it still happens to my Manjaro install (using KDE)
When it happens, I have a short black screen, then the display goes back, frozen, with sometimes some minor artifacts glitches.
So it seems it comes from my GPU, when I checked the journalctl's logs:
janv. 10 10:33:13 guilhem-manjaro kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, signaled seq=1040827, emitted seq=1040829
janv. 10 10:33:13 guilhem-manjaro kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process Xorg pid 943 thread Xorg:cs0 pid 1028
janv. 10 10:33:13 guilhem-manjaro kernel: amdgpu 0000:09:00.0: amdgpu: GPU reset begin!
janv. 10 10:33:17 guilhem-manjaro kernel: amdgpu 0000:09:00.0: amdgpu: failed to suspend display audio
janv. 10 10:33:17 guilhem-manjaro kernel: amdgpu 0000:09:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring kiq_2.1.0 test failed (-110)
janv. 10 10:33:17 guilhem-manjaro kernel: [drm:gfx_v10_0_hw_fini [amdgpu]] *ERROR* KGQ disable failed
janv. 10 10:33:18 guilhem-manjaro kernel: amdgpu 0000:09:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring kiq_2.1.0 test failed (-110)
janv. 10 10:33:18 guilhem-manjaro kernel: [drm:gfx_v10_0_hw_fini [amdgpu]] *ERROR* KCQ disable failed
janv. 10 10:33:18 guilhem-manjaro kernel: [drm:gfx_v10_0_hw_fini [amdgpu]] *ERROR* failed to halt cp gfx
janv. 10 10:33:18 guilhem-manjaro kernel: [drm] free PSP TMR buffer
janv. 10 10:33:18 guilhem-manjaro kernel: amdgpu 0000:09:00.0: AMD-Vi: Event logged [IO_PAGE_FAULT domain=0x0013 address=0xf7d00200100 flags=0x0030]
janv. 10 10:33:18 guilhem-manjaro kernel: amdgpu 0000:09:00.0: amdgpu: MODE1 reset
janv. 10 10:33:18 guilhem-manjaro kernel: amdgpu 0000:09:00.0: amdgpu: GPU mode1 reset
janv. 10 10:33:18 guilhem-manjaro kernel: amdgpu 0000:09:00.0: amdgpu: GPU smu mode1 reset
janv. 10 10:33:18 guilhem-manjaro kernel: snd_hda_intel 0000:09:00.1: spurious response 0x0:0x0, last cmd=0x1f0500
janv. 10 10:33:18 guilhem-manjaro kernel: snd_hda_intel 0000:09:00.1: spurious response 0x0:0x0, last cmd=0x1f0500
janv. 10 10:33:18 guilhem-manjaro kernel: snd_hda_intel 0000:09:00.1: spurious response 0x0:0x0, last cmd=0x1f0500
janv. 10 10:33:18 guilhem-manjaro kernel: snd_hda_intel 0000:09:00.1: spurious response 0x0:0x0, last cmd=0x1f0500
janv. 10 10:33:18 guilhem-manjaro kernel: snd_hda_intel 0000:09:00.1: spurious response 0x0:0x0, last cmd=0x1f0500
janv. 10 10:33:18 guilhem-manjaro kernel: snd_hda_intel 0000:09:00.1: spurious response 0x0:0x0, last cmd=0x1f0500
janv. 10 10:33:18 guilhem-manjaro kernel: snd_hda_intel 0000:09:00.1: spurious response 0x0:0x0, last cmd=0x1f0500
janv. 10 10:33:18 guilhem-manjaro kernel: snd_hda_intel 0000:09:00.1: spurious response 0x0:0x0, last cmd=0x1f0500
janv. 10 10:33:19 guilhem-manjaro kernel: amdgpu 0000:09:00.0: amdgpu: GPU reset succeeded, trying to resume
janv. 10 10:33:19 guilhem-manjaro kernel: [drm] PCIE GART of 512M enabled (table at 0x00000080007E9000).
janv. 10 10:33:19 guilhem-manjaro kernel: [drm] VRAM is lost due to GPU reset!
janv. 10 10:33:19 guilhem-manjaro kernel: [drm] PSP is resuming...
janv. 10 10:33:19 guilhem-manjaro kernel: [drm] reserve 0xa00000 from 0x82fe000000 for PSP TMR
janv. 10 10:33:19 guilhem-manjaro kernel: amdgpu 0000:09:00.0: amdgpu: RAS: optional ras ta ucode is not available
janv. 10 10:33:19 guilhem-manjaro kernel: amdgpu 0000:09:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
janv. 10 10:33:19 guilhem-manjaro kernel: amdgpu 0000:09:00.0: amdgpu: SMU is resuming...
janv. 10 10:33:19 guilhem-manjaro kernel: amdgpu 0000:09:00.0: amdgpu: SMU is resumed successfully!
janv. 10 10:33:19 guilhem-manjaro kernel: [drm] DMUB hardware initialized: version=0x02020003
janv. 10 10:33:19 guilhem-manjaro kernel: [drm] kiq ring mec 2 pipe 1 q 0
janv. 10 10:33:19 guilhem-manjaro kernel: [drm] VCN decode and encode initialized successfully(under DPG Mode).
janv. 10 10:33:19 guilhem-manjaro kernel: [drm] JPEG decode initialized successfully.
janv. 10 10:33:19 guilhem-manjaro kernel: amdgpu 0000:09:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0
janv. 10 10:33:19 guilhem-manjaro kernel: amdgpu 0000:09:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0
janv. 10 10:33:19 guilhem-manjaro kernel: amdgpu 0000:09:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0
janv. 10 10:33:19 guilhem-manjaro kernel: amdgpu 0000:09:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 5 on hub 0
janv. 10 10:33:19 guilhem-manjaro kernel: amdgpu 0000:09:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 6 on hub 0
janv. 10 10:33:19 guilhem-manjaro kernel: amdgpu 0000:09:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 7 on hub 0
janv. 10 10:33:19 guilhem-manjaro kernel: amdgpu 0000:09:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 8 on hub 0
janv. 10 10:33:19 guilhem-manjaro kernel: amdgpu 0000:09:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 9 on hub 0
janv. 10 10:33:19 guilhem-manjaro kernel: amdgpu 0000:09:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 10 on hub 0
janv. 10 10:33:19 guilhem-manjaro kernel: amdgpu 0000:09:00.0: amdgpu: ring kiq_2.1.0 uses VM inv eng 11 on hub 0
janv. 10 10:33:19 guilhem-manjaro kernel: amdgpu 0000:09:00.0: amdgpu: ring sdma0 uses VM inv eng 12 on hub 0
janv. 10 10:33:19 guilhem-manjaro kernel: amdgpu 0000:09:00.0: amdgpu: ring sdma1 uses VM inv eng 13 on hub 0
janv. 10 10:33:19 guilhem-manjaro kernel: amdgpu 0000:09:00.0: amdgpu: ring vcn_dec_0 uses VM inv eng 0 on hub 1
janv. 10 10:33:19 guilhem-manjaro kernel: amdgpu 0000:09:00.0: amdgpu: ring vcn_enc_0.0 uses VM inv eng 1 on hub 1
janv. 10 10:33:19 guilhem-manjaro kernel: amdgpu 0000:09:00.0: amdgpu: ring vcn_enc_0.1 uses VM inv eng 4 on hub 1
janv. 10 10:33:19 guilhem-manjaro kernel: amdgpu 0000:09:00.0: amdgpu: ring jpeg_dec uses VM inv eng 5 on hub 1
janv. 10 10:33:19 guilhem-manjaro kernel: amdgpu 0000:09:00.0: amdgpu: recover vram bo from shadow start
janv. 10 10:33:19 guilhem-manjaro kernel: amdgpu 0000:09:00.0: amdgpu: recover vram bo from shadow done
janv. 10 10:33:19 guilhem-manjaro kernel: [drm] Skip scheduling IBs!
janv. 10 10:33:19 guilhem-manjaro kernel: [drm] Skip scheduling IBs!
janv. 10 10:33:19 guilhem-manjaro kernel: [drm] Skip scheduling IBs!
janv. 10 10:33:19 guilhem-manjaro kernel: [drm] Skip scheduling IBs!
janv. 10 10:33:19 guilhem-manjaro kernel: [drm] Skip scheduling IBs!
janv. 10 10:33:19 guilhem-manjaro kernel: amdgpu 0000:09:00.0: amdgpu: GPU reset(2) succeeded!
janv. 10 10:33:19 guilhem-manjaro kernel: [drm] Skip scheduling IBs!
janv. 10 10:33:19 guilhem-manjaro kernel: [drm] Skip scheduling IBs!
janv. 10 10:33:19 guilhem-manjaro kernel: [drm] Skip scheduling IBs!
janv. 10 10:33:19 guilhem-manjaro kernel: [drm] Skip scheduling IBs!
janv. 10 10:33:19 guilhem-manjaro kernel: [drm] Skip scheduling IBs!
janv. 10 10:33:19 guilhem-manjaro kernel: [drm] Skip scheduling IBs!
janv. 10 10:33:19 guilhem-manjaro kernel: [drm] Skip scheduling IBs!
janv. 10 10:33:19 guilhem-manjaro kernel: [drm] Skip scheduling IBs!
janv. 10 10:33:19 guilhem-manjaro kernel: [drm] Skip scheduling IBs!
janv. 10 10:33:19 guilhem-manjaro kernel: [drm] Skip scheduling IBs!
janv. 10 10:33:19 guilhem-manjaro kernel: [drm] Skip scheduling IBs!
janv. 10 10:33:19 guilhem-manjaro kernel: [drm] Skip scheduling IBs!
janv. 10 10:33:19 guilhem-manjaro kernel: [drm] Skip scheduling IBs!
janv. 10 10:33:19 guilhem-manjaro kernel: amdgpu_cs_ioctl: 42 callbacks suppressed
janv. 10 10:33:19 guilhem-manjaro kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
janv. 10 10:33:19 guilhem-manjaro kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
janv. 10 10:33:19 guilhem-manjaro kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
janv. 10 10:33:19 guilhem-manjaro kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
janv. 10 10:33:19 guilhem-manjaro kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
janv. 10 10:33:19 guilhem-manjaro kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
janv. 10 10:33:19 guilhem-manjaro kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
janv. 10 10:33:19 guilhem-manjaro kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
janv. 10 10:33:19 guilhem-manjaro kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
janv. 10 10:33:19 guilhem-manjaro kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
My GPU works fine on Windows 10, I’ve never encountered such freeze.
So I guess it comes from the drivers? Do you think I should try the AMDGPU-PRO drivers?