System frequently crashing after GPU driver update

Yes, now that I checked.

Apr 18 15:21:16 fakefred kernel: audit: type=1334 audit(1618730476.763:9): prog-id=5 op=LOAD
Apr 18 15:21:16 fakefred kernel: audit: type=1334 audit(1618730476.763:10): prog-id=6 op=LOAD
Apr 18 15:21:16 fakefred kernel: usb 3-2: New USB device found, idVendor=13d3, idProduct=56a6, bcdDevice=17.11
Apr 18 15:21:16 fakefred kernel: usb 3-2: New USB device strings: Mfr=3, Product=1, SerialNumber=2
Apr 18 15:21:16 fakefred kernel: usb 3-2: Product: Integrated Camera
Apr 18 15:21:16 fakefred kernel: usb 3-2: Manufacturer: Azurewave
Apr 18 15:21:16 fakefred kernel: usb 3-2: SerialNumber: 0001
Apr 18 15:21:16 fakefred kernel: acpi_cpufreq: overriding BIOS provided _PSD data
Apr 18 15:21:16 fakefred kernel: r8168: loading out-of-tree module taints kernel.
Apr 18 15:21:17 fakefred kernel: ACPI: Video Device [VGA] (multi-head: yes  rom: no  post: no)
Apr 18 15:21:17 fakefred kernel: acpi device:08: registered as cooling_device8
Apr 18 15:21:17 fakefred kernel: input: Video Bus as /devices/LNXSYSTM:00/LNXSYBUS:00/PNP0A08:00/device:07/LNXVIDEO:00/input/input6
Apr 18 15:21:17 fakefred kernel: r8168: module verification failed: signature and/or required key missing - tainting kernel
Apr 18 15:21:17 fakefred kernel: acpi PNP0C14:01: duplicate WMI GUID 05901221-D566-11D1-B2F0-00A0C9062910 (first instance was on PNP0C14:00)
Apr 18 15:21:17 fakefred kernel: acpi PNP0C14:02: duplicate WMI GUID 05901221-D566-11D1-B2F0-00A0C9062910 (first instance was on PNP0C14:00)
Apr 18 15:21:17 fakefred kernel: r8168 Gigabit Ethernet driver 8.048.03-NAPI loaded
Apr 18 15:21:17 fakefred kernel: piix4_smbus 0000:00:14.0: SMBus Host Controller at 0xb00, revision 0
Apr 18 15:21:17 fakefred kernel: piix4_smbus 0000:00:14.0: Using register 0x02 for SMBus port selection
Apr 18 15:21:17 fakefred kernel: piix4_smbus 0000:00:14.0: Auxiliary SMBus Host Controller at 0xb20
Apr 18 15:21:17 fakefred kernel: sp5100_tco: SP5100/SB800 TCO WatchDog Timer Driver
Apr 18 15:21:17 fakefred kernel: sp5100-tco sp5100-tco: Using 0xfeb00000 for watchdog MMIO address
Apr 18 15:21:17 fakefred kernel: sp5100-tco sp5100-tco: initialized. heartbeat=60 sec (nowayout=0)
Apr 18 15:21:17 fakefred kernel: r8168: This product is covered by one or more of the following patents: US6,570,884, US6,115,776, and US6,327,625.
Apr 18 15:21:17 fakefred kernel: r8168  Copyright (C) 2020  Realtek NIC software team <nicfae@realtek.com> 
                                  This program comes with ABSOLUTELY NO WARRANTY; for details, please see <http://www.gnu.org/licenses/>. 
                                  This is free software, and you are welcome to redistribute it under certain conditions; see <http://www.gnu.org/licenses/>. 
Apr 18 15:21:17 fakefred kernel: random: mktemp: uninitialized urandom read (6 bytes read)
Apr 18 15:21:17 fakefred kernel: Adding 9227464k swap on /dev/nvme0n1p3.  Priority:-2 extents:1 across:9227464k SSFS
Apr 18 15:21:17 fakefred kernel: ccp 0000:05:00.2: enabling device (0000 -> 0002)
Apr 18 15:21:17 fakefred kernel: thinkpad_acpi: ThinkPad ACPI Extras v0.26
Apr 18 15:21:17 fakefred kernel: thinkpad_acpi: http://ibm-acpi.sf.net/
Apr 18 15:21:17 fakefred kernel: thinkpad_acpi: ThinkPad BIOS R0UET65W (1.45 ), EC R0UHT65W
Apr 18 15:21:17 fakefred kernel: thinkpad_acpi: Lenovo ThinkPad E485, model 20KUA001CD
Apr 18 15:21:17 fakefred kernel: thinkpad_acpi: radio switch found; radios are enabled
Apr 18 15:21:17 fakefred kernel: thinkpad_acpi: This ThinkPad has standard ACPI backlight brightness control, supported by the ACPI video driver
Apr 18 15:21:17 fakefred kernel: thinkpad_acpi: Disabling thinkpad-acpi brightness events by default...
Apr 18 15:21:17 fakefred kernel: ccp 0000:05:00.2: ccp enabled
Apr 18 15:21:17 fakefred kernel: ccp 0000:05:00.2: psp: unable to access the device: you might be running a broken BIOS.
Apr 18 15:21:17 fakefred kernel: input: PC Speaker as /devices/platform/pcspkr/input/input8
Apr 18 15:21:17 fakefred kernel: RAPL PMU: API unit is 2^-32 Joules, 1 fixed counters, 163840 ms ovfl timer
Apr 18 15:21:17 fakefred kernel: RAPL PMU: hw unit of domain package 2^-16 Joules
Apr 18 15:21:17 fakefred kernel: cryptd: max_cpu_qlen set to 1000
Apr 18 15:21:17 fakefred kernel: cfg80211: Loading compiled-in X.509 certificates for regulatory database
Apr 18 15:21:17 fakefred kernel: cfg80211: Loaded X.509 cert 'sforshee: 00b28ddf47aef9cea7'
Apr 18 15:21:17 fakefred kernel: AVX2 version of gcm_enc/dec engaged.
Apr 18 15:21:17 fakefred kernel: AES CTR mode by8 optimization enabled
Apr 18 15:21:17 fakefred kernel: thinkpad_acpi: Standard ACPI backlight interface available, not loading native one
Apr 18 15:21:17 fakefred kernel: r8168 0000:02:00.0 enp2s0: renamed from eth0
Apr 18 15:21:17 fakefred kernel: thinkpad_acpi: battery 1 registered (start 95, stop 100)
Apr 18 15:21:17 fakefred kernel: battery: new extension: ThinkPad Battery Extension
Apr 18 15:21:17 fakefred kernel: input: ThinkPad Extra Buttons as /devices/platform/thinkpad_acpi/input/input7
Apr 18 15:21:17 fakefred kernel: kvm: Nested Virtualization enabled
Apr 18 15:21:17 fakefred kernel: SVM: kvm: Nested Paging enabled
Apr 18 15:21:17 fakefred kernel: SVM: Virtual VMLOAD VMSAVE supported
Apr 18 15:21:17 fakefred kernel: SVM: Virtual GIF supported
Apr 18 15:21:17 fakefred kernel: MCE: In-kernel MCE decoding enabled.
Apr 18 15:21:17 fakefred kernel: EDAC amd64: F17h_M10h detected (node 0).
Apr 18 15:21:17 fakefred kernel: EDAC amd64: Node 0: DRAM ECC disabled.
Apr 18 15:21:17 fakefred kernel: snd_hda_intel 0000:05:00.1: enabling device (0000 -> 0002)
Apr 18 15:21:17 fakefred kernel: snd_hda_intel 0000:05:00.1: Handle vga_switcheroo audio client
Apr 18 15:21:17 fakefred kernel: snd_hda_intel 0000:05:00.6: enabling device (0000 -> 0002)
Apr 18 15:21:17 fakefred kernel: EDAC amd64: F17h_M10h detected (node 0).
Apr 18 15:21:17 fakefred kernel: EDAC amd64: Node 0: DRAM ECC disabled.
Apr 18 15:21:17 fakefred kernel: input: HD-Audio Generic HDMI/DP,pcm=3 as /devices/pci0000:00/0000:00:08.1/0000:05:00.1/sound/card0/input10
Apr 18 15:21:17 fakefred kernel: input: HD-Audio Generic HDMI/DP,pcm=7 as /devices/pci0000:00/0000:00:08.1/0000:05:00.1/sound/card0/input11
Apr 18 15:21:17 fakefred kernel: input: HD-Audio Generic HDMI/DP,pcm=8 as /devices/pci0000:00/0000:00:08.1/0000:05:00.1/sound/card0/input12
Apr 18 15:21:17 fakefred kernel: snd_hda_codec_conexant hdaudioC1D0: CX20753/4: BIOS auto-probing.
Apr 18 15:21:17 fakefred kernel: snd_hda_codec_conexant hdaudioC1D0: autoconfig for CX20753/4: line_outs=1 (0x17/0x0/0x0/0x0/0x0) type:speaker
Apr 18 15:21:17 fakefred kernel: snd_hda_codec_conexant hdaudioC1D0:    speaker_outs=0 (0x0/0x0/0x0/0x0/0x0)
Apr 18 15:21:17 fakefred kernel: snd_hda_codec_conexant hdaudioC1D0:    hp_outs=1 (0x16/0x0/0x0/0x0/0x0)
Apr 18 15:21:17 fakefred kernel: snd_hda_codec_conexant hdaudioC1D0:    mono: mono_out=0x0
Apr 18 15:21:17 fakefred kernel: snd_hda_codec_conexant hdaudioC1D0:    inputs:
Apr 18 15:21:17 fakefred kernel: snd_hda_codec_conexant hdaudioC1D0:      Internal Mic=0x1a
Apr 18 15:21:17 fakefred kernel: snd_hda_codec_conexant hdaudioC1D0:      Mic=0x19
Apr 18 15:21:17 fakefred kernel: intel_rapl_common: Found RAPL domain package
Apr 18 15:21:17 fakefred kernel: intel_rapl_common: Found RAPL domain core
Apr 18 15:21:17 fakefred kernel: input: HD-Audio Generic Mic as /devices/pci0000:00/0000:00:08.1/0000:05:00.6/sound/card1/input13
Apr 18 15:21:17 fakefred kernel: input: HD-Audio Generic Headphone as /devices/pci0000:00/0000:00:08.1/0000:05:00.6/sound/card1/input14
Apr 18 15:21:17 fakefred kernel: EDAC amd64: F17h_M10h detected (node 0).
Apr 18 15:21:17 fakefred kernel: EDAC amd64: Node 0: DRAM ECC disabled.
Apr 18 15:21:17 fakefred kernel: ath10k_pci 0000:04:00.0: enabling device (0000 -> 0002)
Apr 18 15:21:17 fakefred kernel: ath10k_pci 0000:04:00.0: pci irq msi oper_irq_mode 2 irq_mode 0 reset_mode 0
Apr 18 15:21:17 fakefred kernel: [drm] amdgpu kernel modesetting enabled.
Apr 18 15:21:17 fakefred kernel: amdgpu: Topology: Add APU node [0x0:0x0]
Apr 18 15:21:17 fakefred kernel: checking generic (b0000000 7f0000) vs hw (b0000000 10000000)
Apr 18 15:21:17 fakefred kernel: fb0: switching to amdgpudrmfb from EFI VGA
Apr 18 15:21:17 fakefred kernel: Console: switching to colour dummy device 80x25
Apr 18 15:21:17 fakefred kernel: amdgpu 0000:05:00.0: vgaarb: deactivate vga console
Apr 18 15:21:17 fakefred kernel: amdgpu 0000:05:00.0: enabling device (0006 -> 0007)
Apr 18 15:21:17 fakefred kernel: [drm] initializing kernel modesetting (RAVEN 0x1002:0x15DD 0x17AA:0x506F 0xC4).
Apr 18 15:21:17 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: Trusted Memory Zone (TMZ) feature disabled as experimental (default)
Apr 18 15:21:17 fakefred kernel: [drm] register mmio base: 0xC0800000
Apr 18 15:21:17 fakefred kernel: [drm] register mmio size: 524288
Apr 18 15:21:17 fakefred kernel: [drm] add ip block number 0 <soc15_common>
Apr 18 15:21:17 fakefred kernel: [drm] add ip block number 1 <gmc_v9_0>
Apr 18 15:21:17 fakefred kernel: [drm] add ip block number 2 <vega10_ih>
Apr 18 15:21:17 fakefred kernel: [drm] add ip block number 3 <psp>
Apr 18 15:21:17 fakefred kernel: [drm] add ip block number 4 <gfx_v9_0>
Apr 18 15:21:17 fakefred kernel: [drm] add ip block number 5 <sdma_v4_0>
Apr 18 15:21:17 fakefred kernel: [drm] add ip block number 6 <powerplay>
Apr 18 15:21:17 fakefred kernel: [drm] add ip block number 7 <dm>
Apr 18 15:21:17 fakefred kernel: [drm] add ip block number 8 <vcn_v1_0>
Apr 18 15:21:17 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: Fetched VBIOS from VFCT
Apr 18 15:21:17 fakefred kernel: amdgpu: ATOM BIOS: 113-RAVEN-106
Apr 18 15:21:17 fakefred kernel: [drm] VCN decode is enabled in VM mode
Apr 18 15:21:17 fakefred kernel: [drm] VCN encode is enabled in VM mode
Apr 18 15:21:17 fakefred kernel: [drm] JPEG decode is enabled in VM mode
Apr 18 15:21:17 fakefred kernel: [drm] vm size is 262144 GB, 4 levels, block size is 9-bit, fragment size is 9-bit
Apr 18 15:21:17 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: VRAM: 256M 0x000000F400000000 - 0x000000F40FFFFFFF (256M used)
Apr 18 15:21:17 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: GART: 1024M 0x0000000000000000 - 0x000000003FFFFFFF
Apr 18 15:21:17 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: AGP: 267419648M 0x000000F800000000 - 0x0000FFFFFFFFFFFF
Apr 18 15:21:17 fakefred kernel: [drm] Detected VRAM RAM=256M, BAR=256M
Apr 18 15:21:17 fakefred kernel: [drm] RAM width 128bits DDR4
Apr 18 15:21:17 fakefred kernel: [TTM] Zone  kernel: Available graphics memory: 3880308 KiB
Apr 18 15:21:17 fakefred kernel: [TTM] Zone   dma32: Available graphics memory: 2097152 KiB
Apr 18 15:21:17 fakefred kernel: [drm] amdgpu: 256M of VRAM memory ready
Apr 18 15:21:17 fakefred kernel: [drm] amdgpu: 3072M of GTT memory ready.
Apr 18 15:21:17 fakefred kernel: [drm] GART: num cpu pages 262144, num gpu pages 262144
Apr 18 15:21:17 fakefred kernel: [drm] PCIE GART of 1024M enabled (table at 0x000000F400900000).
Apr 18 15:21:17 fakefred kernel: EDAC amd64: F17h_M10h detected (node 0).
Apr 18 15:21:17 fakefred kernel: EDAC amd64: Node 0: DRAM ECC disabled.
Apr 18 15:21:17 fakefred kernel: tun: Universal TUN/TAP device driver, 1.6
Apr 18 15:21:17 fakefred kernel: amdgpu: hwmgr_sw_init smu backed is smu10_smu
Apr 18 15:21:17 fakefred kernel: [drm] Found VCN firmware Version ENC: 1.12 DEC: 2 VEP: 0 Revision: 5
Apr 18 15:21:17 fakefred kernel: [drm] PSP loading VCN firmware
Apr 18 15:21:17 fakefred kernel: EDAC amd64: F17h_M10h detected (node 0).
Apr 18 15:21:17 fakefred kernel: EDAC amd64: Node 0: DRAM ECC disabled.
Apr 18 15:21:17 fakefred kernel: [drm] reserve 0x400000 from 0xf40fc00000 for PSP TMR
Apr 18 15:21:17 fakefred kernel: random: crng init done
Apr 18 15:21:17 fakefred kernel: random: 6 urandom warning(s) missed due to ratelimiting
Apr 18 15:21:17 fakefred kernel: EDAC amd64: F17h_M10h detected (node 0).
Apr 18 15:21:17 fakefred kernel: EDAC amd64: Node 0: DRAM ECC disabled.
Apr 18 15:21:17 fakefred kernel: pcieport 0000:00:01.6: AER: Multiple Corrected error received: 0000:00:01.0
Apr 18 15:21:17 fakefred kernel: pcieport 0000:00:01.6: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Receiver ID)
Apr 18 15:21:17 fakefred kernel: pcieport 0000:00:01.6:   device [1022:15d3] error status/mask=00000040/00006000
Apr 18 15:21:17 fakefred kernel: pcieport 0000:00:01.6:    [ 6] BadTLP                
Apr 18 15:21:17 fakefred kernel: enp2s0: 0xffffa2bac02c1000, e8:6a:64:4a:8d:7d, IRQ 64
Apr 18 15:21:17 fakefred kernel: ath10k_pci 0000:04:00.0: qca9377 hw1.1 target 0x05020001 chip_id 0x003821ff sub 17aa:0901
Apr 18 15:21:17 fakefred kernel: ath10k_pci 0000:04:00.0: kconfig debug 1 debugfs 1 tracing 1 dfs 0 testmode 0
Apr 18 15:21:17 fakefred kernel: ath10k_pci 0000:04:00.0: firmware ver WLAN.TF.2.1-00021-QCARMSWP-1 api 6 features wowlan,ignore-otp crc32 42e41877
Apr 18 15:21:18 fakefred kernel: ath10k_pci 0000:04:00.0: board_file api 2 bmi_id N/A crc32 8aedfa4a
Apr 18 15:21:18 fakefred kernel: psmouse serio1: synaptics: queried max coordinates: x [..5676], y [..4690]
Apr 18 15:21:18 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: RAS: optional ras ta ucode is not available
Apr 18 15:21:18 fakefred kernel: ath10k_pci 0000:04:00.0: htt-ver 3.56 wmi-op 4 htt-op 3 cal otp max-sta 32 raw 0 hwcrypto 1
Apr 18 15:21:18 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: RAP: optional rap ta ucode is not available
Apr 18 15:21:18 fakefred kernel: [drm] kiq ring mec 2 pipe 1 q 0
Apr 18 15:21:18 fakefred kernel: [drm] DM_PPLIB: values for F clock
Apr 18 15:21:18 fakefred kernel: [drm] DM_PPLIB:         400000 in kHz, 3174 in mV
Apr 18 15:21:18 fakefred kernel: [drm] DM_PPLIB:         933000 in kHz, 3724 in mV
Apr 18 15:21:18 fakefred kernel: [drm] DM_PPLIB:         1067000 in kHz, 3924 in mV
Apr 18 15:21:18 fakefred kernel: [drm] DM_PPLIB:         1200000 in kHz, 4074 in mV
Apr 18 15:21:18 fakefred kernel: [drm] DM_PPLIB: values for DCF clock
Apr 18 15:21:18 fakefred kernel: [drm] DM_PPLIB:         300000 in kHz, 3174 in mV
Apr 18 15:21:18 fakefred kernel: [drm] DM_PPLIB:         600000 in kHz, 3724 in mV
Apr 18 15:21:18 fakefred kernel: [drm] DM_PPLIB:         626000 in kHz, 3924 in mV
Apr 18 15:21:18 fakefred kernel: [drm] DM_PPLIB:         654000 in kHz, 4074 in mV
Apr 18 15:21:18 fakefred kernel: [drm] Display Core initialized with v3.2.116!
Apr 18 15:21:18 fakefred kernel: psmouse serio1: synaptics: queried min coordinates: x [1266..], y [1162..]
Apr 18 15:21:18 fakefred kernel: psmouse serio1: synaptics: Your touchpad (PNP: LEN2060 PNP0f13) says it can support a different bus. If i2c-hid and hid-rmi are not used, you might want to try setting psmouse.synaptics_intertouch to 1 and report this to linux-input@vger.kernel.org.
Apr 18 15:21:18 fakefred kernel: snd_hda_intel 0000:05:00.1: bound 0000:05:00.0 (ops amdgpu_dm_audio_component_bind_ops [amdgpu])
Apr 18 15:21:18 fakefred kernel: ath: EEPROM regdomain: 0x6c
Apr 18 15:21:18 fakefred kernel: ath: EEPROM indicates we should expect a direct regpair map
Apr 18 15:21:18 fakefred kernel: ath: Country alpha2 being used: 00
Apr 18 15:21:18 fakefred kernel: ath: Regpair used: 0x6c
Apr 18 15:21:18 fakefred kernel: psmouse serio1: synaptics: Touchpad model: 1, fw: 8.16, id: 0x1e2b1, caps: 0xf016a3/0x940300/0x12e800/0x400000, board id: 3383, fw id: 2731746
Apr 18 15:21:18 fakefred kernel: psmouse serio1: synaptics: serio: Synaptics pass-through port at isa0060/serio1/input0
Apr 18 15:21:18 fakefred kernel: [drm] VCN decode and encode initialized successfully(under SPG Mode).
Apr 18 15:21:18 fakefred kernel: EDAC amd64: F17h_M10h detected (node 0).
Apr 18 15:21:18 fakefred kernel: EDAC amd64: Node 0: DRAM ECC disabled.
Apr 18 15:21:18 fakefred kernel: kfd kfd: Allocated 3969056 bytes on gart
Apr 18 15:21:18 fakefred kernel: kfd kfd: error getting iommu info. is the iommu enabled?
Apr 18 15:21:18 fakefred kernel: kfd kfd: Error initializing iommuv2
Apr 18 15:21:18 fakefred kernel: kfd kfd: device 1002:15dd NOT added due to errors
Apr 18 15:21:18 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: SE 1, SH per SE 1, CU per SH 11, active_cu_number 8
Apr 18 15:21:18 fakefred kernel: [drm] fb mappable at 0xA0BCA000
Apr 18 15:21:18 fakefred kernel: [drm] vram apper at 0xA0000000
Apr 18 15:21:18 fakefred kernel: [drm] size 8294400
Apr 18 15:21:18 fakefred kernel: [drm] fb depth is 24
Apr 18 15:21:18 fakefred kernel: [drm]    pitch is 7680
Apr 18 15:21:18 fakefred kernel: fbcon: amdgpudrmfb (fb0) is primary device
Apr 18 15:21:18 fakefred kernel: ath10k_pci 0000:04:00.0 wlp4s0: renamed from wlan0
Apr 18 15:21:18 fakefred kernel: Console: switching to colour frame buffer device 240x67
Apr 18 15:21:18 fakefred kernel: EDAC amd64: F17h_M10h detected (node 0).
Apr 18 15:21:18 fakefred kernel: EDAC amd64: Node 0: DRAM ECC disabled.
Apr 18 15:21:18 fakefred kernel: pcieport 0000:00:01.6: AER: Corrected error received: 0000:00:01.0
Apr 18 15:21:18 fakefred kernel: pcieport 0000:00:01.6: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Receiver ID)
Apr 18 15:21:18 fakefred kernel: pcieport 0000:00:01.6:   device [1022:15d3] error status/mask=00000040/00006000
Apr 18 15:21:18 fakefred kernel: pcieport 0000:00:01.6:    [ 6] BadTLP                
Apr 18 15:21:18 fakefred kernel: mc: Linux media interface: v0.10
Apr 18 15:21:18 fakefred kernel: input: SynPS/2 Synaptics TouchPad as /devices/platform/i8042/serio1/input/input9
Apr 18 15:21:18 fakefred kernel: input: Lenovo Laser Wireless Mouse as /devices/pci0000:00/0000:00:08.1/0000:05:00.3/usb1/1-4/1-4:1.0/0003:17EF:6039.0001/input/input16
Apr 18 15:21:18 fakefred kernel: hid-generic 0003:17EF:6039.0001: input,hidraw0: USB HID v1.11 Mouse [Lenovo Laser Wireless Mouse] on usb-0000:05:00.3-4/input0
Apr 18 15:21:18 fakefred kernel: usbcore: registered new interface driver usbhid
Apr 18 15:21:18 fakefred kernel: usbhid: USB HID core driver
Apr 18 15:21:18 fakefred kernel: amdgpu 0000:05:00.0: [drm] fb0: amdgpudrmfb frame buffer device
Apr 18 15:21:18 fakefred kernel: mousedev: PS/2 mouse device common for all mice
Apr 18 15:21:18 fakefred kernel: pcieport 0000:00:01.6: AER: Multiple Corrected error received: 0000:00:01.0
Apr 18 15:21:18 fakefred kernel: pcieport 0000:00:01.6: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Receiver ID)
Apr 18 15:21:18 fakefred kernel: pcieport 0000:00:01.6:   device [1022:15d3] error status/mask=00000080/00006000
Apr 18 15:21:18 fakefred kernel: pcieport 0000:00:01.6:    [ 7] BadDLLP               
Apr 18 15:21:18 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: ring gfx uses VM inv eng 0 on hub 0
Apr 18 15:21:18 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0
Apr 18 15:21:18 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0
Apr 18 15:21:18 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 5 on hub 0
Apr 18 15:21:18 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 6 on hub 0
Apr 18 15:21:18 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 7 on hub 0
Apr 18 15:21:18 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 8 on hub 0
Apr 18 15:21:18 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 9 on hub 0
Apr 18 15:21:18 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 10 on hub 0
Apr 18 15:21:18 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: ring kiq_2.1.0 uses VM inv eng 11 on hub 0
Apr 18 15:21:18 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: ring sdma0 uses VM inv eng 0 on hub 1
Apr 18 15:21:18 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: ring vcn_dec uses VM inv eng 1 on hub 1
Apr 18 15:21:18 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: ring vcn_enc0 uses VM inv eng 4 on hub 1
Apr 18 15:21:18 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: ring vcn_enc1 uses VM inv eng 5 on hub 1
Apr 18 15:21:18 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: ring jpeg_dec uses VM inv eng 6 on hub 1
Apr 18 15:21:18 fakefred kernel: videodev: Linux video capture interface: v2.00
Apr 18 15:21:18 fakefred kernel: Bluetooth: Core ver 2.22
Apr 18 15:21:18 fakefred kernel: NET: Registered protocol family 31
Apr 18 15:21:18 fakefred kernel: Bluetooth: HCI device and connection manager initialized
Apr 18 15:21:18 fakefred kernel: Bluetooth: HCI socket layer initialized
Apr 18 15:21:18 fakefred kernel: Bluetooth: L2CAP socket layer initialized
Apr 18 15:21:18 fakefred kernel: Bluetooth: SCO socket layer initialized
Apr 18 15:21:18 fakefred kernel: [drm] Initialized amdgpu 3.40.0 20150101 for 0000:05:00.0 on minor 0
Apr 18 15:21:18 fakefred kernel: usbcore: registered new interface driver btusb
Apr 18 15:21:18 fakefred kernel: uvcvideo: Found UVC 1.00 device Integrated Camera (13d3:56a6)
Apr 18 15:21:18 fakefred kernel: input: Integrated Camera: Integrated C as /devices/pci0000:00/0000:00:08.1/0000:05:00.4/usb3/3-2/3-2:1.0/input/input17
Apr 18 15:21:18 fakefred kernel: usbcore: registered new interface driver uvcvideo
Apr 18 15:21:18 fakefred kernel: USB Video Class driver (1.1.1)
Apr 18 15:21:18 fakefred kernel: Bluetooth: BNEP (Ethernet Emulation) ver 1.3
Apr 18 15:21:18 fakefred kernel: Bluetooth: BNEP filters: protocol multicast
Apr 18 15:21:18 fakefred kernel: Bluetooth: BNEP socket layer initialized
Apr 18 15:21:18 fakefred kernel: NET: Registered protocol family 38
Apr 18 15:21:18 fakefred kernel: pcieport 0000:00:01.6: AER: Corrected error received: 0000:00:01.0
Apr 18 15:21:18 fakefred kernel: pcieport 0000:00:01.6: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Receiver ID)
Apr 18 15:21:18 fakefred kernel: pcieport 0000:00:01.6:   device [1022:15d3] error status/mask=00000040/00006000
Apr 18 15:21:18 fakefred kernel: pcieport 0000:00:01.6:    [ 6] BadTLP                
Apr 18 15:21:18 fakefred kernel: psmouse serio2: trackpoint: Elan TrackPoint firmware: 0x11, buttons: 3/3
Apr 18 15:21:19 fakefred kernel: input: TPPS/2 Elan TrackPoint as /devices/platform/i8042/serio1/serio2/input/input15
Apr 18 15:21:20 fakefred kernel: kauditd_printk_skb: 38 callbacks suppressed
Apr 18 15:21:20 fakefred kernel: audit: type=1100 audit(1618730480.463:49): pid=1170 uid=0 auid=4294967295 ses=4294967295 msg='op=PAM:authentication grantors=pam_permit acct="sddm" exe="/usr/lib/sddm/sddm-helper" hostname=? addr=? terminal=? res=success'
Apr 18 15:21:20 fakefred kernel: audit: type=1101 audit(1618730480.463:50): pid=1170 uid=0 auid=4294967295 ses=4294967295 msg='op=PAM:accounting grantors=pam_permit acct="sddm" exe="/usr/lib/sddm/sddm-helper" hostname=? addr=? terminal=? res=success'
Apr 18 15:21:20 fakefred kernel: audit: type=1103 audit(1618730480.463:51): pid=1170 uid=0 auid=4294967295 ses=4294967295 msg='op=PAM:setcred grantors=pam_permit acct="sddm" exe="/usr/lib/sddm/sddm-helper" hostname=? addr=? terminal=? res=success'
Apr 18 15:21:20 fakefred kernel: audit: type=1130 audit(1618730480.493:52): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=user-runtime-dir@973 comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
Apr 18 15:21:20 fakefred kernel: audit: type=1101 audit(1618730480.503:53): pid=1180 uid=0 auid=4294967295 ses=4294967295 msg='op=PAM:accounting grantors=pam_access,pam_unix,pam_permit,pam_time acct="sddm" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
Apr 18 15:21:20 fakefred kernel: audit: type=1103 audit(1618730480.503:54): pid=1180 uid=0 auid=4294967295 ses=4294967295 msg='op=PAM:setcred grantors=? acct="sddm" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=failed'
Apr 18 15:21:20 fakefred kernel: audit: type=1006 audit(1618730480.503:55): pid=1180 uid=0 old-auid=4294967295 auid=973 tty=(none) old-ses=4294967295 ses=1 res=1
Apr 18 15:21:20 fakefred kernel: audit: type=1300 audit(1618730480.503:55): arch=c000003e syscall=1 success=yes exit=3 a0=9 a1=7fff4cc0df90 a2=3 a3=3cd items=0 ppid=1 pid=1180 auid=973 uid=0 gid=0 euid=0 suid=0 fsuid=0 egid=0 sgid=0 fsgid=0 tty=(none) ses=1 comm="(systemd)" exe="/usr/lib/systemd/systemd" key=(null)
Apr 18 15:21:20 fakefred kernel: audit: type=1327 audit(1618730480.503:55): proctitle="(systemd)"
Apr 18 15:21:20 fakefred kernel: audit: type=1105 audit(1618730480.506:56): pid=1180 uid=0 auid=973 ses=1 msg='op=PAM:session_open grantors=pam_loginuid,pam_loginuid,pam_keyinit,pam_limits,pam_unix,pam_permit,pam_mail,pam_systemd,pam_env acct="sddm" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
Apr 18 15:21:25 fakefred kernel: kauditd_printk_skb: 30 callbacks suppressed
Apr 18 15:21:25 fakefred kernel: audit: type=1130 audit(1618730485.466:79): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=user@1000 comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
Apr 18 15:21:25 fakefred kernel: audit: type=1105 audit(1618730485.533:80): pid=1942 uid=0 auid=1000 ses=2 msg='op=PAM:session_open grantors=pam_loginuid,pam_keyinit,pam_limits,pam_unix,pam_permit,pam_mail,pam_systemd,pam_env,pam_kwallet5 acct="fakefred" exe="/usr/lib/sddm/sddm-helper" hostname=? addr=? terminal=:0 res=success'
Apr 18 15:21:27 fakefred kernel: audit: type=1130 audit(1618730487.679:81): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=rtkit-daemon comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
Apr 18 15:21:28 fakefred kernel: audit: type=1131 audit(1618730488.039:82): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=NetworkManager-dispatcher comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
Apr 18 15:21:28 fakefred kernel: Bluetooth: RFCOMM TTY layer initialized
Apr 18 15:21:28 fakefred kernel: Bluetooth: RFCOMM socket layer initialized
Apr 18 15:21:28 fakefred kernel: Bluetooth: RFCOMM ver 1.11
Apr 18 15:21:33 fakefred kernel: wlp4s0: authenticate with d8:32:14:ff:5e:61
Apr 18 15:21:33 fakefred kernel: wlp4s0: send auth to d8:32:14:ff:5e:61 (try 1/3)
Apr 18 15:21:33 fakefred kernel: wlp4s0: authenticated
Apr 18 15:21:33 fakefred kernel: wlp4s0: associate with d8:32:14:ff:5e:61 (try 1/3)
Apr 18 15:21:33 fakefred kernel: wlp4s0: RX AssocResp from d8:32:14:ff:5e:61 (capab=0x411 status=0 aid=5)
Apr 18 15:21:33 fakefred kernel: wlp4s0: associated
Apr 18 15:21:33 fakefred kernel: IPv6: ADDRCONF(NETDEV_CHANGE): wlp4s0: link becomes ready
Apr 18 15:21:33 fakefred kernel: audit: type=1130 audit(1618730493.953:83): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=NetworkManager-dispatcher comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
Apr 18 15:21:35 fakefred kernel: audit: type=1131 audit(1618730495.659:84): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=user@973 comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
Apr 18 15:21:35 fakefred kernel: audit: type=1131 audit(1618730495.679:85): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=user-runtime-dir@973 comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
Apr 18 15:21:44 fakefred kernel: audit: type=1131 audit(1618730504.036:86): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=NetworkManager-dispatcher comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
Apr 18 15:21:48 fakefred kernel: audit: type=1131 audit(1618730508.463:87): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=systemd-hostnamed comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
Apr 18 15:21:48 fakefred kernel: audit: type=1334 audit(1618730508.586:88): prog-id=10 op=UNLOAD
Apr 18 15:21:48 fakefred kernel: audit: type=1334 audit(1618730508.586:89): prog-id=9 op=UNLOAD
Apr 18 15:23:54 fakefred kernel: audit: type=1130 audit(1618730634.499:90): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=pamac-daemon comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
Apr 18 15:24:01 fakefred kernel: usb 1-2: new high-speed USB device number 3 using xhci_hcd
Apr 18 15:24:02 fakefred kernel: usb 1-2: New USB device found, idVendor=2717, idProduct=ff40, bcdDevice= 4.04
Apr 18 15:24:02 fakefred kernel: usb 1-2: New USB device strings: Mfr=1, Product=2, SerialNumber=3
Apr 18 15:24:02 fakefred kernel: usb 1-2: Product: MI 6
Apr 18 15:24:02 fakefred kernel: usb 1-2: Manufacturer: Xiaomi
Apr 18 15:24:02 fakefred kernel: usb 1-2: SerialNumber: 639335a0
Apr 18 15:24:02 fakefred kernel: usb 1-2: USB disconnect, device number 3
Apr 18 15:24:03 fakefred kernel: usb 1-2: new high-speed USB device number 4 using xhci_hcd
Apr 18 15:24:03 fakefred kernel: usb 1-2: New USB device found, idVendor=2717, idProduct=ff80, bcdDevice= 4.04
Apr 18 15:24:03 fakefred kernel: usb 1-2: New USB device strings: Mfr=1, Product=2, SerialNumber=3
Apr 18 15:24:03 fakefred kernel: usb 1-2: Product: MI 6
Apr 18 15:24:03 fakefred kernel: usb 1-2: Manufacturer: Xiaomi
Apr 18 15:24:03 fakefred kernel: usb 1-2: SerialNumber: 639335a0
Apr 18 15:24:04 fakefred kernel: usbcore: registered new interface driver cdc_ether
Apr 18 15:24:04 fakefred kernel: rndis_host 1-2:1.0 usb0: register 'rndis_host' at usb-0000:05:00.3-2, RNDIS device, ba:c3:34:09:2e:17
Apr 18 15:24:04 fakefred kernel: usbcore: registered new interface driver rndis_host
Apr 18 15:24:04 fakefred kernel: rndis_host 1-2:1.0 enp5s0f3u2: renamed from usb0
Apr 18 15:24:04 fakefred kernel: audit: type=1130 audit(1618730644.129:91): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=NetworkManager-dispatcher comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
Apr 18 15:24:10 fakefred kernel: audit: type=1131 audit(1618730650.328:92): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=pamac-daemon comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
Apr 18 15:24:14 fakefred kernel: audit: type=1131 audit(1618730654.042:93): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=NetworkManager-dispatcher comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
Apr 18 15:24:37 fakefred kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, but soft recovered
Apr 18 15:24:38 fakefred kernel: sched: RT throttling activated
Apr 18 15:24:38 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:4 pasid:32776, for process minetest pid 30628 thread minetest:cs0 pid 30652)
Apr 18 15:24:39 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at address 0x800108301000 from client 27
Apr 18 15:24:39 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00441051
Apr 18 15:24:39 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          Faulty UTCL2 client ID: TCP (0x8)
Apr 18 15:24:39 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          MORE_FAULTS: 0x1
Apr 18 15:24:39 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          WALKER_ERROR: 0x0
Apr 18 15:24:39 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          PERMISSION_FAULTS: 0x5
Apr 18 15:24:39 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          MAPPING_ERROR: 0x0
Apr 18 15:24:39 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          RW: 0x1
Apr 18 15:24:39 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:4 pasid:32776, for process minetest pid 30628 thread minetest:cs0 pid 30652)
Apr 18 15:24:40 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at address 0x800108300000 from client 27
Apr 18 15:24:40 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x004C0071
Apr 18 15:24:40 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          Faulty UTCL2 client ID: CB (0x0)
Apr 18 15:24:40 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          MORE_FAULTS: 0x1
Apr 18 15:24:40 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          WALKER_ERROR: 0x0
Apr 18 15:24:40 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          PERMISSION_FAULTS: 0x7
Apr 18 15:24:40 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          MAPPING_ERROR: 0x0
Apr 18 15:24:40 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          RW: 0x1
Apr 18 15:24:40 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:4 pasid:32776, for process minetest pid 30628 thread minetest:cs0 pid 30652)
Apr 18 15:24:40 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at address 0x800108305000 from client 27
Apr 18 15:24:40 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x004C0071
Apr 18 15:24:40 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          Faulty UTCL2 client ID: CB (0x0)
Apr 18 15:24:40 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          MORE_FAULTS: 0x1
Apr 18 15:24:40 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          WALKER_ERROR: 0x0
Apr 18 15:24:40 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          PERMISSION_FAULTS: 0x7
Apr 18 15:24:40 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          MAPPING_ERROR: 0x0
Apr 18 15:24:40 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          RW: 0x1
Apr 18 15:24:40 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:4 pasid:32776, for process minetest pid 30628 thread minetest:cs0 pid 30652)
Apr 18 15:24:41 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at address 0x800108304000 from client 27
Apr 18 15:24:41 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x004C0071
Apr 18 15:24:41 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          Faulty UTCL2 client ID: CB (0x0)
Apr 18 15:24:41 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          MORE_FAULTS: 0x1
Apr 18 15:24:41 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          WALKER_ERROR: 0x0
Apr 18 15:24:41 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          PERMISSION_FAULTS: 0x7
Apr 18 15:24:41 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          MAPPING_ERROR: 0x0
Apr 18 15:24:41 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          RW: 0x1
Apr 18 15:24:41 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:4 pasid:32776, for process minetest pid 30628 thread minetest:cs0 pid 30652)
Apr 18 15:24:41 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at address 0x800108307000 from client 27
Apr 18 15:24:42 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x004C0071
Apr 18 15:24:42 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          Faulty UTCL2 client ID: CB (0x0)
Apr 18 15:24:43 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          MORE_FAULTS: 0x1
Apr 18 15:24:43 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          WALKER_ERROR: 0x0
Apr 18 15:24:44 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          PERMISSION_FAULTS: 0x7
Apr 18 15:24:44 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          MAPPING_ERROR: 0x0
Apr 18 15:24:44 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          RW: 0x1
Apr 18 15:24:46 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:4 pasid:32776, for process minetest pid 30628 thread minetest:cs0 pid 30652)
Apr 18 15:24:46 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at address 0x800108303000 from client 27
Apr 18 15:24:46 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x004C0071
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          Faulty UTCL2 client ID: CB (0x0)
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          MORE_FAULTS: 0x1
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          WALKER_ERROR: 0x0
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          PERMISSION_FAULTS: 0x7
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          MAPPING_ERROR: 0x0
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          RW: 0x1
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:4 pasid:32776, for process minetest pid 30628 thread minetest:cs0 pid 30652)
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at address 0x800108306000 from client 27
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x004C0071
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          Faulty UTCL2 client ID: CB (0x0)
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          MORE_FAULTS: 0x1
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          WALKER_ERROR: 0x0
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          PERMISSION_FAULTS: 0x7
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          MAPPING_ERROR: 0x0
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          RW: 0x1
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:4 pasid:32776, for process minetest pid 30628 thread minetest:cs0 pid 30652)
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at address 0x800108302000 from client 27
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x004C0071
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          Faulty UTCL2 client ID: CB (0x0)
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          MORE_FAULTS: 0x1
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          WALKER_ERROR: 0x0
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          PERMISSION_FAULTS: 0x7
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          MAPPING_ERROR: 0x0
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          RW: 0x1
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:4 pasid:32776, for process minetest pid 30628 thread minetest:cs0 pid 30652)
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at address 0x800108309000 from client 27
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x004C0071
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          Faulty UTCL2 client ID: CB (0x0)
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          MORE_FAULTS: 0x1
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          WALKER_ERROR: 0x0
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          PERMISSION_FAULTS: 0x7
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          MAPPING_ERROR: 0x0
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          RW: 0x1
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:4 pasid:32776, for process minetest pid 30628 thread minetest:cs0 pid 30652)
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at address 0x800108308000 from client 27
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x004C0071
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          Faulty UTCL2 client ID: CB (0x0)
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          MORE_FAULTS: 0x1
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          WALKER_ERROR: 0x0
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          PERMISSION_FAULTS: 0x7
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          MAPPING_ERROR: 0x0
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          RW: 0x1
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:4 pasid:32776, for process minetest pid 30628 thread minetest:cs0 pid 30652)
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at address 0x80010830d000 from client 27
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x004C0071
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          Faulty UTCL2 client ID: CB (0x0)
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          MORE_FAULTS: 0x1
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          WALKER_ERROR: 0x0
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          PERMISSION_FAULTS: 0x7
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          MAPPING_ERROR: 0x0
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          RW: 0x1
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:4 pasid:32776, for process minetest pid 30628 thread minetest:cs0 pid 30652)
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at address 0x80010830c000 from client 27
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x004C0071
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          Faulty UTCL2 client ID: CB (0x0)
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          MORE_FAULTS: 0x1
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          WALKER_ERROR: 0x0
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          PERMISSION_FAULTS: 0x7
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          MAPPING_ERROR: 0x0
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          RW: 0x1
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:4 pasid:32776, for process minetest pid 30628 thread minetest:cs0 pid 30652)
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at address 0x80010830f000 from client 27
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x004C0071
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          Faulty UTCL2 client ID: CB (0x0)
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          MORE_FAULTS: 0x1
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          WALKER_ERROR: 0x0
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          PERMISSION_FAULTS: 0x7
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          MAPPING_ERROR: 0x0
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          RW: 0x1
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:4 pasid:32776, for process minetest pid 30628 thread minetest:cs0 pid 30652)
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at address 0x80010830b000 from client 27
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x004C0071
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          Faulty UTCL2 client ID: CB (0x0)
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          MORE_FAULTS: 0x1
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          WALKER_ERROR: 0x0
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          PERMISSION_FAULTS: 0x7
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          MAPPING_ERROR: 0x0
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          RW: 0x1
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:4 pasid:32776, for process minetest pid 30628 thread minetest:cs0 pid 30652)
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at address 0x80010830e000 from client 27
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x004C0071
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          Faulty UTCL2 client ID: CB (0x0)
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          MORE_FAULTS: 0x1
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          WALKER_ERROR: 0x0
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          PERMISSION_FAULTS: 0x7
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          MAPPING_ERROR: 0x0
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          RW: 0x1
Apr 18 15:24:47 fakefred kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, signaled seq=17440, emitted seq=17443
Apr 18 15:24:47 fakefred kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process minetest pid 30628 thread minetest:cs0 pid 30652
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: GPU reset begin!
Apr 18 15:24:47 fakefred kernel: [drm] free PSP TMR buffer
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: GPU reset succeeded, trying to resume
Apr 18 15:24:47 fakefred kernel: [drm] PCIE GART of 1024M enabled (table at 0x000000F400900000).
Apr 18 15:24:47 fakefred kernel: [drm] PSP is resuming...
Apr 18 15:24:47 fakefred kernel: [drm] reserve 0x400000 from 0xf40fc00000 for PSP TMR
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: RAS: optional ras ta ucode is not available
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: RAP: optional rap ta ucode is not available
Apr 18 15:24:48 fakefred kernel: [drm] kiq ring mec 2 pipe 1 q 0
Apr 18 15:24:48 fakefred kernel: amdgpu 0000:05:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring gfx test failed (-110)
Apr 18 15:24:48 fakefred kernel: [drm:amdgpu_device_ip_resume_phase2 [amdgpu]] *ERROR* resume of IP block <gfx_v9_0> failed -110
Apr 18 15:24:48 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: GPU reset(3) failed
Apr 18 15:24:48 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: GPU reset end with ret = -110
Apr 18 15:24:58 fakefred kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, but soft recovered
Apr 18 15:24:58 fakefred kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma0 timeout, signaled seq=1305, emitted seq=1307
Apr 18 15:24:58 fakefred kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process  pid 0 thread  pid 0
Apr 18 15:24:58 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: GPU reset begin!

Looks like there was a failed GPU reset at the end of the log?
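
To check that without scrolling through every page fault, one option is to filter the kernel log down to the timeout and reset messages. This is only a sketch: `reset_events` is a made-up helper name, and the patterns are just the message fragments visible in the log above.

```shell
# Hypothetical helper: keep only the amdgpu timeout/reset lines
# from a kernel log read on stdin.
reset_events() {
    grep -E 'ring [a-z0-9]+ timeout|ring [a-z0-9]+ test failed|GPU reset'
}

# Sample lines copied from the log above. On a real system one would
# pipe in `journalctl -k -b -1` (kernel messages of the previous boot).
reset_events <<'EOF'
Apr 18 15:24:39 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          RW: 0x1
Apr 18 15:24:47 fakefred kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, signaled seq=17440, emitted seq=17443
Apr 18 15:24:47 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: GPU reset begin!
Apr 18 15:24:48 fakefred kernel: amdgpu 0000:05:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring gfx test failed (-110)
Apr 18 15:24:48 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: GPU reset(3) failed
EOF
```

The `-110` in those lines is the kernel's ETIMEDOUT error code, so the gfx ring did not answer after the reset.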

Thanks!! I think I’ll give it a try and do the whole stable update. It’s called stable for a reason, I guess :sweat_smile: If it doesn’t work, I’ll downgrade the Mesa drivers again and start over.

Shoot. Yeah, it seems like there were some microcode errors anyway (amdgpu 0000:05:00.0: amdgpu: RAP: optional rap ta ucode is not available). Maybe using the latest Mesa driver with that kernel does the trick :thinking: Who knows. Another combination of package versions to try.

Yeah, downgrading Mesa and using the kernel parameter mentioned above (noretry) definitely did not do the trick in my case… let’s see what the next update brings…
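
A quick way to double-check that a parameter like that actually reached the kernel is to look for it in the boot command line. Rough sketch below: `cmdline_value` is a made-up helper, and both the `amdgpu.noretry` spelling and the value `0` in the sample string are assumptions, since the exact setting isn't quoted in this post.

```shell
# Hypothetical helper: print the value of a parameter found in a
# kernel command line string.
#   $1 = parameter name, $2 = command line (space-separated words)
cmdline_value() {
    for word in $2; do
        case "$word" in
            "$1"=*) printf '%s\n' "${word#*=}" ;;
        esac
    done
}

# Sample command line (made up for illustration).
sample='BOOT_IMAGE=/boot/vmlinuz-5.10-x86_64 quiet amdgpu.noretry=0'
cmdline_value amdgpu.noretry "$sample"
```

On a running system the second argument would be `"$(cat /proc/cmdline)"`; no output means the parameter was not passed at boot.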

@poynting_factor
please report if upgrading to 5.11 does anything… thanks

kernel: amdgpu 0000:09:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:1 pasid:32769, for process Xorg pid 1029 thread Xor>
kernel: amdgpu 0000:09:00.0: amdgpu:   in page starting at address 0x0000800110679000 from client 27
kernel: amdgpu 0000:09:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00101031
kernel: amdgpu 0000:09:00.0: amdgpu:          Faulty UTCL2 client ID: TCP (0x8)
kernel: amdgpu 0000:09:00.0: amdgpu:          MORE_FAULTS: 0x1
kernel: amdgpu 0000:09:00.0: amdgpu:          WALKER_ERROR: 0x0
kernel: amdgpu 0000:09:00.0: amdgpu:          PERMISSION_FAULTS: 0x3
kernel: amdgpu 0000:09:00.0: amdgpu:          MAPPING_ERROR: 0x0
kernel: amdgpu 0000:09:00.0: amdgpu:          RW: 0x0

Hey everybody! I held off on posting so I wouldn’t claim victory too soon, but on Sunday (04-18) evening/night I ran a system update (with yay -Syyu), which upgraded my Mesa drivers back to the latest ones (some of the dependencies got an even newer update, I’ll post that info below), and I also began using the latest 5.11 kernel. Since then, my machine has been running smoothly with no more freezes :pray:

The kernel version is the stable branch’s latest for 5.11: 5.11.14-1. I’m not 100% sure the kernel fixed it, but I’d bet on it after what @fkfd commented a few days ago (a recent commit in that kernel handled some page fault overflows, which were very likely happening during the reported freezes).

I’d appreciate it if anyone who had this issue tried upgrading drivers and kernel to the latest, so we can gather more evidence about a fixed system state. I won’t close this post or mark it as solved until this has run smoothly for a good while, but I’m really hoping it does :crossed_fingers: . Here are my kernel and driver versions:

$ uname -r
5.11.14-1-MANJARO

$ sudo pacman -Q --info mesa lib32-mesa lib32-libva-mesa-driver lib32-mesa-vdpau libva-mesa-driver mesa-vdpau  
Name            : mesa
Version         : 21.0.1-1.0
Description     : An open-source implementation of the OpenGL specification
Architecture    : x86_64
URL             : https://www.mesa3d.org/
Licenses        : custom
Groups          : None
Provides        : mesa-libgl  opengl-driver
Depends On      : libdrm  wayland  libxxf86vm  libxdamage  libxshmfence  libelf  libomxil-bellagio  libunwind  llvm-libs  lm_sensors  libglvnd  zstd  vulkan-icd-loader  libsensors.so=5-64  libexpat.so=1-64  libvulkan.so=1-64
Optional Deps   : opengl-man-pages: for the OpenGL API man pages
                  mesa-vdpau: for accelerated video playback [installed]
                  libva-mesa-driver: for accelerated video playback [installed]
Required By     : cogl  gst-plugins-base-libs  gtk3  lib32-mesa  libglvnd  mhwd  mpv  qt5-base  qt6-base  xf86-video-amdgpu  xf86-video-ati  xf86-video-intel  xf86-video-nouveau  zoom
Optional For    : None
Conflicts With  : mesa-libgl
Replaces        : mesa-libgl
Installed Size  : 90,73 MiB
Packager        : Philip Mueller <philm@manjaro.org>
Build Date      : lun 05 abr 2021 04:01:29
Install Date    : dom 18 abr 2021 18:12:11
Install Reason  : Installed as a dependency for another package
Install Script  : No
Validated By    : Signature

Name            : lib32-mesa
Version         : 21.0.2-1
Description     : An open-source implementation of the OpenGL specification (32-bit)
Architecture    : x86_64
URL             : https://www.mesa3d.org/
Licenses        : custom
Groups          : None
Provides        : lib32-mesa-libgl  lib32-opengl-driver
Depends On      : lib32-libdrm  lib32-wayland  lib32-libxxf86vm  lib32-libxdamage  lib32-libxshmfence  lib32-libelf  lib32-libunwind  lib32-llvm-libs  lib32-lm_sensors  lib32-libglvnd  lib32-zstd  lib32-vulkan-icd-loader  mesa
                  libsensors.so=5-32
Optional Deps   : opengl-man-pages: for the OpenGL API man pages
                  lib32-mesa-vdpau: for accelerated video playback [installed]
                  lib32-libva-mesa-driver: for accelerated video playback [installed]
Required By     : lib32-gtk3  lib32-libglvnd
Optional For    : mhwd
Conflicts With  : lib32-mesa-libgl
Replaces        : lib32-mesa-libgl
Installed Size  : 69,02 MiB
Packager        : Laurent Carlier <lordheavym@gmail.com>
Build Date      : jue 08 abr 2021 12:51:02
Install Date    : dom 18 abr 2021 18:12:26
Install Reason  : Installed as a dependency for another package
Install Script  : No
Validated By    : Signature

Name            : lib32-libva-mesa-driver
Version         : 21.0.2-1
Description     : VA-API implementation for gallium (32-bit)
Architecture    : x86_64
URL             : https://www.mesa3d.org/
Licenses        : custom
Groups          : None
Provides        : None
Depends On      : lib32-libdrm  lib32-libx11  lib32-llvm-libs  lib32-expat  lib32-libelf  lib32-libxshmfence  lib32-zstd
Optional Deps   : None
Required By     : None
Optional For    : lib32-mesa
Conflicts With  : None
Replaces        : None
Installed Size  : 9,58 MiB
Packager        : Laurent Carlier <lordheavym@gmail.com>
Build Date      : jue 08 abr 2021 12:51:02
Install Date    : dom 18 abr 2021 18:12:25
Install Reason  : Explicitly installed
Install Script  : No
Validated By    : Signature

Name            : lib32-mesa-vdpau
Version         : 21.0.2-1
Description     : Mesa VDPAU drivers (32-bit)
Architecture    : x86_64
URL             : https://www.mesa3d.org/
Licenses        : custom
Groups          : None
Provides        : None
Depends On      : lib32-libdrm  lib32-libx11  lib32-llvm-libs  lib32-expat  lib32-libelf  lib32-libxshmfence  lib32-zstd
Optional Deps   : None
Required By     : None
Optional For    : lib32-mesa
Conflicts With  : None
Replaces        : None
Installed Size  : 9,93 MiB
Packager        : Laurent Carlier <lordheavym@gmail.com>
Build Date      : jue 08 abr 2021 12:51:02
Install Date    : dom 18 abr 2021 18:12:26
Install Reason  : Explicitly installed
Install Script  : No
Validated By    : Signature

Name            : libva-mesa-driver
Version         : 21.0.1-1.0
Description     : VA-API implementation for gallium
Architecture    : x86_64
URL             : https://www.mesa3d.org/
Licenses        : custom
Groups          : None
Provides        : None
Depends On      : libdrm  libx11  llvm-libs  expat  libelf  libxshmfence  libexpat.so=1-64
Optional Deps   : None
Required By     : None
Optional For    : mesa
Conflicts With  : None
Replaces        : None
Installed Size  : 9,68 MiB
Packager        : Philip Mueller <philm@manjaro.org>
Build Date      : lun 05 abr 2021 04:01:29
Install Date    : dom 18 abr 2021 18:12:29
Install Reason  : Explicitly installed
Install Script  : No
Validated By    : Signature

Name            : mesa-vdpau
Version         : 21.0.1-1.0
Description     : Mesa VDPAU drivers
Architecture    : x86_64
URL             : https://www.mesa3d.org/
Licenses        : custom
Groups          : None
Provides        : None
Depends On      : libdrm  libx11  llvm-libs  expat  libelf  libxshmfence  libexpat.so=1-64
Optional Deps   : None
Required By     : None
Optional For    : mesa
Conflicts With  : None
Replaces        : None
Installed Size  : 10,01 MiB
Packager        : Philip Mueller <philm@manjaro.org>
Build Date      : lun 05 abr 2021 04:01:29
Install Date    : dom 18 abr 2021 18:12:32
Install Reason  : Explicitly installed
Install Script  : No
Validated By    : Signature
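
That listing is long, so here is a small sketch for boiling `pacman -Q --info` output down to one `name version` pair per package. `name_version` is a hypothetical helper; it only assumes the `Name : ...` / `Version : ...` layout shown above.

```shell
# Hypothetical helper: reduce `pacman -Q --info` output on stdin
# to "name version" lines.
name_version() {
    awk -F' *: ' '
        $1 == "Name"    { name = $2 }
        $1 == "Version" { print name, $2 }
    '
}

# Two entries trimmed from the listing above.
name_version <<'EOF'
Name            : mesa
Version         : 21.0.1-1.0
URL             : https://www.mesa3d.org/
Name            : lib32-mesa
Version         : 21.0.2-1
URL             : https://www.mesa3d.org/
EOF
```

Applied to the full listing, this makes it easy to see that the 64-bit packages are at 21.0.1-1.0 while the lib32 ones are at 21.0.2-1. Plain `pacman -Q mesa lib32-mesa` prints name and version directly as well.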

I’m on KDE Plasma 5.21.4 and can confirm the system has been crashing since I did an update a few weeks ago. dmesg has the same error message:

[ 1067.734921] amdgpu 0000:06:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:6 pasid:32779, for process Web Content pid 11147 thread firefox:cs0 pid 11584)
[ 1067.734930] amdgpu 0000:06:00.0: amdgpu:   in page starting at address 0x0000800109401000 from client 27
[ 1067.734934] amdgpu 0000:06:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00641051
[ 1067.734937] amdgpu 0000:06:00.0: amdgpu: 	 Faulty UTCL2 client ID: TCP (0x8)
[ 1067.734940] amdgpu 0000:06:00.0: amdgpu: 	 MORE_FAULTS: 0x1
[ 1067.734942] amdgpu 0000:06:00.0: amdgpu: 	 WALKER_ERROR: 0x0
[ 1067.734944] amdgpu 0000:06:00.0: amdgpu: 	 PERMISSION_FAULTS: 0x5
[ 1067.734947] amdgpu 0000:06:00.0: amdgpu: 	 MAPPING_ERROR: 0x0
[ 1067.734949] amdgpu 0000:06:00.0: amdgpu: 	 RW: 0x1
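
For comparing fault reports between machines, the raw `VM_L2_PROTECTION_FAULT_STATUS` word can be split into the fields the driver prints under it. The sketch below is not taken from the driver source: `decode_fault_status` is a made-up name and the bit positions are an assumption, chosen because they reproduce the decoded values (and the `vmid`) printed next to each status value in the logs in this thread.

```shell
# Hypothetical decoder for VM_L2_PROTECTION_FAULT_STATUS.
# Assumed bit layout (matches the decoded lines in the logs above):
#   bit 0      MORE_FAULTS
#   bits 1-3   WALKER_ERROR
#   bits 4-7   PERMISSION_FAULTS
#   bit 8      MAPPING_ERROR
#   bits 9-17  faulty UTCL2 client ID
#   bit 18     RW
#   bits 20-23 VMID
decode_fault_status() {
    status=$(( $1 ))
    printf 'MORE_FAULTS=0x%x\n'       $((  status        & 0x1 ))
    printf 'WALKER_ERROR=0x%x\n'      $(( (status >> 1)  & 0x7 ))
    printf 'PERMISSION_FAULTS=0x%x\n' $(( (status >> 4)  & 0xf ))
    printf 'MAPPING_ERROR=0x%x\n'     $(( (status >> 8)  & 0x1 ))
    printf 'CLIENT_ID=0x%x\n'         $(( (status >> 9)  & 0x1ff ))
    printf 'RW=0x%x\n'                $(( (status >> 18) & 0x1 ))
    printf 'VMID=%d\n'                $(( (status >> 20) & 0xf ))
}

# Status value from the dmesg excerpt above.
decode_fault_status 0x00641051
```

For `0x00641051` this gives PERMISSION_FAULTS 0x5, client ID 0x8 (TCP), RW 0x1 and VMID 6, the same as the decoded lines and the `vmid:6` in the excerpt.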

@fkfd is right, this commit should fix this error.

BUT this fix is NOT in the 5.11 release, as you can see if you check the tag v5.11 in the repo. It’s only included from tag v5.12-rc7 upwards, so you need to be running at least the experimental kernel 5.12-rc7 to have this fix.

I’ve been running kernel 5.12-rc7 for 5 hours and did a bit of load testing with 8 video sources playing at the same time, and the system crash doesn’t seem to occur anymore, so this is fixed!

For those wondering how to switch kernel:

  1. Open the Manjaro Kernel manager GUI (windows key → search “Kernel”)
  2. Click install for the experimental 5.12.rc Kernel
  3. Reboot your PC and, in GRUB, choose “Advanced Options for Manjaro” and pick Manjaro 5.12

Once booted, run uname -a to verify that the new Kernel is running.
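
If you'd rather script that check, here is a rough sketch that compares a kernel release string against a minimum major.minor version. `kernel_major_minor` and `kernel_at_least` are made-up helper names, and the comparison is deliberately coarse: it only looks at the first two numbers, so it cannot tell one 5.12 release candidate from another.

```shell
# Hypothetical helpers for a coarse kernel version check.

# "5.11.14-1-MANJARO" -> "5.11"
kernel_major_minor() {
    printf '%s\n' "$1" | sed -E 's/^([0-9]+)\.([0-9]+).*/\1.\2/'
}

# Succeeds if release $1 is at least major.minor version $2.
kernel_at_least() {
    have=$(kernel_major_minor "$1")
    want=$(kernel_major_minor "$2")
    # sort -V orders version strings numerically (5.9 before 5.12).
    lowest=$(printf '%s\n%s\n' "$have" "$want" | sort -V | head -n 1)
    [ "$lowest" = "$want" ]
}

# Release strings quoted in this thread, plus the running kernel.
for release in 5.10.30-1-MANJARO 5.11.14-1-MANJARO "$(uname -r)"; do
    if kernel_at_least "$release" 5.12; then
        echo "$release: 5.12 or newer"
    else
        echo "$release: older than 5.12"
    fi
done
```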

Although I’m using GNOME 40 as my desktop environment, I thought it might be good to add this data point:

I’m using an AMD APU (Ryzen 5 3400G) and experienced exactly the same amdgpu crashes after updating to kernel 5.10 at the beginning of April ’21.
After chasing some ghosts (potential hardware issues like faulty RAM, problems with GPU power management, etc.) I finally found this thread.

I can confirm that switching to kernel 5.12-rc7 solves the problem for me as well.

Just to confirm this: I am troubleshooting the same problem on a laptop.

inxi -FGz | sed -n "1p; 5p; 6,7p"
System:    Kernel: 5.10.30-1-MANJARO x86_64 bits: 64 Console: tty pts/2 Distro: Manjaro Linux 
CPU:       Info: Dual Core model: AMD Athlon Silver 3050U with Radeon Graphics bits: 64 type: MCP cache: L2: 1024 KiB 
           Speed: 1396 MHz min/max: 1400/2300 MHz Core speeds (MHz): 1: 1396 2: 1398 
Graphics:  Device-1: Advanced Micro Devices [AMD/ATI] Picasso driver: amdgpu v: kernel

Good callout there, bro! Yeah, after 5 days of believing my problems were totally solved with the 5.11.14 kernel, I experienced a light crash (KDE died and my session got logged out, but I could log back in easily), so I can’t say that was the fix.

I hope you’re right and the experimental kernel brings our fix :pray: Theory is on our side, since that commit looks like it fixes the errors we saw during the crashes. I don’t wanna claim victory early, since this issue has been one tough boss to fight, but I’m putting all my hope in this solution. I’ll try it in a couple of days.

Please keep us updated if you have any further experience with this! And thanks for your contribution :raised_hands:

@elektropepi @B007C0DE So I installed the experimental 5.12-rc7 kernel as you suggested, but this time I wasn’t even able to log in to my system :confused: After I entered my credentials, the screen went totally black and the system would not respond. Here’s what got logged in journalctl:

abr 24 16:04:07 e495 kwin_x11[1148]: Freeze in OpenGL initialization detected
abr 24 16:04:03 e495 kernel: amdgpu 0000:04:00.0: AMD-Vi: Event logged [IO_PAGE_FAULT domain=0x0000 address=0x11d3a6cc0 flags=0x0070]
abr 24 16:04:03 e495 kernel: amdgpu 0000:04:00.0: AMD-Vi: Event logged [IO_PAGE_FAULT domain=0x0000 address=0x11d3c0000 flags=0x0070]
abr 24 16:04:03 e495 kernel: amdgpu 0000:04:00.0: AMD-Vi: Event logged [IO_PAGE_FAULT domain=0x0000 address=0x11d3a6ca0 flags=0x0070]
abr 24 16:04:03 e495 kernel: amdgpu 0000:04:00.0: AMD-Vi: Event logged [IO_PAGE_FAULT domain=0x0000 address=0x11d3c0000 flags=0x0070]

Along with some of the typical page fault errors. Would you please tell me which version of the Mesa drivers you are using? Maybe there’s some incompatibility between the kernel and the driver itself. I’m using the latest version of the driver.

UPDATE: I tried booting into that kernel again, and this time I could do so. Idk what may have happened the first time, but I hope it was some really random and isolated error, and that 5.12 carries the real fix for this :pray:

Well, I’m afraid that was a bit premature.

Kernel 5.12-rc7 definitely reduces the crashes, but they are not completely gone. I just had a freeze with the amdgpu: [gfxhub0] error in the logs.

I just grepped through journalctl and I never had any of these amdgpu errors prior to the April update.
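
For anyone who wants to repeat that check, a rough sketch for counting the page fault messages per day follows. `faults_per_day` is a made-up helper; it assumes journalctl's default output format, where each line starts with month and day, and it can only see as far back as the journal is kept on disk.

```shell
# Hypothetical helper: count "retry page fault" kernel messages per day
# from journal text on stdin (default journalctl format: "Apr 18 15:24:38 ...").
faults_per_day() {
    awk '/retry page fault/ { n[$1 " " $2]++ }
         END { for (day in n) print day, n[day] }' | sort
}

# Sample lines copied from a log earlier in this thread. On a real system
# one would pipe in something like `journalctl -k --since 2021-03-01`.
faults_per_day <<'EOF'
Apr 18 15:24:38 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:4 pasid:32776, for process minetest pid 30628 thread minetest:cs0 pid 30652)
Apr 18 15:24:39 fakefred kernel: amdgpu 0000:05:00.0: amdgpu:          RW: 0x1
Apr 18 15:24:39 fakefred kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:4 pasid:32776, for process minetest pid 30628 thread minetest:cs0 pid 30652)
EOF
```

Days with no matching lines simply do not show up in the output.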

Seems to be more than just the kernel :frowning:

Yeah, I would dare to say that each kernel update gives us better results (5.11.14 made a huge improvement for me, and hopefully 5.12 does the same), but we’re still not there yet. I had never experienced these errors before either! All of this began with the April 9th update for me.

I wish a kernel or GPU driver dev would see this so we could get an authoritative opinion… Let’s keep posting future findings here! We’ll get through it :muscle:

Same here since the April 9 update:

5.11.14 is better: no freeze, crash, or black screen since Wednesday.

Update, April 25:
first black screen

15:57:42 kernel: AMD-Vi: Event logged [IO_PAGE_FAULT device=05:00.0 domain=0x0000 address=0x111d13280 flags=0x0070]
15:57:42 kernel: amdgpu 0000:05:00.0: AMD-Vi: Event logged [IO_PAGE_FAULT domain=0x0000 address=0x111d40000 flags=0x0070]
15:57:42 kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process  pid 0 thread  pid 0
15:57:32 kernel: [drm:amdgpu_device_ip_resume_phase2 [amdgpu]] *ERROR* resume of IP block <gfx_v9_0> failed -110
15:57:32 kernel: amdgpu 0000:05:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring gfx test failed (-110)
15:57:31 kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process Xwayland pid 1519 thread Xwayland:cs0 pid 1775
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 RW: 0x1
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 RW: 0x1
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 MAPPING_ERROR: 0x0
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 PERMISSION_FAULTS: 0x7
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 WALKER_ERROR: 0x0
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 MORE_FAULTS: 0x1
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 Faulty UTCL2 client ID: CB (0x0)
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x004C0071
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at address 0x8001030f2000 from client 27
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:4 pasid:32774, for process Xwayland pid 1519 thread Xwayland:cs0 pid 1775)
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 RW: 0x1
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 MAPPING_ERROR: 0x0
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 PERMISSION_FAULTS: 0x7
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 WALKER_ERROR: 0x0
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 MORE_FAULTS: 0x1
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 Faulty UTCL2 client ID: CB (0x0)
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x004C0071
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at address 0x8001030f0000 from client 27
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:4 pasid:32774, for process Xwayland pid 1519 thread Xwayland:cs0 pid 1775)
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 RW: 0x1
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 MAPPING_ERROR: 0x0
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 PERMISSION_FAULTS: 0x7
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 WALKER_ERROR: 0x0
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 MORE_FAULTS: 0x1
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 Faulty UTCL2 client ID: CB (0x0)
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x004C0071
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at address 0x8001030f2000 from client 27
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:4 pasid:32774, for process Xwayland pid 1519 thread Xwayland:cs0 pid 1775)
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 RW: 0x1
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 MAPPING_ERROR: 0x0
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 PERMISSION_FAULTS: 0x7
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 WALKER_ERROR: 0x0
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 MORE_FAULTS: 0x1
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 Faulty UTCL2 client ID: CB (0x0)
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x004C0071
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at address 0x8001030f0000 from client 27
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:4 pasid:32774, for process Xwayland pid 1519 thread Xwayland:cs0 pid 1775)
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 RW: 0x1
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 MAPPING_ERROR: 0x0
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 PERMISSION_FAULTS: 0x7
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 WALKER_ERROR: 0x0
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 MORE_FAULTS: 0x1
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 Faulty UTCL2 client ID: CB (0x0)
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x004C0071
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at address 0x8001030f2000 from client 27
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:4 pasid:32774, for process Xwayland pid 1519 thread Xwayland:cs0 pid 1775)
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 RW: 0x1
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 MAPPING_ERROR: 0x0
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 PERMISSION_FAULTS: 0x7
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 WALKER_ERROR: 0x0
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 MORE_FAULTS: 0x1
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 Faulty UTCL2 client ID: CB (0x0)
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x004C0071
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at address 0x8001030f0000 from client 27
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:4 pasid:32774, for process Xwayland pid 1519 thread Xwayland:cs0 pid 1775)
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 RW: 0x1
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 MAPPING_ERROR: 0x0
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 PERMISSION_FAULTS: 0x7
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 WALKER_ERROR: 0x0
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 MORE_FAULTS: 0x1
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 Faulty UTCL2 client ID: CB (0x0)
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x004C0071
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at address 0x8001030f2000 from client 27
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:4 pasid:32774, for process Xwayland pid 1519 thread Xwayland:cs0 pid 1775)
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 RW: 0x1
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 MAPPING_ERROR: 0x0
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 PERMISSION_FAULTS: 0x7
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 WALKER_ERROR: 0x0
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 MORE_FAULTS: 0x1
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 Faulty UTCL2 client ID: CB (0x0)
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x004C0071
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at address 0x8001030f0000 from client 27
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:4 pasid:32774, for process Xwayland pid 1519 thread Xwayland:cs0 pid 1775)
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 RW: 0x1
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 MAPPING_ERROR: 0x0
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 PERMISSION_FAULTS: 0x7
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 WALKER_ERROR: 0x0
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 MORE_FAULTS: 0x1
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 Faulty UTCL2 client ID: CB (0x0)
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x004C0071
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at address 0x8001030f2000 from client 27
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:4 pasid:32774, for process Xwayland pid 1519 thread Xwayland:cs0 pid 1775)
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 RW: 0x1
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 MAPPING_ERROR: 0x0
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 PERMISSION_FAULTS: 0x7
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 WALKER_ERROR: 0x0
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 MORE_FAULTS: 0x1
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: 	 Faulty UTCL2 client ID: CB (0x0)
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x004C0071
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at address 0x8001030f0000 from client 27
15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:4 pasid:32774, for process Xwayland pid 1519 thread Xwayland:cs0 pid 1775)
15:57:31 kernel: [drm:amdgpu_dm_atomic_commit_tail [amdgpu]] *ERROR* Waiting for fences timed out!
15:57:26 kernel: amdgpu 0000:05:00.0: amdgpu: 	 RW: 0x1
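
A wall of log like the one above is easier to read once it is boiled down to "which process faulted, and on which pages". Below is a minimal sketch in Python that does this for the amdgpu lines quoted in this thread. The function name `summarize` and the embedded `sample` list are made up for illustration, and the regular expressions only assume the message format visible in the paste.

```python
import re
from collections import Counter

# Header line of each fault report, in the format seen in the paste above.
FAULT_RE = re.compile(r"retry page fault \(.*?for process (\S+) pid (\d+)")
# Line that names the faulting page.
ADDR_RE = re.compile(r"in page starting at address (0x[0-9a-fA-F]+)")


def summarize(lines):
    """Count fault reports per (process, pid) and per page address."""
    by_process = Counter()
    by_address = Counter()
    for line in lines:
        match = FAULT_RE.search(line)
        if match:
            by_process[(match.group(1), int(match.group(2)))] += 1
            continue
        match = ADDR_RE.search(line)
        if match:
            by_address[match.group(1)] += 1
    return by_process, by_address


# A few lines copied from the log above, used here as sample input.
sample = [
    "15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault "
    "(src_id:0 ring:0 vmid:4 pasid:32774, for process Xwayland pid 1519 "
    "thread Xwayland:cs0 pid 1775)",
    "15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at "
    "address 0x8001030f0000 from client 27",
    "15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault "
    "(src_id:0 ring:0 vmid:4 pasid:32774, for process Xwayland pid 1519 "
    "thread Xwayland:cs0 pid 1775)",
    "15:57:31 kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at "
    "address 0x8001030f2000 from client 27",
]

by_process, by_address = summarize(sample)
for (name, pid), count in by_process.most_common():
    print(f"{name} (pid {pid}): {count} fault reports")
for address, count in by_address.most_common():
    print(f"{address}: {count} reports")
```

The same function should work on any list of lines, for example the saved output of `journalctl -k` for the boot that froze.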

OK, that’s shit. Since I switched kernels, I haven’t had the freeze once. Did you choose the correct kernel at startup? Maybe your GRUB defaults to an LTS kernel.

I’ll keep you guys posted if that error occurs again.

I’m on GNOME and was having the same issues. Trying this and crossing my fingers :wink:

Just updated the kernel to 5.12.rc7 (the latest available on Stable, for me).

Rebooted.

The system is finally responding as quickly as it used to, before I started experiencing many of the problems others have described above. For the past month or so it would crash, freeze, or run astonishingly sluggishly when it ran at all.

Let’s hope this newest kernel is the answer :slight_smile:

Not sure if this is related to my problem: recently the system freezes about 20 minutes after boot. For those first 20 minutes man.db hogs resources, at about 21% CPU time, alongside rsync at around 20%. After that, any app I run seems to freeze the entire system. Initially I thought it was Teams, but then I noticed it also happened with Thunderbird, Firefox, Chrome and VLC. The main journal reports an EXT4 error on my main Linux disk and insists that I run efsock -D (or something similar). Currently I am in Windows to grab the latest ISO to burn onto a USB so I can check and try to fix that. I can’t recall any graphics issues in the logs, though they may have been present. The freezes started with the last update, and I can’t even drop into a TTY. My setup also has an AMD GPU for graphics. Will update this post later.

No more disk errors in my logs. I ran the e2fsck command and used Disks to check the root drive from a live USB, and all seems fine: 2 hours in and no freezing so far.

Same problem with an AMD CPU and GPU; the 5.12 rc kernel does not help.

Ohhh, I’m sorry to hear that. I’d have thought that was a disk issue, but if your scan showed no faulty devices then that may not be it. Are you running the experimental kernel?

Same problem here, also using an AMD 3400G. Since an update a few days ago the system is unstable and will page fault and crash. Sometimes if I hit Ctrl-Alt-F1 it resets to the login screen, but sometimes it’s just a total hard freeze.
journalctl shows a long list of errors in red:

Apr 27 21:27:04 desktop kernel: amdgpu 0000:09:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:4 pasid:32778, for process brave pid 25334 thread brave:cs0 pid 25361)
Apr 27 21:27:04 desktop kernel: amdgpu 0000:09:00.0: amdgpu:   in page starting at address 0x0000800000579000 from client 27
Apr 27 21:27:04 desktop kernel: amdgpu 0000:09:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00400C31
Apr 27 21:27:04 desktop kernel: amdgpu 0000:09:00.0: amdgpu:          Faulty UTCL2 client ID: CPG (0x6)
Apr 27 21:27:04 desktop kernel: amdgpu 0000:09:00.0: amdgpu:          MORE_FAULTS: 0x1
Apr 27 21:27:04 desktop kernel: amdgpu 0000:09:00.0: amdgpu:          WALKER_ERROR: 0x0
Apr 27 21:27:04 desktop kernel: amdgpu 0000:09:00.0: amdgpu:          PERMISSION_FAULTS: 0x3
Apr 27 21:27:04 desktop kernel: amdgpu 0000:09:00.0: amdgpu:          MAPPING_ERROR: 0x0
Apr 27 21:27:04 desktop kernel: amdgpu 0000:09:00.0: amdgpu:          RW: 0x0

etc

I am not a power user or anything and I don’t know how to downgrade stuff; I just update everything every couple of days.
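
For anyone comparing these reports: the indented fields (MORE_FAULTS, PERMISSION_FAULTS, RW and so on) appear to be slices of the VM_L2_PROTECTION_FAULT_STATUS word printed next to them. Here is a rough Python sketch that splits the word back into those fields. The bit positions are an assumption (a gfx9-style layout) that has only been checked against the two status values quoted in this thread, and `decode_fault_status` is a made-up helper name, not anything from the driver.

```python
def decode_fault_status(status):
    """Split a VM_L2_PROTECTION_FAULT_STATUS word into the logged fields.

    The bit positions below are assumed (gfx9-style layout) and were only
    checked against the two status values quoted in this thread.
    """
    return {
        "MORE_FAULTS": status & 0x1,               # bit 0
        "WALKER_ERROR": (status >> 1) & 0x7,       # bits 1-3
        "PERMISSION_FAULTS": (status >> 4) & 0xF,  # bits 4-7
        "MAPPING_ERROR": (status >> 8) & 0x1,      # bit 8
        "CLIENT_ID": (status >> 9) & 0x1FF,        # bits 9-17
        "RW": (status >> 18) & 0x1,                # bit 18
        "VMID": (status >> 20) & 0xF,              # bits 20-23
    }


# The two status words that appear in the logs in this thread.
for raw in ("0x004C0071", "0x00400C31"):
    fields = decode_fault_status(int(raw, 16))
    print(raw, {name: hex(value) for name, value in fields.items()})
```

Under that assumed layout, both words decode to the same values the kernel printed: client ID 0x0 (CB) with PERMISSION_FAULTS 0x7 for the Xwayland faults, and client ID 0x6 (CPG) with PERMISSION_FAULTS 0x3 for the brave faults, both on vmid 4.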
