Full system freeze on new install after some time

Hello,

I am using Manjaro KDE and I get full system freeze a bit randomly.
It happens mainly in games (but also a few times on the desktop) and completely freezes the system: Image is frozen, sound loops, can’t switch to TTY and even the caps lock led is not reacting.

I am running manjaro on a brand new home built PC (my first so hardware issues are possible). As the crashes seemed to happen more often when I overclocked the RAM using XMP, I decided to swap it for another model. But even on the new model without any overclocking, the crashes still happen.

Since I changed the RAM I can only reproduce the crash on the game Cyberpunk 2077.

After one of those crashes, I ran journalctl --boot=-1 --catalog --no-pager and found the following log:

août 17 23:36:46 BigBender kernel: BUG: kernel NULL pointer dereference, address: 0000000000000000
août 17 23:36:46 BigBender kernel: #PF: supervisor instruction fetch in kernel mode
août 17 23:36:46 BigBender kernel: #PF: error_code(0x0010) - not-present page
août 17 23:36:46 BigBender kernel: PGD 0 P4D 0
août 17 23:36:46 BigBender kernel: Oops: 0010 [#1] PREEMPT SMP NOPTI
août 17 23:36:46 BigBender kernel: CPU: 15 PID: 978 Comm: kwin_wayland Not tainted 6.6.44-1-MANJARO #1 8598aea1d868f10d66f5d5ae2b57b59e735cd775
août 17 23:36:46 BigBender kernel: Hardware name: Micro-Star International Co., Ltd. MS-7C95/B550M PRO-VDH WIFI (MS-7C95), BIOS 2.L0 07/18/2024
août 17 23:36:46 BigBender kernel: RIP: 0010:0x0
août 17 23:36:46 BigBender kernel: Code: Unable to access opcode bytes at 0xffffffffffffffd6.
août 17 23:36:46 BigBender kernel: RSP: 0018:ffffc9000393bda0 EFLAGS: 00010246
août 17 23:36:46 BigBender kernel: RAX: 0000000000000000 RBX: 000000000000000e RCX: 0000000000000000
août 17 23:36:46 BigBender kernel: RDX: ffffffffffffffff RSI: 00000000000000e8 RDI: 0000000000000000
août 17 23:36:46 BigBender kernel: RBP: ffffc9000393bdd8 R08: 0000000000000000 R09: 0000000000000000
août 17 23:36:46 BigBender kernel: R10: 0000000000000000 R11: 0000000000000000 R12: 00007fff94cfcf00
août 17 23:36:46 BigBender kernel: R13: 0000000000000020 R14: 0000000000000000 R15: 0000000000000000
août 17 23:36:46 BigBender kernel: FS:  00007f6019ac4a00(0000) GS:ffff8887febc0000(0000) knlGS:0000000000000000
août 17 23:36:46 BigBender kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
août 17 23:36:46 BigBender kernel: CR2: ffffffffffffffd6 CR3: 000000010a846000 CR4: 0000000000f50ee0
août 17 23:36:46 BigBender kernel: PKRU: 55555554
août 17 23:36:46 BigBender kernel: Call Trace:
août 17 23:36:46 BigBender kernel:  <TASK>
août 17 23:36:46 BigBender kernel:  ? __die+0x23/0x70
août 17 23:36:46 BigBender kernel:  ? page_fault_oops+0x174/0x530
août 17 23:36:46 BigBender kernel:  ? exc_page_fault+0x7f/0x180
août 17 23:36:46 BigBender kernel:  ? asm_exc_page_fault+0x26/0x30
août 17 23:36:46 BigBender kernel:  ? do_syscall_64+0x5a/0x80
août 17 23:36:46 BigBender kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
août 17 23:36:46 BigBender kernel:  ? do_syscall_64+0x66/0x80
août 17 23:36:46 BigBender kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
août 17 23:36:46 BigBender kernel:  ? filp_flush+0x52/0x80
août 17 23:36:46 BigBender kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
août 17 23:36:46 BigBender kernel:  ? syscall_exit_to_user_mode+0x22/0x40
août 17 23:36:46 BigBender kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
août 17 23:36:46 BigBender kernel:  ? do_syscall_64+0x66/0x80
août 17 23:36:46 BigBender kernel:  ? syscall_exit_to_user_mode+0x22/0x40
août 17 23:36:46 BigBender kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
août 17 23:36:46 BigBender kernel:  ? do_syscall_64+0x66/0x80
août 17 23:36:46 BigBender kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
août 17 23:36:46 BigBender kernel:  ? do_syscall_64+0x66/0x80
août 17 23:36:46 BigBender kernel:  ? __irq_exit_rcu+0x4b/0xc0
août 17 23:36:46 BigBender kernel:  ? entry_SYSCALL_64_after_hwframe+0x78/0xe2
août 17 23:36:46 BigBender kernel:  </TASK>
août 17 23:36:46 BigBender kernel: Modules linked in: rfcomm snd_seq_dummy snd_hrtimer snd_seq qrtr cmac algif_hash algif_skcipher af_alg bnep intel_rapl_msr intel_rapl_common edac_mce_amd kvm_amd vfat fat mt7921e kvm mt7921_common mt792x_lib mt76_connac_lib irqbypass crct10dif_pclmul mt76 crc32_pclmul btusb polyval_clmulni btrtl polyval_generic snd_hda_codec_realtek gf128mul btintel ghash_clmulni_intel btbcm snd_hda_codec_generic sha512_ssse3 btmtk ledtrig_audio snd_hda_codec_hdmi snd_usb_audio mac80211 sha256_ssse3 snd_hda_intel sha1_ssse3 snd_usbmidi_lib snd_intel_dspcfg bluetooth aesni_intel snd_ump snd_intel_sdw_acpi snd_rawmidi crypto_simd snd_hda_codec libarc4 snd_seq_device cryptd snd_hda_core ecdh_generic mc r8169 rapl snd_hwdep cfg80211 snd_pcm wmi_bmof sp5100_tco realtek mdio_devres snd_timer pcspkr k10temp i2c_piix4 acpi_cpufreq ccp rfkill libphy snd soundcore mousedev joydev gpio_amdpt gpio_generic mac_hid i2c_dev crypto_user dm_mod fuse loop nfnetlink bpf_preload ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2
août 17 23:36:46 BigBender kernel:  hid_logitech_hidpp hid_logitech_dj usbhid amdgpu i2c_algo_bit drm_ttm_helper ttm video drm_exec drm_suballoc_helper amdxcp drm_buddy gpu_sched drm_display_helper nvme crc32c_intel nvme_core cec xhci_pci xhci_pci_renesas nvme_common wmi
août 17 23:36:46 BigBender kernel: CR2: 0000000000000000
août 17 23:36:46 BigBender kernel: ---[ end trace 0000000000000000 ]---
août 17 23:36:46 BigBender kernel: RIP: 0010:0x0
août 17 23:36:46 BigBender kernel: Code: Unable to access opcode bytes at 0xffffffffffffffd6.
août 17 23:36:46 BigBender kernel: RSP: 0018:ffffc9000393bda0 EFLAGS: 00010246
août 17 23:36:46 BigBender kernel: RAX: 0000000000000000 RBX: 000000000000000e RCX: 0000000000000000
août 17 23:36:46 BigBender kernel: RDX: ffffffffffffffff RSI: 00000000000000e8 RDI: 0000000000000000
août 17 23:36:46 BigBender kernel: RBP: ffffc9000393bdd8 R08: 0000000000000000 R09: 0000000000000000
août 17 23:36:46 BigBender kernel: R10: 0000000000000000 R11: 0000000000000000 R12: 00007fff94cfcf00
août 17 23:36:46 BigBender kernel: R13: 0000000000000020 R14: 0000000000000000 R15: 0000000000000000
août 17 23:36:46 BigBender kernel: FS:  00007f6019ac4a00(0000) GS:ffff8887febc0000(0000) knlGS:0000000000000000
août 17 23:36:46 BigBender kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
août 17 23:36:46 BigBender kernel: CR2: ffffffffffffffd6 CR3: 000000010a846000 CR4: 0000000000f50ee0
août 17 23:36:46 BigBender kernel: PKRU: 55555554
août 17 23:36:46 BigBender kernel: note: kwin_wayland[978] exited with irqs disabled
août 17 23:36:51 BigBender kwin_wayland[978]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
août 17 23:36:56 BigBender kwin_wayland[978]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
août 17 23:37:01 BigBender kwin_wayland[978]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
août 17 23:37:06 BigBender kwin_wayland[978]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug

If I understand correctly the log (Code: Unable to access opcode bytes at 0xffffffffffffffd6.), it seems to be an issue with the CPU right? Is there a defect on the CPU or is this something I can hope to fix using the right software patch?

Does anyone have tips to debug such issues?

I tried kernel 6.10 and 6.6 LTS, both have the issue. The bios is on the latest version released this month. The system is freshly installed and does not have much installed apart from steam and a few games.

Here is the output of inxi -zv8:

System:
  Kernel: 6.6.44-1-MANJARO arch: x86_64 bits: 64 compiler: gcc v: 14.1.1
    clocksource: tsc avail: hpet,acpi_pm
    parameters: BOOT_IMAGE=/boot/vmlinuz-6.6-x86_64
    root=UUID=98510aa7-65f3-452f-8e58-9d25ac4328dc rw quiet splash
    udev.log_priority=3
  Desktop: KDE Plasma v: 6.0.5 tk: Qt v: N/A info: frameworks v: 6.4.0
    wm: kwin_wayland vt: 1 dm: SDDM Distro: Manjaro base: Arch Linux
Machine:
  Type: Desktop System: Micro-Star product: MS-7C95 v: 1.0
    serial: <superuser required>
  Mobo: Micro-Star model: B550M PRO-VDH WIFI (MS-7C95) v: 1.0
    serial: <superuser required> UEFI: American Megatrends LLC. v: 2.L0
    date: 07/18/2024
Battery:
  ID-1: hidpp_battery_0 charge: 52% condition: N/A volts: 3.8 min: N/A
    model: Logitech G502 LIGHTSPEED Wireless Gaming Mouse type: N/A
    serial: <filter> status: discharging
Memory:
  System RAM: total: 32 GiB available: 31.27 GiB used: 3.52 GiB (11.3%)
  Message: For most reliable report, use superuser + dmidecode.
  Array-1: capacity: 128 GiB slots: 4 modules: 2 EC: None
    max-module-size: 32 GiB note: est.
  Device-1: Channel-A DIMM 0 type: no module installed
  Device-2: Channel-A DIMM 1 type: DDR4 detail: synchronous unbuffered
    (unregistered) size: 16 GiB speed: 2133 MT/s volts: note: check curr: 1
    min: 1 max: 1 width (bits): data: 64 total: 64 manufacturer: Corsair
    part-no: CMG32GX4M2E3200C16 serial: N/A
  Device-3: Channel-B DIMM 0 type: no module installed
  Device-4: Channel-B DIMM 1 type: DDR4 detail: synchronous unbuffered
    (unregistered) size: 16 GiB speed: 2133 MT/s volts: note: check curr: 1
    min: 1 max: 1 width (bits): data: 64 total: 64 manufacturer: Corsair
    part-no: CMG32GX4M2E3200C16 serial: N/A
PCI Slots:
  Permissions: Unable to run dmidecode. Root privileges required.
CPU:
  Info: model: AMD Ryzen 7 5800X bits: 64 type: MT MCP arch: Zen 3+ gen: 4
    level: v3 note: check built: 2022 process: TSMC n6 (7nm) family: 0x19 (25)
    model-id: 0x21 (33) stepping: 2 microcode: 0xA20120E
  Topology: cpus: 1x cores: 8 tpc: 2 threads: 16 smt: enabled cache:
    L1: 512 KiB desc: d-8x32 KiB; i-8x32 KiB L2: 4 MiB desc: 8x512 KiB
    L3: 32 MiB desc: 1x32 MiB
  Speed (MHz): avg: 2749 high: 3800 min/max: 2200/4850 boost: enabled
    scaling: driver: acpi-cpufreq governor: schedutil cores: 1: 3598 2: 3593
    3: 2200 4: 2200 5: 2200 6: 2200 7: 3597 8: 2200 9: 2200 10: 3800 11: 2200
    12: 2200 13: 2200 14: 2200 15: 3800 16: 3601 bogomips: 121635
  Flags: 3dnowprefetch abm adx aes aperfmperf apic arat avic avx avx2 bmi1
    bmi2 bpext cat_l3 cdp_l3 clflush clflushopt clwb clzero cmov cmp_legacy
    constant_tsc cpb cpuid cqm cqm_llc cqm_mbm_local cqm_mbm_total
    cqm_occup_llc cr8_legacy cx16 cx8 de debug_swap decodeassists erms
    extapic extd_apicid f16c flushbyasid fma fpu fsgsbase fsrm fxsr fxsr_opt
    ht hw_pstate ibpb ibrs ibs invpcid irperf lahf_lm lbrv lm mba mca mce
    misalignsse mmx mmxext monitor movbe msr mtrr mwaitx nonstop_tsc nopl npt
    nrip_save nx ospke osvw overflow_recov pae pat pausefilter pclmulqdq
    pdpe1gb perfctr_core perfctr_llc perfctr_nb pfthreshold pge pku pni
    popcnt pse pse36 rapl rdpid rdpru rdrand rdseed rdt_a rdtscp rep_good sep
    sha_ni skinit smap smca smep ssbd sse sse2 sse4_1 sse4_2 sse4a ssse3
    stibp succor svm svm_lock syscall tce topoext tsc tsc_scale umip
    user_shstk v_spec_ctrl v_vmsave_vmload vaes vgif vmcb_clean vme vmmcall
    vpclmulqdq wbnoinvd wdt x2apic xgetbv1 xsave xsavec xsaveerptr xsaveopt
    xsaves
  Vulnerabilities:
  Type: gather_data_sampling status: Not affected
  Type: itlb_multihit status: Not affected
  Type: l1tf status: Not affected
  Type: mds status: Not affected
  Type: meltdown status: Not affected
  Type: mmio_stale_data status: Not affected
  Type: reg_file_data_sampling status: Not affected
  Type: retbleed status: Not affected
  Type: spec_rstack_overflow mitigation: Safe RET
  Type: spec_store_bypass mitigation: Speculative Store Bypass disabled via
    prctl
  Type: spectre_v1 mitigation: usercopy/swapgs barriers and __user pointer
    sanitization
  Type: spectre_v2 mitigation: Retpolines; IBPB: conditional; IBRS_FW;
    STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not
    affected
  Type: srbds status: Not affected
  Type: tsx_async_abort status: Not affected
Graphics:
  Device-1: AMD Navi 22 [Radeon RX 6700/6700 XT/6750 XT / 6800M/6850M XT]
    vendor: ASRock driver: amdgpu v: kernel arch: RDNA-2 code: Navi-2x
    process: TSMC n7 (7nm) built: 2020-22 pcie: gen: 4 speed: 16 GT/s
    lanes: 16 ports: active: DP-3 empty: DP-1,DP-2,HDMI-A-1 bus-ID: 2d:00.0
    chip-ID: 1002:73df class-ID: 0300
  Display: wayland server: X.org v: 1.21.1.13 with: Xwayland v: 24.1.1
    compositor: kwin_wayland driver: X: loaded: amdgpu
    unloaded: modesetting,radeon alternate: fbdev,vesa dri: radeonsi
    gpu: amdgpu display-ID: 0
  Monitor-1: DP-3 res: 1920x1080 size: N/A modes: N/A
  API: EGL v: 1.5 hw: drv: amd radeonsi platforms: device: 0 drv: radeonsi
    device: 1 drv: swrast gbm: drv: kms_swrast surfaceless: drv: radeonsi
    wayland: drv: radeonsi x11: drv: radeonsi
  API: OpenGL v: 4.6 compat-v: 4.5 vendor: amd mesa v: 24.1.5-manjaro1.1
    glx-v: 1.4 direct-render: yes renderer: AMD Radeon RX 6700 XT (radeonsi
    navi22 LLVM 18.1.8 DRM 3.54 6.6.44-1-MANJARO) device-ID: 1002:73df
    memory: 11.72 GiB unified: no display-ID: :1.0
  API: Vulkan v: 1.3.279 layers: 7 device: 0 type: discrete-gpu name: AMD
    Radeon RX 6700 XT (RADV NAVI22) driver: mesa radv v: 24.1.5-manjaro1.1
    device-ID: 1002:73df surfaces: xcb,xlib,wayland
Audio:
  Device-1: AMD Navi 21/23 HDMI/DP Audio driver: snd_hda_intel v: kernel pcie:
    gen: 4 speed: 16 GT/s lanes: 16 bus-ID: 2d:00.1 chip-ID: 1002:ab28
    class-ID: 0403
  Device-2: AMD Starship/Matisse HD Audio vendor: Micro-Star MSI
    driver: snd_hda_intel v: kernel pcie: gen: 4 speed: 16 GT/s lanes: 16
    bus-ID: 2f:00.4 chip-ID: 1022:1487 class-ID: 0403
  Device-3: Audio-Technica ATR2100x-USB Microphone
    driver: hid-generic,snd-usb-audio,usbhid type: USB rev: 2.0 speed: 480 Mb/s
    lanes: 1 mode: 2.0 bus-ID: 1-3:2 chip-ID: 0909:004d class-ID: 0300
  API: ALSA v: k6.6.44-1-MANJARO status: kernel-api with: aoss
    type: oss-emulator tools: alsactl,alsamixer,amixer
  Server-1: JACK v: 1.9.22 status: off tools: N/A
  Server-2: PipeWire v: 1.2.2 status: active with: 1: pipewire-pulse
    status: active 2: wireplumber status: active 3: pipewire-alsa type: plugin
    tools: pactl,pw-cat,pw-cli,wpctl
Network:
  Device-1: MEDIATEK MT7921K Wi-Fi 6E 80MHz driver: mt7921e v: kernel pcie:
    gen: 2 speed: 5 GT/s lanes: 1 bus-ID: 29:00.0 chip-ID: 14c3:0608
    class-ID: 0280
  IF: wlo1 state: down mac: <filter>
  Device-2: Realtek RTL8111/8168/8211/8411 PCI Express Gigabit Ethernet
    vendor: Micro-Star MSI driver: r8169 v: kernel pcie: gen: 1 speed: 2.5 GT/s
    lanes: 1 port: f000 bus-ID: 2a:00.0 chip-ID: 10ec:8168 class-ID: 0200
  IF: enp42s0 state: up speed: 1000 Mbps duplex: full mac: <filter>
  IP v4: <filter> type: dynamic noprefixroute scope: global
    broadcast: <filter>
  IP v6: <filter> type: dynamic noprefixroute scope: global
  IP v6: <filter> type: noprefixroute scope: link
  Info: services: NetworkManager, systemd-timesyncd, wpa_supplicant
  WAN IP: <filter>
Bluetooth:
  Device-1: MediaTek Wireless_Device driver: btusb v: 0.8 type: USB rev: 2.1
    speed: 480 Mb/s lanes: 1 mode: 2.0 bus-ID: 1-9:5 chip-ID: 0e8d:0608
    class-ID: e001 serial: <filter>
  Report: rfkill ID: hci0 rfk-id: 0 state: up address: see --recommends
Logical:
  Message: No logical block device data found.
RAID:
  Message: No RAID data found.
Drives:
  Local Storage: total: 1.82 TiB used: 849.73 GiB (45.6%)
  SMART Message: Required tool smartctl not installed. Check --recommends
  ID-1: /dev/nvme0n1 maj-min: 259:0 vendor: Kingston model: SNV2S2000G
    size: 1.82 TiB block-size: physical: 512 B logical: 512 B speed: 63.2 Gb/s
    lanes: 4 tech: SSD serial: <filter> fw-rev: SBM02103 temp: 38.9 C
    scheme: GPT
  Message: No optical or floppy data found.
Partition:
  ID-1: / raw-size: 195.31 GiB size: 191.19 GiB (97.89%)
    used: 23.01 GiB (12.0%) fs: ext4 dev: /dev/nvme0n1p1 maj-min: 259:1
    label: N/A uuid: 98510aa7-65f3-452f-8e58-9d25ac4328dc
  ID-2: /boot/efi raw-size: 500 MiB size: 499 MiB (99.80%)
    used: 296 KiB (0.1%) fs: vfat dev: /dev/nvme0n1p3 maj-min: 259:3 label: N/A
    uuid: 0E22-64FA
  ID-3: /home raw-size: 1.63 TiB size: 1.6 TiB (98.37%)
    used: 826.73 GiB (50.4%) fs: ext4 dev: /dev/nvme0n1p2 maj-min: 259:2
    label: N/A uuid: 3bb573ad-50f2-4fe8-9ccb-c1b01c27423b
Swap:
  Kernel: swappiness: 60 (default) cache-pressure: 100 (default) zswap: yes
    compressor: zstd max-pool: 20%
  ID-1: swap-1 type: file size: 6 GiB used: 0 KiB (0.0%) priority: -2
    file: /swapfile
Unmounted:
  Message: No unmounted partitions found.
USB:
  Hub-1: 1-0:1 info: hi-speed hub with single TT ports: 10 rev: 2.0
    speed: 480 Mb/s (57.2 MiB/s) lanes: 1 mode: 2.0 chip-ID: 1d6b:0002
    class-ID: 0900
  Device-1: 1-3:2 info: Audio-Technica ATR2100x-USB Microphone
    type: audio,HID driver: hid-generic,snd-usb-audio,usbhid interfaces: 4
    rev: 2.0 speed: 480 Mb/s (57.2 MiB/s) lanes: 1 mode: 2.0 power: 100mA
    chip-ID: 0909:004d class-ID: 0300
  Hub-2: 1-7:3 info: Genesys Logic Hub ports: 4 rev: 2.0
    speed: 480 Mb/s (57.2 MiB/s) lanes: 1 mode: 2.0 power: 100mA
    chip-ID: 05e3:0608 class-ID: 0900
  Device-1: 1-8:4 info: Micro Star MYSTIC LIGHT type: HID
    driver: hid-generic,usbhid interfaces: 1 rev: 1.1 speed: 12 Mb/s (1.4 MiB/s)
    lanes: 1 mode: 1.1 power: 500mA chip-ID: 1462:7c95 class-ID: 0300
    serial: <filter>
  Device-2: 1-9:5 info: MediaTek Wireless_Device type: bluetooth
    driver: btusb interfaces: 3 rev: 2.1 speed: 480 Mb/s (57.2 MiB/s) lanes: 1
    mode: 2.0 power: 100mA chip-ID: 0e8d:0608 class-ID: e001 serial: <filter>
  Hub-3: 2-0:1 info: super-speed hub ports: 4 rev: 3.1
    speed: 10 Gb/s (1.16 GiB/s) lanes: 1 mode: 3.2 gen-2x1 chip-ID: 1d6b:0003
    class-ID: 0900
  Hub-4: 3-0:1 info: hi-speed hub with single TT ports: 4 rev: 2.0
    speed: 480 Mb/s (57.2 MiB/s) lanes: 1 mode: 2.0 chip-ID: 1d6b:0002
    class-ID: 0900
  Device-1: 3-3:2 info: Logitech Lightspeed Receiver
    type: keyboard,mouse,HID driver: logitech-djreceiver,usbhid interfaces: 3
    rev: 2.0 speed: 12 Mb/s (1.4 MiB/s) lanes: 1 mode: 1.1 power: 98mA
    chip-ID: 046d:c539 class-ID: 0300
  Device-2: 3-4:3 info: Keychron Link type: mouse,HID,keyboard
    driver: hid-generic,usbhid interfaces: 3 rev: 1.1 speed: 12 Mb/s (1.4 MiB/s)
    lanes: 1 mode: 1.1 power: 100mA chip-ID: 3434:d031 class-ID: 0301
  Hub-5: 4-0:1 info: super-speed hub ports: 4 rev: 3.1
    speed: 10 Gb/s (1.16 GiB/s) lanes: 1 mode: 3.2 gen-2x1 chip-ID: 1d6b:0003
    class-ID: 0900
Sensors:
  System Temperatures: cpu: 47.8 C mobo: 36.0 C gpu: amdgpu temp: 45.0 C
    mem: 40.0 C
  Fan Speeds (rpm): N/A gpu: amdgpu fan: 0
Repos:
  Packages: pm: pacman pkgs: 1335 libs: 370 tools: pamac pm: flatpak pkgs: 0
  Active pacman repo servers in: /etc/pacman.d/mirrorlist
    1: https://mirrors.manjaro.org/repo/stable/$repo/$arch
    2: https://mirror.netcologne.de/manjaro/stable/$repo/$arch
    3: https://ipng.mm.fcix.net/manjaro/stable/$repo/$arch
    4: https://ftp.gwdg.de/pub/linux/manjaro/stable/$repo/$arch
    5: https://repo.ialab.dsu.edu/manjaro/stable/$repo/$arch
    6: https://edgeuno-bog2.mm.fcix.net/manjaro/stable/$repo/$arch
    7: https://mirror.funami.tech/manjaro/stable/$repo/$arch
    8: https://mirror.ufam.edu.br/manjaro/stable/$repo/$arch
Processes:
  CPU top: 5 of 345
  1: cpu: 12.9% command: firefox pid: 3826 mem: 294.7 MiB (0.9%)
  2: cpu: 11.6% command: konsole pid: 4002 mem: 199.8 MiB (0.6%)
  3: cpu: 10.5% command: firefox pid: 3554 mem: 283.7 MiB (0.8%)
  4: cpu: 9.0% command: firefox pid: 2096 mem: 862.8 MiB (2.6%)
  5: cpu: 7.0% command: zsh pid: 4044 mem: 8.29 MiB (0.0%)
  Memory top: 5 of 345
  1: mem: 862.8 MiB (2.6%) command: firefox pid: 2096 cpu: 9.0%
  2: mem: 390.8 MiB (1.2%) command: plasmashell pid: 1194 cpu: 0.2%
  3: mem: 325.4 MiB (1.0%) command: kwin_wayland pid: 973 cpu: 1.9%
  4: mem: 294.7 MiB (0.9%) command: firefox pid: 3826 cpu: 12.9%
  5: mem: 283.7 MiB (0.8%) command: firefox pid: 3554 cpu: 10.5%
Info:
  Processes: 345 Power: uptime: 20m states: freeze,mem,disk suspend: deep
    avail: s2idle wakeups: 0 hibernate: platform avail: shutdown, reboot,
    suspend, test_resume image: 12.49 GiB services: org_kde_powerdevil,
    power-profiles-daemon, upowerd Init: systemd v: 256 default: graphical
    tool: systemctl
  Compilers: clang: 18.1.8 gcc: 14.1.1 Shell: Zsh v: 5.9 default: Bash
    v: 5.2.26 running-in: konsole inxi: 3.3.35

EDIT: Forgot to mention temps are OK, the maximum I saw were 70° for the CPU and 50° for the GPU

Have you tried re-seating the RAM modules?

You mentioned RAM overclocking, which might have damaged it.

I’d suggest running a full memory test (can be done from the GRUB menu IIRC). Let it run overnight.

I just changed the ram modules for brand new ones yesterday because that was my first suspicion, but I am still getting the same crashes. I have not used XMP with these new modules.

I really doubt I got 2 brand new defective modules in a row so the issue must be somewhere else. Plus when I went to the store the guy tested my previous ram and said there was nothing wrong with it.

Did you used Manjaro KDE before you did a new install?

As first work around, i recommend to activate Reisub.

Do you know how to enable Reisub?

As long you don’t have blinking Capslock, im confident that you still have a high chance, that you can restore your system.

If you don’t:

1 Like

Manjaro KDE is the first OS I installed after building the PC, but I’ve used Manjaro KDE on several other computers before.

Thanks for the links, I’ll try to setup REISUB! Would this help me in troubleshooting the issue or is this simply to avoid filesystem corruption?

I also just got a crash while playing Overwatch and replying to this post. But this time the journalctl did not say anything.

By the way, I lost count of how many times I had to force reboot by long pressing the power button (more than 20 that’s for sure). Should I run fsck or should I expect my filesystem to be more damaged than that and simply do a full reinstall?

You were right the computer was still alive enough to allow for a REISUB. I just got a freeze playing Cyberpunk 2077, did a REISUB and was able to retrieve the journalctl after the reboot, here is the output:

août 18 12:33:54 BigBender kernel: ------------[ cut here ]------------
août 18 12:33:54 BigBender kernel: list_add corruption. next->prev should be prev (ffff888103447ea0), but was ffff888103447e98. (next=ffff888103447e98).
août 18 12:33:54 BigBender kernel: WARNING: CPU: 9 PID: 904 at lib/list_debug.c:29 __list_add_valid_or_report+0x6a/0xa0
août 18 12:33:54 BigBender kernel: Modules linked in: snd_seq_dummy rfcomm snd_hrtimer snd_seq snd_seq_device qrtr cmac algif_hash algif_skcipher af_alg bnep intel_rapl_msr intel_rapl_common vfat fat edac_mce_amd kvm_amd kvm mt7921e mt7921_common snd_hda_codec_realtek mt792x_lib irqbypass crct10dif_pclmul snd_hda_codec_generic mt76_connac_lib crc32_pclmul ledtrig_audio btusb polyval_clmulni mt76 snd_hda_codec_hdmi polyval_generic btrtl gf128mul btintel ghash_clmulni_intel snd_hda_intel sha512_ssse3 btbcm sha256_ssse3 btmtk snd_intel_dspcfg sha1_ssse3 mac80211 snd_intel_sdw_acpi aesni_intel snd_hda_codec bluetooth crypto_simd snd_hda_core libarc4 cryptd snd_hwdep ecdh_generic rapl r8169 snd_pcm wmi_bmof cfg80211 realtek pcspkr sp5100_tco acpi_cpufreq snd_timer mdio_devres snd libphy rfkill k10temp soundcore i2c_piix4 ccp joydev mousedev gpio_amdpt gpio_generic mac_hid i2c_dev crypto_user fuse dm_mod loop nfnetlink bpf_preload ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 hid_logitech_hidpp hid_logitech_dj usbhid amdgpu
août 18 12:33:54 BigBender kernel:  i2c_algo_bit drm_ttm_helper ttm video drm_exec drm_suballoc_helper amdxcp drm_buddy gpu_sched nvme drm_display_helper crc32c_intel nvme_core cec xhci_pci nvme_common xhci_pci_renesas wmi
août 18 12:33:54 BigBender kernel: CPU: 9 PID: 904 Comm: kwin_wayland Not tainted 6.6.44-1-MANJARO #1 8598aea1d868f10d66f5d5ae2b57b59e735cd775
août 18 12:33:54 BigBender kernel: Hardware name: Micro-Star International Co., Ltd. MS-7C95/B550M PRO-VDH WIFI (MS-7C95), BIOS 2.L0 07/18/2024
août 18 12:33:54 BigBender kernel: RIP: 0010:__list_add_valid_or_report+0x6a/0xa0
août 18 12:33:54 BigBender kernel: Code: a4 ff 0f 0b 31 c0 e9 20 48 a9 00 48 c7 c7 38 fc 88 8a e8 c9 a0 a4 ff 0f 0b eb e9 48 89 c1 48 c7 c7 60 fc 88 8a e8 b6 a0 a4 ff <0f> 0b eb d6 48 89 d1 48 89 c6 4c 89 c2 48 c7 c7 b0 fc 88 8a e8 9d
août 18 12:33:54 BigBender kernel: RSP: 0018:ffffc90001ebf918 EFLAGS: 00010086
août 18 12:33:54 BigBender kernel: RAX: 0000000000000000 RBX: ffff8881a6383310 RCX: 0000000000000027
août 18 12:33:54 BigBender kernel: RDX: ffff8887fec616c8 RSI: 0000000000000001 RDI: ffff8887fec616c0
août 18 12:33:54 BigBender kernel: RBP: ffff888103447e80 R08: 0000000000000000 R09: ffffc90001ebf788
août 18 12:33:54 BigBender kernel: R10: ffff88881f2f6428 R11: 0000000000000003 R12: ffff888103447e98
août 18 12:33:54 BigBender kernel: R13: 0000000000000092 R14: ffff8881a6383330 R15: ffff888103447ea0
août 18 12:33:54 BigBender kernel: FS:  00007fe81d84ba00(0000) GS:ffff8887fec40000(0000) knlGS:0000000000000000
août 18 12:33:54 BigBender kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
août 18 12:33:54 BigBender kernel: CR2: 00007ff6d573f000 CR3: 0000000101202000 CR4: 0000000000f50ee0
août 18 12:33:54 BigBender kernel: PKRU: 55555554
août 18 12:33:54 BigBender kernel: Call Trace:
août 18 12:33:54 BigBender kernel:  <TASK>
août 18 12:33:54 BigBender kernel:  ? __list_add_valid_or_report+0x6a/0xa0
août 18 12:33:54 BigBender kernel:  ? __warn+0x81/0x140
août 18 12:33:54 BigBender kernel:  ? __list_add_valid_or_report+0x6a/0xa0
août 18 12:33:54 BigBender kernel:  ? report_bug+0x16f/0x1a0
août 18 12:33:54 BigBender kernel:  ? handle_bug+0x3c/0x80
août 18 12:33:54 BigBender kernel:  ? exc_invalid_op+0x17/0x70
août 18 12:33:54 BigBender kernel:  ? asm_exc_invalid_op+0x1a/0x20
août 18 12:33:54 BigBender kernel:  ? __list_add_valid_or_report+0x6a/0xa0
août 18 12:33:54 BigBender kernel:  ? __list_add_valid_or_report+0x6a/0xa0
août 18 12:33:54 BigBender kernel:  damon_split_region_at+0x74/0xa0
août 18 12:33:54 BigBender kernel:  kdamond_fn+0x1093/0x12b0
août 18 12:33:54 BigBender kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
août 18 12:33:54 BigBender kernel:  ? do_sys_poll+0x4c9/0x5e0
août 18 12:33:54 BigBender kernel:  ? __wake_up_common+0x7a/0x180
août 18 12:33:54 BigBender kernel:  ? __wake_up_common_lock+0x80/0xd0
août 18 12:33:54 BigBender kernel:  ? unix_write_space+0x5b/0xa0
août 18 12:33:54 BigBender kernel:  ? sock_wfree+0x9d/0x1d0
août 18 12:33:54 BigBender kernel:  ? unix_destruct_scm+0x86/0xd0
août 18 12:33:54 BigBender kernel:  ? skb_release_head_state+0x27/0x90
août 18 12:33:54 BigBender kernel:  ? consume_skb+0x30/0xd0
août 18 12:33:54 BigBender kernel:  ? unix_stream_read_generic+0xaf3/0xc50
août 18 12:33:54 BigBender kernel:  ? ep_done_scan+0xe0/0x130
août 18 12:33:54 BigBender kernel:  ? unix_stream_recvmsg+0x8c/0xa0
août 18 12:33:54 BigBender kernel:  ? __pfx_unix_stream_read_actor+0x10/0x10
août 18 12:33:54 BigBender kernel:  ? sock_recvmsg+0xc0/0xd0
août 18 12:33:54 BigBender kernel:  ? ____sys_recvmsg+0x97/0x1f0
août 18 12:33:54 BigBender kernel:  ? ___sys_recvmsg+0xbb/0xe0
août 18 12:33:54 BigBender kernel:  ? __sys_recvmsg+0xca/0x100
août 18 12:33:54 BigBender kernel:  ? do_syscall_64+0x5a/0x80
août 18 12:33:54 BigBender kernel:  ? syscall_exit_to_user_mode+0x22/0x40
août 18 12:33:54 BigBender kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
août 18 12:33:54 BigBender kernel:  ? do_syscall_64+0x66/0x80
août 18 12:33:54 BigBender kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
août 18 12:33:54 BigBender kernel:  ? __x64_sys_ioctl+0xaf/0xd0
août 18 12:33:54 BigBender kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
août 18 12:33:54 BigBender kernel:  ? syscall_exit_to_user_mode+0x22/0x40
août 18 12:33:54 BigBender kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
août 18 12:33:54 BigBender kernel:  ? do_syscall_64+0x66/0x80
août 18 12:33:54 BigBender kernel:  ? syscall_exit_to_user_mode+0x22/0x40
août 18 12:33:54 BigBender kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
août 18 12:33:54 BigBender kernel:  ? do_syscall_64+0x66/0x80
août 18 12:33:54 BigBender kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
août 18 12:33:54 BigBender kernel:  ? do_syscall_64+0x66/0x80
août 18 12:33:54 BigBender kernel:  ? entry_SYSCALL_64_after_hwframe+0x78/0xe2
août 18 12:33:54 BigBender kernel:  </TASK>
août 18 12:33:54 BigBender kernel: ---[ end trace 0000000000000000 ]---
août 18 12:33:54 BigBender kernel: BUG: kernel NULL pointer dereference, address: 000000000000009a
août 18 12:33:54 BigBender kernel: #PF: supervisor write access in kernel mode
août 18 12:33:54 BigBender kernel: #PF: error_code(0x0002) - not-present page
août 18 12:33:54 BigBender kernel: PGD 0 P4D 0 
août 18 12:33:54 BigBender kernel: Oops: 0002 [#1] PREEMPT SMP NOPTI
août 18 12:33:54 BigBender kernel: CPU: 9 PID: 904 Comm: kwin_wayland Tainted: G        W          6.6.44-1-MANJARO #1 8598aea1d868f10d66f5d5ae2b57b59e735cd775
août 18 12:33:54 BigBender kernel: Hardware name: Micro-Star International Co., Ltd. MS-7C95/B550M PRO-VDH WIFI (MS-7C95), BIOS 2.L0 07/18/2024
août 18 12:33:54 BigBender kernel: RIP: 0010:damon_split_region_at+0x89/0xa0
août 18 12:33:54 BigBender kernel: Code: 30 4c 89 65 08 4c 8b 65 20 48 89 43 30 4c 89 e2 e8 8c b5 26 00 84 c0 74 11 4d 89 74 24 08 4c 89 63 20 4c 89 7b 28 4c 89 75 20 <41> 83 45 08 01 5b 5d 41 5c 41 5d 41 5e 41 5f e9 ce fd cf 00 0f 1f
août 18 12:33:54 BigBender kernel: RSP: 0018:ffffc90001ebf920 EFLAGS: 00010046
août 18 12:33:54 BigBender kernel: RAX: 0000000000000000 RBX: ffff8881a6383310 RCX: 0000000000000027
août 18 12:33:54 BigBender kernel: RDX: ffff8887fec616c8 RSI: 0000000000000001 RDI: ffff8887fec616c0
août 18 12:33:54 BigBender kernel: RBP: ffff888103447e80 R08: 0000000000000000 R09: ffffc90001ebf788
août 18 12:33:54 BigBender kernel: R10: ffff88881f2f6428 R11: 0000000000000003 R12: ffff888103447e98
août 18 12:33:54 BigBender kernel: R13: 0000000000000092 R14: ffff8881a6383330 R15: ffff888103447ea0
août 18 12:33:54 BigBender kernel: FS:  00007fe81d84ba00(0000) GS:ffff8887fec40000(0000) knlGS:0000000000000000
août 18 12:33:54 BigBender kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
août 18 12:33:54 BigBender kernel: CR2: 000000000000009a CR3: 0000000101202000 CR4: 0000000000f50ee0
août 18 12:33:54 BigBender kernel: PKRU: 55555554
août 18 12:33:54 BigBender kernel: Call Trace:
août 18 12:33:54 BigBender kernel:  <TASK>
août 18 12:33:54 BigBender kernel:  ? __die+0x23/0x70
août 18 12:33:54 BigBender kernel:  ? page_fault_oops+0x174/0x530
août 18 12:33:54 BigBender kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
août 18 12:33:54 BigBender kernel:  ? exc_page_fault+0x7f/0x180
août 18 12:33:54 BigBender kernel:  ? asm_exc_page_fault+0x26/0x30
août 18 12:33:54 BigBender kernel:  ? damon_split_region_at+0x89/0xa0
août 18 12:33:54 BigBender kernel:  kdamond_fn+0x1093/0x12b0
août 18 12:33:54 BigBender kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
août 18 12:33:54 BigBender kernel:  ? do_sys_poll+0x4c9/0x5e0
août 18 12:33:54 BigBender kernel:  ? __wake_up_common+0x7a/0x180
août 18 12:33:54 BigBender kernel:  ? __wake_up_common_lock+0x80/0xd0
août 18 12:33:54 BigBender kernel:  ? unix_write_space+0x5b/0xa0
août 18 12:33:54 BigBender kernel:  ? sock_wfree+0x9d/0x1d0
août 18 12:33:54 BigBender kernel:  ? unix_destruct_scm+0x86/0xd0
août 18 12:33:54 BigBender kernel:  ? skb_release_head_state+0x27/0x90
août 18 12:33:54 BigBender kernel:  ? consume_skb+0x30/0xd0
août 18 12:33:54 BigBender kernel:  ? unix_stream_read_generic+0xaf3/0xc50
août 18 12:33:54 BigBender kernel:  ? ep_done_scan+0xe0/0x130
août 18 12:33:54 BigBender kernel:  ? unix_stream_recvmsg+0x8c/0xa0
août 18 12:33:54 BigBender kernel:  ? __pfx_unix_stream_read_actor+0x10/0x10
août 18 12:33:54 BigBender kernel:  ? sock_recvmsg+0xc0/0xd0
août 18 12:33:54 BigBender kernel:  ? ____sys_recvmsg+0x97/0x1f0
août 18 12:33:54 BigBender kernel:  ? ___sys_recvmsg+0xbb/0xe0
août 18 12:33:54 BigBender kernel:  ? __sys_recvmsg+0xca/0x100
août 18 12:33:54 BigBender kernel:  ? do_syscall_64+0x5a/0x80
août 18 12:33:54 BigBender kernel:  ? syscall_exit_to_user_mode+0x22/0x40
août 18 12:33:54 BigBender kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
août 18 12:33:54 BigBender kernel:  ? do_syscall_64+0x66/0x80
août 18 12:33:54 BigBender kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
août 18 12:33:54 BigBender kernel:  ? __x64_sys_ioctl+0xaf/0xd0
août 18 12:33:54 BigBender kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
août 18 12:33:54 BigBender kernel:  ? syscall_exit_to_user_mode+0x22/0x40
août 18 12:33:54 BigBender kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
août 18 12:33:54 BigBender kernel:  ? do_syscall_64+0x66/0x80
août 18 12:33:54 BigBender kernel:  ? syscall_exit_to_user_mode+0x22/0x40
août 18 12:33:54 BigBender kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
août 18 12:33:54 BigBender kernel:  ? do_syscall_64+0x66/0x80
août 18 12:33:54 BigBender kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
août 18 12:33:54 BigBender kernel:  ? do_syscall_64+0x66/0x80
août 18 12:33:54 BigBender kernel:  ? entry_SYSCALL_64_after_hwframe+0x78/0xe2
août 18 12:33:54 BigBender kernel:  </TASK>
août 18 12:33:54 BigBender kernel: Modules linked in: snd_seq_dummy rfcomm snd_hrtimer snd_seq snd_seq_device qrtr cmac algif_hash algif_skcipher af_alg bnep intel_rapl_msr intel_rapl_common vfat fat edac_mce_amd kvm_amd kvm mt7921e mt7921_common snd_hda_codec_realtek mt792x_lib irqbypass crct10dif_pclmul snd_hda_codec_generic mt76_connac_lib crc32_pclmul ledtrig_audio btusb polyval_clmulni mt76 snd_hda_codec_hdmi polyval_generic btrtl gf128mul btintel ghash_clmulni_intel snd_hda_intel sha512_ssse3 btbcm sha256_ssse3 btmtk snd_intel_dspcfg sha1_ssse3 mac80211 snd_intel_sdw_acpi aesni_intel snd_hda_codec bluetooth crypto_simd snd_hda_core libarc4 cryptd snd_hwdep ecdh_generic rapl r8169 snd_pcm wmi_bmof cfg80211 realtek pcspkr sp5100_tco acpi_cpufreq snd_timer mdio_devres snd libphy rfkill k10temp soundcore i2c_piix4 ccp joydev mousedev gpio_amdpt gpio_generic mac_hid i2c_dev crypto_user fuse dm_mod loop nfnetlink bpf_preload ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 hid_logitech_hidpp hid_logitech_dj usbhid amdgpu
août 18 12:33:54 BigBender kernel:  i2c_algo_bit drm_ttm_helper ttm video drm_exec drm_suballoc_helper amdxcp drm_buddy gpu_sched nvme drm_display_helper crc32c_intel nvme_core cec xhci_pci nvme_common xhci_pci_renesas wmi
août 18 12:33:54 BigBender kernel: CR2: 000000000000009a
août 18 12:33:54 BigBender kernel: ---[ end trace 0000000000000000 ]---
août 18 12:33:54 BigBender kernel: RIP: 0010:damon_split_region_at+0x89/0xa0
août 18 12:33:54 BigBender kernel: Code: 30 4c 89 65 08 4c 8b 65 20 48 89 43 30 4c 89 e2 e8 8c b5 26 00 84 c0 74 11 4d 89 74 24 08 4c 89 63 20 4c 89 7b 28 4c 89 75 20 <41> 83 45 08 01 5b 5d 41 5c 41 5d 41 5e 41 5f e9 ce fd cf 00 0f 1f
août 18 12:33:54 BigBender kernel: RSP: 0018:ffffc90001ebf920 EFLAGS: 00010046
août 18 12:33:54 BigBender kernel: RAX: 0000000000000000 RBX: ffff8881a6383310 RCX: 0000000000000027
août 18 12:33:54 BigBender kernel: RDX: ffff8887fec616c8 RSI: 0000000000000001 RDI: ffff8887fec616c0
août 18 12:33:54 BigBender kernel: RBP: ffff888103447e80 R08: 0000000000000000 R09: ffffc90001ebf788
août 18 12:33:54 BigBender kernel: R10: ffff88881f2f6428 R11: 0000000000000003 R12: ffff888103447e98
août 18 12:33:54 BigBender kernel: R13: 0000000000000092 R14: ffff8881a6383330 R15: ffff888103447ea0
août 18 12:33:54 BigBender kernel: FS:  00007fe81d84ba00(0000) GS:ffff8887fec40000(0000) knlGS:0000000000000000
août 18 12:33:54 BigBender kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
août 18 12:33:54 BigBender kernel: CR2: 000000000000009a CR3: 0000000101202000 CR4: 0000000000f50ee0
août 18 12:33:54 BigBender kernel: PKRU: 55555554
août 18 12:33:54 BigBender kernel: note: kwin_wayland[904] exited with irqs disabled
août 18 12:33:54 BigBender kernel: note: kwin_wayland[904] exited with preempt_count 1
août 18 12:33:56 BigBender pipewire[962]: pw.node: (alsa_output.pci-0000_2f_00.4.analog-stereo-48) graph xrun (1500 suppressed)
août 18 12:33:56 BigBender pipewire[962]: pw.node: (Cyberpunk 2077-72) xrun state:0x7fd66dbde008 pending:0/1 s:2042468106261 a:2040465320132 f:2040465322592 waiting:18446744071706765487 process:2460 status:triggered
août 18 12:33:58 BigBender pipewire[962]: pw.node: (alsa_output.pci-0000_2f_00.4.analog-stereo-48) graph xrun (1501 suppressed)
août 18 12:33:58 BigBender pipewire[962]: pw.node: (Cyberpunk 2077-72) xrun state:0x7fd66dbde008 pending:0/1 s:2044470755326 a:2040465320132 f:2040465322592 waiting:18446744069704116422 process:2460 status:triggered
août 18 12:33:59 BigBender kwin_wayland[904]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
août 18 12:34:00 BigBender pipewire[962]: pw.node: (alsa_output.pci-0000_2f_00.4.analog-stereo-48) graph xrun (1501 suppressed)
août 18 12:34:00 BigBender pipewire[962]: pw.node: (Cyberpunk 2077-72) xrun state:0x7fd66dbde008 pending:0/1 s:2046473405628 a:2040465320132 f:2040465322592 waiting:18446744067701466120 process:2460 status:triggered
août 18 12:34:02 BigBender pipewire[962]: pw.node: (alsa_output.pci-0000_2f_00.4.analog-stereo-48) graph xrun (1501 suppressed)
août 18 12:34:02 BigBender pipewire[962]: pw.node: (Cyberpunk 2077-72) xrun state:0x7fd66dbde008 pending:0/1 s:2048476057868 a:2040465320132 f:2040465322592 waiting:18446744065698813880 process:2460 status:triggered
août 18 12:34:04 BigBender kwin_wayland[904]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
août 18 12:34:04 BigBender pipewire[962]: pw.node: (alsa_output.pci-0000_2f_00.4.analog-stereo-48) graph xrun (1501 suppressed)
août 18 12:34:04 BigBender pipewire[962]: pw.node: (Cyberpunk 2077-72) xrun state:0x7fd66dbde008 pending:0/1 s:2050478709233 a:2040465320132 f:2040465322592 waiting:18446744063696162515 process:2460 status:triggered
août 18 12:34:06 BigBender kernel: sysrq: Keyboard mode set to system default
août 18 12:34:06 BigBender pipewire[962]: pw.node: (alsa_output.pci-0000_2f_00.4.analog-stereo-48) graph xrun (1501 suppressed)
août 18 12:34:06 BigBender pipewire[962]: pw.node: (Cyberpunk 2077-72) xrun state:0x7fd66dbde008 pending:0/1 s:2052481359774 a:2040465320132 f:2040465322592 waiting:18446744061693511974 process:2460 status:triggered
août 18 12:34:08 BigBender pipewire[962]: pw.node: (alsa_output.pci-0000_2f_00.4.analog-stereo-48) graph xrun (1501 suppressed)
août 18 12:34:08 BigBender pipewire[962]: pw.node: (Cyberpunk 2077-72) xrun state:0x7fd66dbde008 pending:0/1 s:2054484009501 a:2040465320132 f:2040465322592 waiting:18446744059690862247 process:2460 status:triggered
août 18 12:34:08 BigBender systemd-journald[426]: Journal stopped
░░ Subject: The journal has been stopped
░░ Defined-By: systemd
░░ Support: https://forum.manjaro.org/c/support
░░ 
░░ The system journal process has shut down and closed all currently
░░ active journal files.
2 Likes
https://docs.kernel.org/admin-guide/tainted-kernels.html

You find the tainted state near the top in a line starting with ‘CPU:’; if or why the kernel was tainted is shown after the Process ID (‘PID:’) and a shortened name of the command (‘Comm:’) that triggered the event

Judging from the log it seems that kwin_wayland is responsible.

 $ mbn info kwin -q
Branch         : archlinux
Name           : kwin
Version        : 6.1.4-1
Repository     : extra
Build Date     : Wed 07 Aug 2024 09:44:20 
Packager       : Tomaz Canabrava <tcanabrava@archlinux.org>

Branch         : unstable
Name           : kwin
Version        : 6.1.4-1
Repository     : extra
Build Date     : Wed 07 Aug 2024 09:44:20 
Packager       : Tomaz Canabrava <tcanabrava@archlinux.org>
Branch         : testing

Name           : kwin
Version        : 6.0.5-3
Repository     : extra
Build Date     : Mon 01 Jul 2024 06:02:50 
Packager       : Philip Mueller <philm@manjaro.org>
Branch         : stable

Name           : kwin
Version        : 6.0.5-3
Repository     : extra
Build Date     : Mon 01 Jul 2024 06:02:50 
Packager       : Philip Mueller <philm@manjaro.org>

Since you are on stable branch - using Plasma 6.0.5 - I suggest you make a timeshift backup.

As you are not using Nvidia (judging from your inxi info above) you should have no issues with Plasma 6.1.4.

Then switch branch to unstable (providing Plasma 6.1.4) and run a full system sync.

2024-08-17T22:00:00Z

  • Linux 6.6 is at 6.6.46
  • Linux 6.10 is at 6.10.5
  • Linux 6.11 is at 6.11.0-rc3
1 Like

Thanks for the info!

I do seem to remember the crashes happening on X11 as well, I’ll try to run it for a while and post the log here if it freezes.

Any updates on when plasma 6.1.4 is supposed to land on stable? If it’s only a few days I can wait instead of switching to unstable.

I don’t know - don’t expect it soon - as there - apparently- is unresolved issues with Nvidia.

As a former nvidia laptop user, thank you for not pushing an update that breaks the GPU :slight_smile:

I played around for a bit with X11 and I have not been able to reproduce the full system freeze for now. Cyberpunk crashed a few times (that’s a bit expected from this game sadly) but only the game, not the full computer. I’ll come back to this thread if I manage to reproduce the freeze on X11 or when I have updated to plasma 6.1.

Thanks a lot for your help!

Welp, a freeze just happened while playing Overwatch on X11.

Sadly this time REISUB did not work and I was forced to reboot using the power button. I also cannot find anything in journalctl.

I’ll try to update to plasma 6.1 then.

You can try this out, but i wouldn’t expect to much from it.

6.0.5 should be run stable.

I personally would first create a Timeshift snapshot then switch Kernels 6.1 or the newest 6.10 and when you said you never was running Manjaro with your system. Its still possible that your CPU is unstable… I would downclock that Prozessor with Process Governor (not sure if this works with AMD CPU, i only tried it with my Intel System).

You could also try to let your CPU Mhz unchanged and add just a little vcore to the CPU in your Bios settings.

Since you not using even XMP (i think for AMD it is called expo) your Ram, should be fine with the weak default clocks… but you still could do memory testing, maybe one Ram is faulty or the slot has dust on the slot… who knows.

A Filesystem check can’t hurt… if you had a Timeshift snapshot before all this crashes occured, you could saved a good amount of time.

But if 20 freezes already enough for data loss? Nobody can tell… i personally would only reinstall after your solved your unstable system problem.

A good way to see if your system is stable is also Video Encoding, after i switched from Windows to Manjaro, my CPU was unstable in Linux but was stable under Windows… i had to remove 300Mhz OC to get it stable again under Linux with Shotcut.

Switched to unstable, installed plasma 6.1 and indeed after a few hours I still got a full system freeze.

I am on kernel 6.6 LTS but I was using 6.10 before when I first experienced issues.

For the RAM, I’ve already tried 2 models with XMP off and both have issues. Could try cleaning up the slots though, it won’t hurt.

For the CPU, I did not overclock it and it is still on the stock 3.8GHz clock speed (can go up to 4.6GHz). Could damaged pins create such issues or would the computer not even boot? It’s my first build so it’s possible I messed up inserting it.

I’ll try to run stress to check the CPU. I’ll try to play with the vcore stuff but I’m not really familiar with that (I’m used to laptops CPU you can’t configure).

You could also try 1 single ram at a time.
Im pretty sure that you don’t have damaged pins on the CPU. With that you wouldn’t even install Manjaro at all.

I recommend to reduce Mhz on your CPU first, but i have no idea how Overvolting or Downclocking on a AMD system is working.

Its probably worth it to watch a little tutorial how to do that on your Mainboard. Always keep in mind that you can damage your hardware if you doing something wrong there… so be carefull of what you doing, when you doing important settings.

AMD’s Agesa stuff doing a lot background adjustments and i have no experience about AMD CPU’s (only Intel yet)… because my AMD Laptop denied me all adjustments, which is possible on PC.

Some updates on the troubleshooting. I did not try messing with the CPU but here is what I did:

Installed the Bazzite distro, still had system freezes, 1 while downloading a game the others playing cyberpunk.
Removed one RAM slot, changed nothing.
Tried playing by switching to another TTY and starting the game with gamescope, but still had freezes.

So the issue is not limited to Manjaro (as it happens on Bazzite), and not limited to KDE (as it happens on dedicated gamescope).

I may just go to a computer store and ask to check for hardware defects, I am completely lost on what is causing this.

Same.

Might be a good idea. Just be careful of what the Winblows fanboyz try shoving onto you!

1 Like

You could do the same with the other Ram Module and choose another RAM slot.

You can do that or try to solve it for yourself, its up to you :slight_smile:

The motherboard I use specifies which slot to use depending on the number of modules I have installed. So if I only have 1 I can’t really move it.

I’d love to be able to solve it myself but I don’t really know much about hardware troubleshooting, and I don’t have the time to dive too deep into the subject.

Good news everyone!

I was too lazy to move the computer to the computer store so I did some more digging and found the culprit.

I am new to AMD hardware and turns out the CPU is overclocked out of the box. This processor has the “Core Performance Boost” feature which allows it to reach the boost clock of 4.8GHz compared to the base 3.8GHz. After disabling this feature from the BIOS, I did not have a single freeze in a few days.

The issues were mostly present when playing Cyberpunk 2077 because this game hates overclocks (even on Windows). Without the overclock, the game never crashed.

Maybe there is an issue with the default settings of the motherboard and the voltages are not right for the CPU, don’t know but I guess the next step is to tweak the settings to have the overclock more stable.

Thanks everyone for your help!

EDIT: For anyone having the same issue, I found a better fix which allows me to use “Core Performance Boost” to get the most out of the CPU. In the BIOS, I increased the DRAM voltage from 1.2V to 1.25V, and also increased the VCore voltage by 0.050V. After increasing these voltages and re-enabling CPB, I was able to use the computer with the CPU reaching 4.8GHz without any crashes.

2 Likes