Proton Games freezing entire system randomly

As the title says, my entire system freezes seemingly at random when playing games through Proton. The freeze almost always happens, but how long I can play before it occurs varies.

inxi output:

System:
  Kernel: 5.12.9-1-MANJARO x86_64 bits: 64 compiler: gcc v: 11.1.0 
  parameters: BOOT_IMAGE=/boot/vmlinuz-5.12-x86_64 
  root=UUID=b009476d-30ae-4640-bac2-177f50f3df93 rw apparmor=1 
  security=apparmor udev.log_priority=3 
  Console: tty pts/2 wm: kwin_x11 DM: SDDM Distro: Manjaro Linux 
  base: Arch Linux 
Machine:
  Type: Laptop System: Notebook product: NP50DE_DB v: N/A serial: N/A Chassis: 
  type: 10 serial: N/A 
  Mobo: Notebook model: NP50DE_DB serial: N/A UEFI: INSYDE v: 1.07.04 
  date: 02/17/2020 
Battery:
  ID-1: BAT0 charge: 48.7 Wh (100.0%) condition: 48.7/48.3 Wh (100.9%) 
  volts: 17.1 min: 15.2 model: Notebook BAT type: Li-ion serial: <filter> 
  status: Full 
Memory:
  RAM: total: 31.06 GiB used: 7.69 GiB (24.8%) 
  Array-1: capacity: 32 GiB slots: 2 EC: None max-module-size: 32 GiB 
  note: est. 
  Device-1: ChannelA-DIMM0 size: 32 GiB speed: 2667 MT/s type: DDR4 
  detail: synchronous bus-width: 64 bits total: 64 bits manufacturer: Corsair 
  part-no: CM4X32GE2666C18S2 serial: N/A 
  Device-2: ChannelB-DIMM0 size: No Module Installed 
CPU:
  Info: 6-Core model: Intel Core i7-10750H socket: U3E1 bits: 64 type: MT MCP 
  arch: Comet Lake family: 6 model-id: A5 (165) stepping: 2 microcode: E0 
  cache: L2: 12 MiB bogomips: 62431 
  Speed: 2570 MHz min/max: 2500/5000 MHz base/boost: 2475/8300 volts: 0.8 V 
  ext-clock: 100 MHz Core speeds (MHz): 1: 2570 2: 2508 3: 2662 4: 2609 
  5: 2614 6: 2516 7: 2565 8: 2630 9: 2541 10: 2532 11: 2578 12: 2575 
  Flags: 3dnowprefetch abm acpi adx aes aperfmperf apic arat arch_capabilities 
  arch_perfmon art avx avx2 bmi1 bmi2 bts clflush clflushopt cmov constant_tsc 
  cpuid cpuid_fault cx16 cx8 de ds_cpl dtes64 dtherm dts epb ept ept_ad erms 
  est f16c flexpriority flush_l1d fma fpu fsgsbase fxsr ht hwp hwp_act_window 
  hwp_epp hwp_notify ibpb ibrs ibrs_enhanced ida intel_pt invpcid 
  invpcid_single lahf_lm lm mca mce md_clear mmx monitor movbe mpx msr mtrr 
  nonstop_tsc nopl nx ospke pae pat pbe pcid pclmulqdq pdcm pdpe1gb pebs pge 
  pku pln pni popcnt pse pse36 pts rdrand rdseed rdtscp rep_good sdbg sep sgx 
  sgx_lc smap smep ss ssbd sse sse2 sse4_1 sse4_2 ssse3 stibp syscall tm tm2 
  tpr_shadow tsc tsc_adjust tsc_deadline_timer vme vmx vnmi vpid x2apic 
  xgetbv1 xsave xsavec xsaveopt xsaves xtopology xtpr 
  Vulnerabilities: Type: itlb_multihit status: KVM: VMX disabled 
  Type: l1tf status: Not affected 
  Type: mds status: Not affected 
  Type: meltdown status: Not affected 
  Type: spec_store_bypass 
  mitigation: Speculative Store Bypass disabled via prctl and seccomp 
  Type: spectre_v1 
  mitigation: usercopy/swapgs barriers and __user pointer sanitization 
  Type: spectre_v2 mitigation: Enhanced IBRS, IBPB: conditional, RSB filling 
  Type: srbds status: Not affected 
  Type: tsx_async_abort status: Not affected 
Graphics:
  Device-1: Intel UHD Graphics vendor: CLEVO/KAPOK driver: i915 v: kernel 
  bus-ID: 00:02.0 chip-ID: 8086:9bc4 class-ID: 0300 
  Device-2: NVIDIA TU117M vendor: CLEVO/KAPOK driver: nvidia v: 465.31 
  alternate: nouveau,nvidia_drm bus-ID: 01:00.0 chip-ID: 10de:1f99 
  class-ID: 0300 
  Device-3: Chicony Chicony USB2.0 Camera type: USB driver: uvcvideo 
  bus-ID: 1-8:3 chip-ID: 04f2:b685 class-ID: 0e02 
  Display: server: X.Org 1.20.11 compositor: kwin_x11 driver: 
  loaded: modesetting,nvidia alternate: fbdev,intel,nouveau,nv,vesa 
  display-ID: :0 screens: 1 
  Screen-1: 0 s-res: 1920x1080 s-dpi: 96 s-size: 508x285mm (20.0x11.2") 
  s-diag: 582mm (22.9") 
  Monitor-1: eDP-1 res: 1920x1080 hz: 144 dpi: 142 size: 344x194mm (13.5x7.6") 
  diag: 395mm (15.5") 
  OpenGL: renderer: Mesa Intel UHD Graphics (CML GT2) v: 4.6 Mesa 21.1.2 
  direct render: Yes 
Audio:
  Device-1: Intel Comet Lake PCH cAVS vendor: CLEVO/KAPOK 
  driver: snd_hda_intel v: kernel alternate: snd_soc_skl,snd_sof_pci_intel_cnl 
  bus-ID: 00:1f.3 chip-ID: 8086:06c8 class-ID: 0403 
  Sound Server-1: ALSA v: k5.12.9-1-MANJARO running: yes 
  Sound Server-2: JACK v: 0.125.0 running: no 
  Sound Server-3: PulseAudio v: 14.2 running: yes 
  Sound Server-4: PipeWire v: 0.3.30 running: yes 
Network:
  Device-1: Intel Wi-Fi 6 AX200 driver: iwlwifi v: kernel port: 4000 
  bus-ID: 07:00.0 chip-ID: 8086:2723 class-ID: 0280 
  IF: wlp7s0 state: up mac: <filter> 
  IP v4: <filter> type: dynamic noprefixroute scope: global 
  broadcast: <filter> 
  IP v6: <filter> type: noprefixroute scope: link 
  Device-2: Realtek RTL8111/8168/8411 PCI Express Gigabit Ethernet 
  vendor: CLEVO/KAPOK driver: r8168 v: 8.048.03-NAPI modules: r8169 port: 3000 
  bus-ID: 08:00.1 chip-ID: 10ec:8168 class-ID: 0200 
  IF: enp8s0f1 state: up speed: 1000 Mbps duplex: full mac: <filter> 
  IP v4: <filter> type: dynamic noprefixroute scope: global 
  broadcast: <filter> 
  IP v6: <filter> type: noprefixroute scope: link 
  WAN IP: <filter> 
Bluetooth:
  Device-1: Intel AX200 Bluetooth type: USB driver: btusb v: 0.8 
  bus-ID: 1-14:4 chip-ID: 8087:0029 class-ID: e001 
  Report: rfkill ID: hci0 rfk-id: 3 state: up address: see --recommends 
Logical:
  Message: No logical block device data found. 
RAID:
  Message: No RAID data found. 
Drives:
  Local Storage: total: 931.51 GiB used: 522.99 GiB (56.1%) 
  ID-1: /dev/nvme0n1 maj-min: 259:0 vendor: Western Digital 
  model: WDS100T2B0C-00PXH0 size: 931.51 GiB block-size: physical: 512 B 
  logical: 512 B speed: 31.6 Gb/s lanes: 4 rotation: SSD serial: <filter> 
  rev: 211070WD temp: 36 Celsius C scheme: GPT 
  SMART: yes health: PASSED on: 97d 15h cycles: 1,052 
  read-units: 10,311,069 [5.27 TB] written-units: 11,712,283 [5.99 TB] 
  Message: No optical or floppy data found. 
Partition:
  ID-1: / raw-size: 931.41 GiB size: 915.79 GiB (98.32%) 
  used: 522.95 GiB (57.1%) fs: ext4 block-size: 4096 B dev: /dev/nvme0n1p5 
  maj-min: 259:2 label: N/A uuid: b009476d-30ae-4640-bac2-177f50f3df93 
  ID-2: /boot/efi raw-size: 100 MiB size: 96 MiB (96.00%) 
  used: 44.4 MiB (46.2%) fs: vfat block-size: 512 B dev: /dev/nvme0n1p1 
  maj-min: 259:1 label: N/A uuid: CC53-A380 
Swap:
  Kernel: swappiness: 60 (default) cache-pressure: 100 (default) 
  ID-1: swap-1 type: file size: 16 GiB used: 0 KiB (0.0%) priority: -2 
  file: /swapfile 
Unmounted:
  Message: No unmounted partitions found. 
USB:
  Hub-1: 1-0:1 info: Full speed (or root) Hub ports: 16 rev: 2.0 
  speed: 480 Mb/s chip-ID: 1d6b:0002 class-ID: 0900 
  Device-1: 1-2:2 info: [Maxxter] Optical Gaming Mouse [Xtrem] 
  type: Mouse,Keyboard driver: hid-generic,usbhid interfaces: 2 rev: 1.1 
  speed: 1.5 Mb/s power: 100mA chip-ID: 18f8:0f97 class-ID: 0301 
  Device-2: 1-8:3 info: Chicony Chicony USB2.0 Camera type: Video 
  driver: uvcvideo interfaces: 2 rev: 2.0 speed: 480 Mb/s power: 500mA 
  chip-ID: 04f2:b685 class-ID: 0e02 
  Device-3: 1-14:4 info: Intel AX200 Bluetooth type: Bluetooth driver: btusb 
  interfaces: 2 rev: 2.0 speed: 12 Mb/s power: 100mA chip-ID: 8087:0029 
  class-ID: e001 
  Hub-2: 2-0:1 info: Full speed (or root) Hub ports: 8 rev: 3.1 speed: 10 Gb/s 
  chip-ID: 1d6b:0003 class-ID: 0900 
Sensors:
  System Temperatures: cpu: 59.0 C mobo: N/A 
  Fan Speeds (RPM): N/A 
Info:
  Processes: 350 Uptime: 4h 47m wakeups: 7 Init: systemd v: 248 
  tool: systemctl Compilers: gcc: 11.1.0 clang: 12.0.0 Packages: pacman: 1473 
  lib: 425 flatpak: 0 snap: 0 Shell: Bash (sudo) v: 5.1.8 running-in: konsole 
  inxi: 3.3.04

and mhwd -li output:

Installed PCI configs:
--------------------------------------------------------------------------------
                  NAME               VERSION          FREEDRIVER           TYPE
--------------------------------------------------------------------------------
         network-r8168            2016.04.20                true            PCI
     video-modesetting            2020.01.13                true            PCI
video-hybrid-intel-nvidia-prime            2020.11.30               false            PCI


Warning: No installed USB configs!

Apart from doing a hard shutdown by holding down the power button, nothing works to resolve the issue or regain any control over the laptop. It is literally frozen. This happens consistently, even with games rated Platinum on ProtonDB. I use the latest official Proton version released by Valve, and the latest GloriousEggroll version if I experience issues with the former.
Any help would be much appreciated!

Hi,

Sometimes I have freezes, too, but they usually happen when I switch to the Desktop (ALT + TAB) with the game still running. During the games it happens very, very rarely.
Anyway, a way I use to get my system to work without a hard reset is by logging into another session (tty).
Did you try that already?
When you press CTRL+ALT+F2 you should see a console login prompt. Log in and use the commands ps -ef | grep <name of the game> and kill -9 <PID of the game> to kill it.
Afterwards you can switch back to the desktop with CTRL+ALT+F1.

This works for me to unfreeze my KDE desktop session. Hope this works for you as well, so that you don’t lose any open sessions on your desktop.

Apart from a possible workaround, I unfortunately cannot provide any help. I don’t know what causes the freezes and I also don’t know how to fix them, sorry :frowning:


Tried and tested; it does not work at all, but I appreciate the effort. For me it happens in the middle of games, even when using windowed fullscreen.

Hello @CursedAuroran :wink:

Freezes can have many causes. Proton is just a Wine wrapper that translates the Windows API to Linux APIs. Wine is not perfect and does not have all the features of Windows, only the necessary ones.

At this point it is necessary to investigate why it freezes, which is usually visible in the logs. So there is no straightforward answer to this.

My first guess is a heating problem…

Where can I find these logs?

System logs are here:

journalctl --since=-30min --no-pager

For Proton, you can enable logging with:

PROTON_LOG=1 PROTON_LOG_DIR=$HOME/logs/ %command%

Need more? Enable debug logs:

PROTON_LOG=1 PROTON_LOG_DIR=$HOME/logs/ PROTON_DUMP_DEBUG_COMMANDS=1 PROTON_DEBUG_DIR=$HOME/debug/ %command%

https://github.com/ValveSoftware/Proton/#runtime-config-options
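
If nothing shows up in the log directory, one thing worth ruling out is that the directory does not exist yet. The sketch below creates it and prints the newest Proton log. Both function names are made up for illustration, and it assumes Proton's usual steam-<appid>.log file naming; whether Proton creates a missing PROTON_LOG_DIR on its own is not something the thread confirms.

```shell
# Hypothetical helpers, not part of Proton or Steam.

# Create the directory that PROTON_LOG_DIR points at (default: $HOME/logs).
prepare_proton_log_dir() {
    log_dir="${1:-$HOME/logs}"
    mkdir -p "$log_dir"
}

# Print the most recently modified steam-<appid>.log in that directory.
# Assumes Proton's usual log file naming; prints nothing if no log exists.
newest_proton_log() {
    log_dir="${1:-$HOME/logs}"
    ls -t "$log_dir"/steam-*.log 2>/dev/null | head -n 1
}
```

Run prepare_proton_log_dir once before starting the game, then after the next freeze and reboot check the file printed by newest_proton_log.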

Btw… I hope you run the games in Proton/Steam with:

prime-run %command%

I did not, as I did not know about it before. I will try that out in a bit and then report back.

Oh well… If you have a hybrid Intel/NVIDIA GPU setup and run games without prime-run, the games run on the Intel GPU. prime-run offloads the graphics work to the NVIDIA card.

Try this to be sure that it works:

glxinfo | grep "OpenGL renderer"

and

prime-run glxinfo | grep "OpenGL renderer"
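
For reference, prime-run is a small wrapper script that sets NVIDIA's render-offload environment variables for a single command. A minimal sketch of the same idea as a shell function follows; the function name offload_to_nvidia is made up, while the three variable names are the ones NVIDIA documents for PRIME render offload.

```shell
# Rough equivalent of the prime-run wrapper: set NVIDIA's render-offload
# variables for one command only, then run that command.
# The function name is hypothetical.
offload_to_nvidia() {
    __NV_PRIME_RENDER_OFFLOAD=1 \
    __VK_LAYER_NV_optimus=NVIDIA_only \
    __GLX_VENDOR_LIBRARY_NAME=nvidia \
    "$@"
}
```

For example, offload_to_nvidia glxinfo | grep "OpenGL renderer" should then name the NVIDIA card. A shell function is not visible to Steam, so in the launch options either keep prime-run %command% or put the variables directly in front of %command%.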

After some testing, the graphical load does appear to get offloaded to the NVIDIA card, but the crashing and freezing persist. I will now look at the other suggestions and see if I can find anything.

@megavolt I believe these are the 2 seconds right before the crash. Do you see anything that could be the issue?

Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: AER: Corrected error received: 0000:00:1d.6
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:   device [8086:06b6] error status/mask=00001000/00002000
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:    [12] Timeout               
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: AER: Corrected error received: 0000:00:1d.6
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:   device [8086:06b6] error status/mask=00001000/00002000
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:    [12] Timeout               
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: AER: Corrected error received: 0000:00:1d.6
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:   device [8086:06b6] error status/mask=00001000/00002000
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:    [12] Timeout               
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: AER: Corrected error received: 0000:00:1d.6
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:   device [8086:06b6] error status/mask=00001000/00002000
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:    [12] Timeout               
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: AER: Corrected error received: 0000:00:1d.6
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:   device [8086:06b6] error status/mask=00001000/00002000
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:    [12] Timeout               
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: AER: Corrected error received: 0000:00:1d.6
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:   device [8086:06b6] error status/mask=00001000/00002000
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:    [12] Timeout               
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46500, resource id: 44040303, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46501, resource id: 44040305, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46502, resource id: 44040307, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46503, resource id: 44040309, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46504, resource id: 44040311, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46505, resource id: 44040313, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46506, resource id: 44040299, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46507, resource id: 44040301, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: AER: Corrected error received: 0000:00:1d.6
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:   device [8086:06b6] error status/mask=00001000/00002000
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:    [12] Timeout               
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: AER: Corrected error received: 0000:00:1d.6
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:   device [8086:06b6] error status/mask=00001000/00002000
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:    [12] Timeout               
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46689, resource id: 44040303, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46690, resource id: 44040305, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46691, resource id: 44040307, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46692, resource id: 44040309, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46693, resource id: 44040311, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46694, resource id: 44040313, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46695, resource id: 44040299, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46696, resource id: 44040301, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46698, resource id: 44040303, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46699, resource id: 44040305, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46700, resource id: 44040307, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46701, resource id: 44040309, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46702, resource id: 44040311, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46703, resource id: 44040313, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46704, resource id: 44040299, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46705, resource id: 44040301, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46779, resource id: 44040303, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46780, resource id: 44040305, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46781, resource id: 44040307, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46782, resource id: 44040309, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46783, resource id: 44040311, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46784, resource id: 44040313, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46785, resource id: 44040299, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46786, resource id: 44040301, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46788, resource id: 44040303, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46789, resource id: 44040305, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46790, resource id: 44040307, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46791, resource id: 44040309, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46792, resource id: 44040311, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46793, resource id: 44040313, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46794, resource id: 44040299, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46795, resource id: 44040301, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46798, resource id: 44040303, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46799, resource id: 44040305, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46800, resource id: 44040307, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46801, resource id: 44040309, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46802, resource id: 44040311, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46803, resource id: 44040313, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46804, resource id: 44040299, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46805, resource id: 44040301, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46807, resource id: 44040303, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46808, resource id: 44040305, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46809, resource id: 44040307, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46810, resource id: 44040309, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46811, resource id: 44040311, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46812, resource id: 44040313, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46813, resource id: 44040299, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46814, resource id: 44040301, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: AER: Corrected error received: 0000:00:1d.6
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:   device [8086:06b6] error status/mask=00001000/00002000
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:    [12] Timeout               
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: AER: Corrected error received: 0000:00:1d.6
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:   device [8086:06b6] error status/mask=00001000/00002000
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:    [12] Timeout               
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: AER: Corrected error received: 0000:00:1d.6
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:   device [8086:06b6] error status/mask=00001000/00002000
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:    [12] Timeout               
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6: AER: Corrected error received: 0000:00:1d.6
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6:   device [8086:06b6] error status/mask=00001000/00002000
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6:    [12] Timeout               
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6: AER: Corrected error received: 0000:00:1d.6
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6:   device [8086:06b6] error status/mask=00001000/00002000
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6:    [12] Timeout               
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6: AER: Corrected error received: 0000:00:1d.6
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6:   device [8086:06b6] error status/mask=00001000/00002000
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6:    [12] Timeout               
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6: AER: Corrected error received: 0000:00:1d.6
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6:   device [8086:06b6] error status/mask=00001000/00002000
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6:    [12] Timeout               
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6: AER: Corrected error received: 0000:00:1d.6
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6:   device [8086:06b6] error status/mask=00001000/00002000
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6:    [12] Timeout 
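
A log like this is easier to judge when the repeated AER messages are counted per device. The following is a minimal sketch that reads journal text on standard input and prints one line per reporting device; the function name summarize_aer is made up for illustration.

```shell
# Hypothetical helper: count "AER: Corrected error received" messages
# per PCI address. Reads journal text on stdin, prints "<address> <count>".
summarize_aer() {
    awk '
        /AER: Corrected error received:/ { count[$NF]++ }
        END { for (dev in count) print dev, count[dev] }
    ' | sort
}

# Possible usage (kernel messages of the previous boot; needs a persistent journal):
#   journalctl -k -b -1 --no-pager | summarize_aer
# To see which hardware sits behind the reporting root port:
#   lspci -s 00:1d.6
#   lspci -tv
```

Knowing which device hangs off 00:1d.6 helps decide whether the GPU, the NVMe drive, or a network card is involved.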

@CursedAuroran

This could indeed be a problem. AER (Advanced Error Reporting) received an error in the communication on the PCIe bus, but it gets corrected: severity=Corrected. I would be worried if it were severity=Uncorrected; that would be a problem.

Either it is a real error, or just a communication problem between the kernel and the PCIe root device. It is just a guess, but these errors occur when the available bandwidth is lower than what the device connected to the PCIe slot can handle. So the NVIDIA driver tries to use the full bandwidth but hits the threshold, and data above the threshold is not rejected but corrected by sending it again.

You can disable this reporting by adding:

pci=noaer

to the kernel parameters.

Maybe your laptop's UEFI has a bad memory-mapped config space. For most such cases there are workarounds, but you can also add this to disable it:

pci=nommconf

Or maybe the Message Signaled Interrupts have problems here… add:

pci=nomsi

None of these parameters can harm your hardware; they only disable functions that cause problems with faulty UEFI implementations. Add them one at a time and watch the logs. Then you will know where the problem comes from.
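
For anyone unsure how to add such a parameter: on a GRUB-based install it usually goes into the GRUB_CMDLINE_LINUX_DEFAULT line of /etc/default/grub. The sketch below appends one parameter to that line and keeps a backup. The function name add_kernel_param is made up, and it assumes the line uses double quotes as in a stock file.

```shell
# Hypothetical helper: append one kernel parameter to the
# GRUB_CMDLINE_LINUX_DEFAULT line of a GRUB defaults file.
# Usage: add_kernel_param pci=noaer /etc/default/grub   (run as root for the real file)
add_kernel_param() {
    param="$1"
    file="$2"
    # Do nothing if the parameter is already on that line.
    if grep -q "^GRUB_CMDLINE_LINUX_DEFAULT=.*[\" ]${param}[\" ]" "$file"; then
        return 0
    fi
    # Keep a backup, then insert the parameter just before the closing quote.
    cp "$file" "$file.bak" &&
        sed "s|^\(GRUB_CMDLINE_LINUX_DEFAULT=\"[^\"]*\)\"|\1 ${param}\"|" "$file.bak" > "$file"
}

# Afterwards, regenerate the GRUB config and reboot:
#   sudo grub-mkconfig -o /boot/grub/grub.cfg
# Then confirm the parameter is active:
#   cat /proc/cmdline
```

Adding one parameter per reboot keeps it clear which one, if any, changes the behaviour.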

@megavolt Unfortunately those kernel parameters don't appear to do anything; the issue persists with both. I have tried setting up the Proton logging you mentioned, but it doesn't appear that anything gets logged at all when I look for the logs after the crash.

@CursedAuroran

Please upload a full journal to an external service, then. I guess it is a deeper problem.

journalctl --since=yesterday | curl -F'file=@-' https://0x0.st

Or use another service and share the link here. Thanks.

https://0x0.st/-9S2.txt

I see nothing in the log that could cause this, and to be honest I have never had such a problem on my laptops…

The Xorg logs might also tell us more:

for x in /var/log/Xorg*; do echo -e "---$x---\n\n $(cat "$x")\n\n"; done

Maybe someone with better knowledge about this problem will see it… but it seems to be a problem with the PCIe interface.

Since you also use an NVMe drive, which is commonly connected via PCIe, I would look at this as well. It could also be a problem where you need to re-seat the card (if possible).

However, it is a really strange problem, which I guess is more hardware related.

Have you tried different kernels?

mhwd-kernel -l

Have you checked the temperature? You know, it is summer…

nvidia-smi
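
A single nvidia-smi call only shows the temperature at that moment. To see how hot the GPU got before a freeze, the value can be logged in a loop and inspected after the reboot. This is a sketch only: the log file name and the helper max_gpu_temp are made up, and the last few samples may be missing after a hard freeze.

```shell
# Log GPU temperature every 5 seconds (stop with Ctrl+C). The file name is arbitrary.
#   nvidia-smi --query-gpu=timestamp,temperature.gpu \
#              --format=csv,noheader,nounits -l 5 >> "$HOME/gpu-temp.csv"

# Hypothetical helper: print the highest temperature found in such a log.
# Each line looks like: 2021/06/18 18:17:19.000, 71
max_gpu_temp() {
    awk -F', ' '$2 + 0 > max { max = $2 + 0 } END { print max + 0 }' "$1"
}
```

After the next freeze, max_gpu_temp "$HOME/gpu-temp.csv" gives the peak value seen during the session.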

Maybe a driver issue? Remove the NVIDIA driver and see if the issue persists in the logs with open-source drivers only.

At the moment I can just suggest to tinker a bit, if you have time.

Maybe a problem with ASPM?

pcie_aspm=off

pcie_aspm=  [PCIE] Forcibly enable or disable PCIe Active State Power
            Management.
    off     Disable ASPM.
    force   Enable ASPM even on devices that claim not to support it.
            WARNING: Forcing ASPM on may cause system lockups.

https://www.kernel.org/doc/html/latest/admin-guide/kernel-parameters.html

I am tinkering, and I figured out that if I disable the ability of applications to block compositing, it works just fine (or maybe I just got lucky and it didn't crash). I am now also investigating different rendering backend options to see if they make a difference.

Right, so changing the backend doesn't fix the issue, whereas disabling applications' ability to block compositing does. But doing so tanks performance badly and makes native games nearly unplayable.

@megavolt After more searching I think I found the cause of the issue, or at least part of it. When Proton games turn off the compositor in favour of performance, something happens that causes Plasma/KDE to freak out and freeze up.

@CursedAuroran Is it perhaps caused by using OpenGL rendering instead of software rendering? I remember there were issues with that on KDE.

Yeah, it's possible. I am going to look at switching to a different DE and see if that works.

WELP, for some reason, now that I have closely investigated the issue with Proton, it's spreading to native games. Whoohoo. Time to get a second USB stick to back up important files and then switch to a different DE.