Proton Games freezing entire system randomly

Hi,

Sometimes I have freezes, too, but they usually happen when I switch to the Desktop (ALT + TAB) with the game still running. During the games it happens very, very rarely.
Anyway, a way I use to get my system to work without a hard reset is by logging into another session (tty).
Did you try that already?
When you press CTRL+ALT+F2 you should see a login screen to the console. Login and use commands ps -ef | grep <name of the game> and kill -9 <PID of the game> to kill it.
Afterwards you can switch back to the desktop with CTRL+ALT+F1.

This works for me to unfreeze my KDE desktop session. Hope this works for you as well, so that you don’t loose any open sessions on your desktop.

Apart from a possible workaround I cannot provide any help unfortunately. I don’t know what causes the freezes and I also don’t know how to fix, sorry :frowning:

1 Like

tried and tested, does not work, at all, but I appreciate the effort, and for me it happens in the middle of games, even when using windowed fullscreen

Hello @CursedAuroran :wink:

Freezes can have many roots. Proton is a just a wine wrapper which translates windows api to a linux api. Wine is not perfect and does not have all features of windows, only the ones, which are necessary.

At this point it is necessary to investigate why it freezes… usually visible at the logs. So there is no straight answer to this.

My first guess is a heating problem…

where can I find these logs?

Systemlogs are here:

journalctl --since=-30min --no-pager

For proton, you can enable it with:

PROTON_LOG=1 PROTON_LOG_DIR=$HOME/logs/ %command%

Need more? Enable debug logs:

PROTON_LOG=1 PROTON_LOG_DIR=$HOME/logs/ PROTON_DUMP_DEBUG_COMMANDS=1 PROTON_DEBUG_DIR=$HOME/debug/ %command%

https://github.com/ValveSoftware/Proton/#runtime-config-options

Btw… i hope you run the games in proton/steam with:

prime-run %command%

I did not, as I did not know about that variable before, will try that out in a bit and then report back

Oh well… If you have a hybrid gpu and run games without prime-run on INTEL/NVIDIA-GPU, then the games run on the Intel gpu. prime-run offloads the graphical work to the nvidia card.

Try this to be sure that it works:

glxinfo | grep "OpenGL renderer"

and

prime-run glxinfo | grep "OpenGL renderer"

after some testing, the graphical load does appear to get offloaded to the nvidia card, but the crashing and freezing persists, will now give the other things a look and see if I can find anything

@megavolt these are the 2 seconds right before the crash I believe, do you see anything that could be the issue

Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: AER: Corrected error received: 0000:00:1d.6
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:   device [8086:06b6] error status/mask=00001000/00002000
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:    [12] Timeout               
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: AER: Corrected error received: 0000:00:1d.6
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:   device [8086:06b6] error status/mask=00001000/00002000
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:    [12] Timeout               
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: AER: Corrected error received: 0000:00:1d.6
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:   device [8086:06b6] error status/mask=00001000/00002000
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:    [12] Timeout               
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: AER: Corrected error received: 0000:00:1d.6
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:   device [8086:06b6] error status/mask=00001000/00002000
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:    [12] Timeout               
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: AER: Corrected error received: 0000:00:1d.6
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:   device [8086:06b6] error status/mask=00001000/00002000
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:    [12] Timeout               
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: AER: Corrected error received: 0000:00:1d.6
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:   device [8086:06b6] error status/mask=00001000/00002000
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:    [12] Timeout               
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46500, resource id: 44040303, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46501, resource id: 44040305, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46502, resource id: 44040307, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46503, resource id: 44040309, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46504, resource id: 44040311, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46505, resource id: 44040313, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46506, resource id: 44040299, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46507, resource id: 44040301, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: AER: Corrected error received: 0000:00:1d.6
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:   device [8086:06b6] error status/mask=00001000/00002000
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:    [12] Timeout               
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: AER: Corrected error received: 0000:00:1d.6
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:   device [8086:06b6] error status/mask=00001000/00002000
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:    [12] Timeout               
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46689, resource id: 44040303, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46690, resource id: 44040305, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46691, resource id: 44040307, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46692, resource id: 44040309, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46693, resource id: 44040311, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46694, resource id: 44040313, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46695, resource id: 44040299, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46696, resource id: 44040301, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46698, resource id: 44040303, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46699, resource id: 44040305, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46700, resource id: 44040307, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46701, resource id: 44040309, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46702, resource id: 44040311, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46703, resource id: 44040313, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46704, resource id: 44040299, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46705, resource id: 44040301, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46779, resource id: 44040303, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46780, resource id: 44040305, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46781, resource id: 44040307, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46782, resource id: 44040309, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46783, resource id: 44040311, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46784, resource id: 44040313, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46785, resource id: 44040299, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46786, resource id: 44040301, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46788, resource id: 44040303, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46789, resource id: 44040305, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46790, resource id: 44040307, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46791, resource id: 44040309, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46792, resource id: 44040311, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46793, resource id: 44040313, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46794, resource id: 44040299, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46795, resource id: 44040301, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46798, resource id: 44040303, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46799, resource id: 44040305, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46800, resource id: 44040307, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46801, resource id: 44040309, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46802, resource id: 44040311, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46803, resource id: 44040313, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46804, resource id: 44040299, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46805, resource id: 44040301, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46807, resource id: 44040303, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46808, resource id: 44040305, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46809, resource id: 44040307, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46810, resource id: 44040309, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46811, resource id: 44040311, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46812, resource id: 44040313, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46813, resource id: 44040299, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kwin_x11[1534]: qt.qpa.xcb: QXcbConnection: XCB error: 9 (BadDrawable), sequence: 46814, resource id: 44040301, major code: 14 (GetGeometry), minor code: 0
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: AER: Corrected error received: 0000:00:1d.6
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:   device [8086:06b6] error status/mask=00001000/00002000
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:    [12] Timeout               
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: AER: Corrected error received: 0000:00:1d.6
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:   device [8086:06b6] error status/mask=00001000/00002000
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:    [12] Timeout               
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: AER: Corrected error received: 0000:00:1d.6
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:   device [8086:06b6] error status/mask=00001000/00002000
Jun 18 18:17:19 CursedLaptop kernel: pcieport 0000:00:1d.6:    [12] Timeout               
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6: AER: Corrected error received: 0000:00:1d.6
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6:   device [8086:06b6] error status/mask=00001000/00002000
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6:    [12] Timeout               
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6: AER: Corrected error received: 0000:00:1d.6
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6:   device [8086:06b6] error status/mask=00001000/00002000
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6:    [12] Timeout               
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6: AER: Corrected error received: 0000:00:1d.6
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6:   device [8086:06b6] error status/mask=00001000/00002000
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6:    [12] Timeout               
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6: AER: Corrected error received: 0000:00:1d.6
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6:   device [8086:06b6] error status/mask=00001000/00002000
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6:    [12] Timeout               
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6: AER: Corrected error received: 0000:00:1d.6
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6:   device [8086:06b6] error status/mask=00001000/00002000
Jun 18 18:17:20 CursedLaptop kernel: pcieport 0000:00:1d.6:    [12] Timeout 

@CursedAuroran

This could be indeed a problem. AER (Advanced Error Reporting) recived here an error at the communication at PCIE Bus, but it gets corrected: severity=Corrected. I would be worried if it is severity=Uncorrected, that would be a problem.

Either it is a real a error, or just communication problem between the kernel and the pcie root device. It is just a guess, but these errors occur when the bandwidth is lower than gpu can device connected to the pci slot can receive. So the nvidia driver tries use the full bandwidth, but actually reach the threshold and all data above the threshold are not rejected, but corrected by sending it again.

You can disable this reporting by adding:

pci=noaer

to the kernel parameter.

Maybe your laptop has a bad memory mapped config space at the UEFI. For the most, there are workarounds, but you can also add this to disable it:

pci=nommconf

or maybe the Message Signaled Interrupts have problems here… add:

pci=nomsi

However, non of the parameter can harm your hardware, but rather disable some functions which cause problem by faulty UEFI implementations. Add them step by step and watch the logs. And then you know where the problem comes.

@megavolt unfortunately those kernel parameters dont appear to do anything, with the issue persisting with both, I have tried setting up the proton logging you mentioned, but unfortunately, it doesnt appear that anything gets logged at all when I look for them after the crash

@CursedAuroran

Please upload then a full journal on an external service. I guess it is a deeper problem.

journalctl --since=yesterday | curl -F'file=@-' https://0x0.st

Or use another service and share the link here. Thanks.

https://0x0.st/-9S2.txt

I see nothing, what could cause this at the log and to be honest I had never such a problem ever on my laptops…

What maybe could also say more, are the Xorg logs:

for x in /var/log/Xorg*; do echo -e "---$x---\n\n $(cat $x)\n\n"; done 

Maybe someone with better knowledge about this problem will see it… but it seems to be a problem with the PCIE Interface.

Since you use also an NVME Drive, which is commonly connected via PCIE, I would would look at this aswell. Could be also a problem, where you need to re-seat the card (if possible).

However, it is really strange problem which I guess is more hardware related.

Have you tried different kernels?

mhwd-kernel -l

Have you checked the temperature? You know, it is summer…

nvidia-smi

Maybe a driver issue? Remove the nvidia driver and see if the issue persist at the logs with open source drivers only.

At the moment I can just suggest to tinker a bit, if you have time.

Maybe a problem of the ASPM?

pcie_aspm=off

pcie_aspm= [PCIE] Forcibly enable or disable PCIe Active State Power
Management.
off Disable ASPM.
force Enable ASPM even on devices that claim not to support it.
WARNING: Forcing ASPM on may cause system lockups.

https://www.kernel.org/doc/html/latest/admin-guide/kernel-parameters.html

I am tinkering, and I figured out that if I disable the ability of applications to block compositing it works just fine (or maybe I just got lucky and it didnt crash) so I am now also investigating different render backend options to see if they do something in that regard

right so, changing backend doesnt fix the issue, whereas disabling the applications their ability to change the composite does, but doing so results in major performance tanking and a near unplayability of native games

@megavolt after doing more searching I think I found the cause of the issue, or at least part of it. When proton games turn off the compositor in favour of performance, something happens that causes plasma/KDE to freak out and freeze up

@CursedAuroran Probably the cause of using opengl rendering instead of the software rendering? I remember there were issues when using KDE.

Yea its possible, going to take a look at potentially switching to a different DE and see if that works

WELP for some reason, now that I have closely investigated the issue with proton, its spreading to native games. Whoohoo. Time to get a second USB stick to back up important files and then switch to a different DE