System freezes, possibly Qt or GPU-related

Hello,

My system has been freezing randomly lately. journalctl --boot=-1 --priority=3 returns errors that happened quite some time before the crash, nothing at the time / right before.

First I thought that this was related to clementine player because my system had crashed several times shortly after I opened the applications. But I uninstalled clementine and the crash happened again some time after when I opened okular. Both applications are Qt-based so maybe this is a lead ?

Another information is that I started using a new screen recently, which I use through the integrated graphics + HDMI port of my motherboard.

Whenever this happens, it looks like I cannot do anything but hard reset… REISUB or trying to get to a tty do not work.

Here is the output of inxi -v7azy :

System:
  Kernel: 6.1.25-1-MANJARO arch: x86_64 bits: 64 compiler: gcc v: 12.2.1
    parameters: BOOT_IMAGE=/vmlinuz-6.1-x86_64
    root=UUID=f4d201c0-b0bb-4680-8b1c-968a0f6f5473 rw quiet apparmor=1
    security=apparmor sysrq_always_enabled=1
    resume=UUID=b15c10e7-131f-4aa9-9529-6e72b17ae442 udev.log_priority=3
  Desktop: i3 v: 4.22 info: i3bar vt: 7 dm: LightDM v: 1.32.0
    Distro: Manjaro Linux base: Arch Linux
Machine:
  Type: Desktop Mobo: Gigabyte model: H87-HD3 v: x.x
    serial: <superuser required> UEFI: American Megatrends v: F10
    date: 08/18/2015
Battery:
  Message: No system battery data found. Is one present?
Memory:
  RAM: total: 7.64 GiB used: 5.85 GiB (76.5%)
  RAM Report: permissions: Unable to run dmidecode. Root privileges required.
CPU:
  Info: model: Intel Core i5-4570 bits: 64 type: MCP arch: Haswell gen: core 4
    level: v3 note: check built: 2013-15 process: Intel 22nm family: 6
    model-id: 0x3C (60) stepping: 3 microcode: 0x28
  Topology: cpus: 1x cores: 4 smt: <unsupported> cache: L1: 256 KiB
    desc: d-4x32 KiB; i-4x32 KiB L2: 1024 KiB desc: 4x256 KiB L3: 6 MiB
    desc: 1x6 MiB
  Speed (MHz): avg: 939 high: 1362 min/max: 800/3600 scaling:
    driver: intel_cpufreq governor: schedutil cores: 1: 798 2: 798 3: 1362 4: 798
    bogomips: 25548
  Flags: abm acpi aes aperfmperf apic arat arch_perfmon avx avx2 bmi1 bmi2
    bts clflush cmov constant_tsc cpuid cpuid_fault cx16 cx8 de ds_cpl dtes64
    dtherm dts erms est f16c flush_l1d fma fpu fsgsbase fxsr ht ibpb ibrs ida
    invpcid invpcid_single lahf_lm lm mca mce md_clear mmx monitor movbe msr
    mtrr nonstop_tsc nopl nx pae pat pbe pcid pclmulqdq pdcm pdpe1gb pebs pge
    pln pni popcnt pse pse36 pti pts rdrand rdtscp rep_good sdbg sep smep smx
    ss ssbd sse sse2 sse4_1 sse4_2 ssse3 stibp syscall tm tm2 tsc tsc_adjust
    tsc_deadline_timer vme x2apic xsave xsaveopt xtopology xtpr
  Vulnerabilities:
  Type: itlb_multihit status: KVM: VMX unsupported
  Type: l1tf mitigation: PTE Inversion
  Type: mds mitigation: Clear CPU buffers; SMT disabled
  Type: meltdown mitigation: PTI
  Type: mmio_stale_data status: Unknown: No mitigations
  Type: retbleed status: Not affected
  Type: spec_store_bypass mitigation: Speculative Store Bypass disabled via
    prctl
  Type: spectre_v1 mitigation: usercopy/swapgs barriers and __user pointer
    sanitization
  Type: spectre_v2 mitigation: Retpolines, IBPB: conditional, IBRS_FW,
    STIBP: disabled, RSB filling, PBRSB-eIBRS: Not affected
  Type: srbds mitigation: Microcode
  Type: tsx_async_abort status: Not affected
Graphics:
  Device-1: Intel Xeon E3-1200 v3/4th Gen Core Processor Integrated Graphics
    vendor: Gigabyte driver: i915 v: kernel arch: Gen-7.5 process: Intel 22nm
    built: 2013 ports: active: HDMI-A-2 empty: HDMI-A-1,VGA-1 bus-ID: 00:02.0
    chip-ID: 8086:0412 class-ID: 0300
  Device-2: Microsoft LifeCam VX-5000 type: USB
    driver: snd-usb-audio,uvcvideo bus-ID: 3-4:3 chip-ID: 045e:0728
    class-ID: 0102
  Display: x11 server: X.Org v: 21.1.8 compositor: Picom v: git-b700a driver:
    X: loaded: modesetting alternate: fbdev,vesa dri: crocus gpu: i915
    display-ID: :0 screens: 1
  Screen-1: 0 s-res: 2160x3840 s-dpi: 96 s-size: 571x1015mm (22.48x39.96")
    s-diag: 1165mm (45.85")
  Monitor-1: HDMI-A-2 mapped: HDMI-2 model: Dell S2721QS serial: <filter>
    built: 2022 res: 2160x3840 hz: 30 dpi: 163 gamma: 1.2
    size: 336x597mm (13.23x23.5") diag: 685mm (27") ratio: 16:9 modes:
    max: 3840x2160 min: 720x400
  API: OpenGL Message: Unable to show GL data. Required tool glxinfo missing.
Audio:
  Device-1: Intel Xeon E3-1200 v3/4th Gen Core Processor HD Audio
    driver: snd_hda_intel v: kernel bus-ID: 3-4:3 chip-ID: 045e:0728
    class-ID: 0102
  Device-2: Intel 8 Series/C220 Series High Definition Audio vendor: Gigabyte
    driver: snd_hda_intel v: kernel bus-ID: 00:1b.0 chip-ID: 8086:8c20
    class-ID: 0403
  Device-3: Microsoft LifeCam VX-5000 type: USB
    driver: snd-usb-audio,uvcvideo
  Device-4: Focusrite-Novation Scarlett 2i4 type: USB driver: snd-usb-audio
    bus-ID: 3-5:4 chip-ID: 1235:800a class-ID: fe01
  API: ALSA v: k6.1.25-1-MANJARO status: kernel-api with: aoss
    type: oss-emulator tools: alsamixer,amixer
  Server-1: JACK v: 1.9.22 status: off tools: cadence,jack_control
  Server-2: PipeWire v: 0.3.70 status: off tools: pw-cli
  Server-3: PulseAudio v: 16.1 status: active with: pulseaudio-jack
    type: module tools: pacat,pactl,pavucontrol
Network:
  Device-1: Realtek RTL8111/8168/8411 PCI Express Gigabit Ethernet
    vendor: Gigabyte driver: r8169 v: kernel pcie: gen: 1 speed: 2.5 GT/s
    lanes: 1 port: e000 bus-ID: 02:00.0 chip-ID: 10ec:8168 class-ID: 0200
  IF: enp2s0 state: down mac: <filter>
  Device-2: TP-Link Archer T4UH v2 [Realtek RTL8812AU] type: USB
    driver: rtl88XXau bus-ID: 3-9:6 chip-ID: 2357:010e class-ID: 0000
    serial: <filter>
  IF: wlp0s20u9 state: up mac: <filter>
  IP v4: <filter> type: dynamic noprefixroute scope: global
    broadcast: <filter>
  IP v6: <filter> type: noprefixroute scope: link
  WAN IP: <filter>
Bluetooth:
  Message: No bluetooth data found.
Logical:
  Message: No logical block device data found.
RAID:
  Hardware-1: Intel SATA Controller [RAID mode] driver: ahci v: 3.0 port: f060
    bus-ID: 00:1f.2 chip-ID: 8086:2822 rev: N/A class-ID: 0104
Drives:
  Local Storage: total: 1.14 TiB used: 299.09 GiB (25.7%)
  SMART Message: Required tool smartctl not installed. Check --recommends
  ID-1: /dev/nvme0n1 maj-min: 259:0 vendor: Samsung model: SSD 980 250GB
    size: 232.89 GiB block-size: physical: 512 B logical: 512 B speed: 31.6 Gb/s
    lanes: 4 type: SSD serial: <filter> rev: 2B4QFXO7 temp: 23.9 C scheme: GPT
  ID-2: /dev/sda maj-min: 8:0 vendor: Seagate model: ST1000DM003-1SB10C
    size: 931.51 GiB block-size: physical: 4096 B logical: 512 B speed: 6.0 Gb/s
    type: HDD rpm: 7200 serial: <filter> rev: CC41 scheme: GPT
  Optical-1: /dev/sr0 vendor: TSSTcorp model: CDDVDW SH-S203B rev: SB00
    dev-links: cdrom
  Features: speed: 48 multisession: yes audio: yes dvd: yes
    rw: cd-r,cd-rw,dvd-r,dvd-ram state: running
Partition:
  ID-1: / raw-size: 232.88 GiB size: 228.17 GiB (97.98%)
    used: 35.75 GiB (15.7%) fs: ext4 dev: /dev/nvme0n1p1 maj-min: 259:1
    label: root uuid: f4d201c0-b0bb-4680-8b1c-968a0f6f5473
  ID-2: /boot raw-size: 1024 MiB size: 973.4 MiB (95.06%)
    used: 77.7 MiB (8.0%) fs: ext4 dev: /dev/sda2 maj-min: 8:2 label: boot
    uuid: 448567bd-b6fa-40b0-a2a2-56c27ac9254b
  ID-3: /boot/efi raw-size: 300 MiB size: 299.4 MiB (99.80%)
    used: 312 KiB (0.1%) fs: vfat dev: /dev/sda1 maj-min: 8:1 label: EFS
    uuid: 20A7-A589
  ID-4: /home raw-size: 921.68 GiB size: 906.14 GiB (98.31%)
    used: 263.26 GiB (29.1%) fs: ext4 dev: /dev/sda3 maj-min: 8:3 label: home
    uuid: 2050cf81-01da-430f-a9a5-bc98f20ae434
Swap:
  Kernel: swappiness: 60 (default) cache-pressure: 100 (default)
  ID-1: swap-1 type: partition size: 8.54 GiB used: 524 KiB (0.0%)
    priority: -2 dev: /dev/sda4 maj-min: 8:4 label: N/A
    uuid: b15c10e7-131f-4aa9-9529-6e72b17ae442
Unmounted:
  Message: No unmounted partitions found.
USB:
  Hub-1: 1-0:1 info: Full speed or root hub ports: 2 rev: 2.0 speed: 480 Mb/s
    chip-ID: 1d6b:0002 class-ID: 0900
  Hub-2: 1-1:2 info: Intel Integrated Rate Matching Hub ports: 6 rev: 2.0
    speed: 480 Mb/s chip-ID: 8087:8008 class-ID: 0900
  Hub-3: 2-0:1 info: Full speed or root hub ports: 2 rev: 2.0 speed: 480 Mb/s
    chip-ID: 1d6b:0002 class-ID: 0900
  Hub-4: 2-1:2 info: Intel Integrated Rate Matching Hub ports: 8 rev: 2.0
    speed: 480 Mb/s chip-ID: 8087:8000 class-ID: 0900
  Hub-5: 3-0:1 info: Hi-speed hub with single TT ports: 14 rev: 2.0
    speed: 480 Mb/s chip-ID: 1d6b:0002 class-ID: 0900
  Device-1: 3-3:2 info: Holtek Keyboard [Diatec Majestouch 2 Tenkeyless]
    type: Keyboard,Mouse driver: hid-generic,usbhid interfaces: 2 rev: 1.1
    speed: 1.5 Mb/s power: 100mA chip-ID: 04d9:4545 class-ID: 0301
  Device-2: 3-4:3 info: Microsoft LifeCam VX-5000 type: Video,Audio
    driver: snd-usb-audio,uvcvideo interfaces: 4 rev: 2.0 speed: 480 Mb/s
    power: 500mA chip-ID: 045e:0728 class-ID: 0102
  Device-3: 3-5:4 info: Focusrite-Novation Scarlett 2i4 type: Audio
    driver: snd-usb-audio interfaces: 6 rev: 2.0 speed: 480 Mb/s power: 500mA
    chip-ID: 1235:800a class-ID: fe01
  Device-4: 3-6:5 info: Microsoft IntelliMouse Optical type: Mouse
    driver: hid-generic,usbhid interfaces: 1 rev: 1.1 speed: 1.5 Mb/s
    power: 100mA chip-ID: 045e:0039 class-ID: 0301
  Device-5: 3-9:6 info: TP-Link Archer T4UH v2 [Realtek RTL8812AU]
    type: Network driver: rtl88XXau interfaces: 1 rev: 2.0 speed: 480 Mb/s
    power: 500mA chip-ID: 2357:010e class-ID: 0000 serial: <filter>
  Hub-6: 4-0:1 info: Super-speed hub ports: 6 rev: 3.0 speed: 5 Gb/s
    chip-ID: 1d6b:0003 class-ID: 0900
Sensors:
  System Temperatures: cpu: 33.0 C mobo: N/A
  Fan Speeds (RPM): N/A
Info:
  Processes: 190 Uptime: 39m wakeups: 0 Init: systemd v: 252 default: graphical
  tool: systemctl Compilers: gcc: 12.2.1 clang: 15.0.7 Packages: pm: pacman
  pkgs: 1654 libs: 376 tools: pamac,yay Shell: Bash v: 5.1.16 running-in: tmux:
  inxi: 3.3.26

Thanks !

EDIT : I think the Qt lead may be the right one, since the computer freezes seconds after I open okular. But still no error logged inside the system journal. Any advice on where to look for some info ?

try an older kernel, like the 5.15, since you have a older machine…
if it doesnt help, and you have to hard reset provide logs:
journalctl -b-1 -p4 --no-pager

Thanks, I will try an older kernel, although I doubt this could be causing the issue since I installed a fresh 6.x Manjaro only a few weeks back.

Unfortunately, nothing about Okular inside the system journal even using log level up to 7.

Here is the output with level 4:

avril 23 17:11:28 shibboleth kernel: x86/cpu: VMX (outside TXT) disabled by BIOS
avril 23 17:11:28 shibboleth systemd-journald[329]: File /var/log/journal/bd164d542f904e12b7cd02d8fae1532d/system.journal corrupted or uncleanly shut down, renaming>
avril 23 17:11:28 shibboleth kernel: ACPI Warning: SystemIO range 0x0000000000001828-0x000000000000182F conflicts with OpRegion 0x0000000000001800-0x000000000000187>
avril 23 17:11:28 shibboleth kernel: ACPI Warning: SystemIO range 0x0000000000001C40-0x0000000000001C4F conflicts with OpRegion 0x0000000000001C00-0x0000000000001FF>
avril 23 17:11:28 shibboleth kernel: ACPI Warning: SystemIO range 0x0000000000001C30-0x0000000000001C3F conflicts with OpRegion 0x0000000000001C00-0x0000000000001C3>
avril 23 17:11:28 shibboleth kernel: ACPI Warning: SystemIO range 0x0000000000001C30-0x0000000000001C3F conflicts with OpRegion 0x0000000000001C00-0x0000000000001FF>
avril 23 17:11:28 shibboleth kernel: ACPI Warning: SystemIO range 0x0000000000001C00-0x0000000000001C2F conflicts with OpRegion 0x0000000000001C00-0x0000000000001C3>
avril 23 17:11:28 shibboleth kernel: ACPI Warning: SystemIO range 0x0000000000001C00-0x0000000000001C2F conflicts with OpRegion 0x0000000000001C00-0x0000000000001FF>
avril 23 17:11:28 shibboleth kernel: lpc_ich: Resource conflict(s) found affecting gpio_ich
avril 23 17:11:28 shibboleth kernel: r8169 0000:02:00.0: can't disable ASPM; OS doesn't have ASPM control
avril 23 17:11:28 shibboleth kernel: 88XXau: loading out-of-tree module taints kernel.
avril 23 17:11:29 shibboleth kernel: at24 0-0051: supply vcc not found, using dummy regulator
avril 23 17:11:29 shibboleth kernel: at24 0-0053: supply vcc not found, using dummy regulator
avril 23 17:11:29 shibboleth kernel: kauditd_printk_skb: 64 callbacks suppressed
avril 23 17:11:41 shibboleth kernel: kauditd_printk_skb: 3 callbacks suppressed
avril 23 17:11:47 shibboleth kernel: kauditd_printk_skb: 49 callbacks suppressed
avril 23 17:11:48 shibboleth lightdm[708]: gkr-pam: unable to locate daemon control file
avril 23 17:11:50 shibboleth wpa_supplicant[633]: bgscan simple: Failed to enable signal strength monitoring
avril 23 17:11:52 shibboleth rtkit-daemon[931]: Warning: Reached burst limit for user '1000', denying request.
avril 23 17:11:52 shibboleth rtkit-daemon[931]: Warning: Reached burst limit for user '1000', denying request.
avril 23 17:11:52 shibboleth rtkit-daemon[931]: Warning: Reached burst limit for user '1000', denying request.
avril 23 17:11:52 shibboleth rtkit-daemon[931]: Warning: Reached burst limit for user '1000', denying request.
avril 23 17:11:52 shibboleth rtkit-daemon[931]: Warning: Reached burst limit for user '1000', denying request.
avril 23 17:11:52 shibboleth rtkit-daemon[931]: Warning: Reached burst limit for user '1000', denying request.
avril 23 17:11:52 shibboleth pulseaudio[928]: stat('/etc/pulse/default.pa.d'): No such file or directory
avril 23 17:11:53 shibboleth kernel: kauditd_printk_skb: 11 callbacks suppressed
avril 23 17:12:01 shibboleth ModemManager[575]: <warn>  [plugin/generic] could not grab port ttyS4: Cannot add port 'tty/ttyS4', unhandled port type
avril 23 17:12:01 shibboleth ModemManager[575]: <warn>  [base-manager] couldn't create modem for device '/sys/devices/pci0000:00/0000:00:16.3': Failed to find prima>
avril 23 17:12:17 shibboleth pulseaudio[928]: GetManagedObjects() failed: org.freedesktop.DBus.Error.NoReply: Did not receive a reply. Possible causes include: the >
avril 23 17:13:55 shibboleth gnome-keyring-daemon[747]: asked to register item /org/freedesktop/secrets/collection/login/2, but it's already registered
avril 23 17:13:55 shibboleth gnome-keyring-d[747]: asked to register item /org/freedesktop/secrets/collection/login/2, but it's already registered
avril 23 17:25:28 shibboleth pulseaudio[928]: ALSA woke us up to write new data to the device, but there was actually nothing to write.
avril 23 17:25:28 shibboleth pulseaudio[928]: Most likely this is a bug in the ALSA driver 'snd_usb_audio'. Please report this issue to the ALSA developers.
avril 23 17:25:28 shibboleth pulseaudio[928]: We were woken up with POLLOUT set -- however a subsequent snd_pcm_avail() returned 0 or another value < min_avail.
avril 23 17:27:56 shibboleth pinentry-gtk-2[2763]: Unable to locate theme engine in module_path: "murrine",
avril 23 17:27:56 shibboleth pinentry-gtk-2[2763]: Unable to locate theme engine in module_path: "murrine",
avril 23 17:27:56 shibboleth pinentry-gtk-2[2763]: Unable to locate theme engine in module_path: "murrine",

Crash occured at 18:06

well nothing in logs … so try the 5.15 kernel …

I wonder if theres something funny with the qtconf ?

stat ~/.config/QtProject.conf

Here you go:

~$ stat .config/QtProject.conf 
  File: .config/QtProject.conf
  Size: 734             Blocks: 8          IO Block: 4096   regular file
Device: 8,3     Inode: 9982512     Links: 1
Access: (0644/-rw-r--r--)  Uid: ( 1000/ raphael)   Gid: ( 1000/ raphael)
Access: 2023-04-23 17:08:35.042302441 +0200
Modify: 2023-04-23 17:08:35.042302441 +0200
Change: 2023-04-23 17:08:35.078969015 +0200
 Birth: 2023-04-23 17:08:35.042302441 +0200

Not that I can see. Sometime if its really big it can be evidence of an issue … but not here.

What qt packages are installed?

pacman -Qs qt

I recall recently a scenario where a system crashed on certain events and it was found to be caused by qt5ct while the system contained up to Qt6.

Thanks for your answer, sorry for the delay as I was away for a few days.

It seems that you’re right, I have packages both in the qt and qt6 groups:

pacman -Qg qt output

qt qt5-base
qt qt5-declarative
qt qt5-graphicaleffects
qt qt5-location
qt qt5-multimedia
qt qt5-networkauth
qt qt5-quickcontrols
qt qt5-quickcontrols2
qt qt5-speech
qt qt5-svg
qt qt5-translations
qt qt5-wayland
qt qt5-webchannel
qt qt5-webengine
qt qt5-websockets
qt qt5-x11extras
qt qt5-xmlpatterns

pacman -Qg qt6 output

qt6 qt6-5compat
qt6 qt6-base
qt6 qt6-declarative
qt6 qt6-imageformats
qt6 qt6-svg
qt6 qt6-translations
qt6 qt6-wayland

Should I then install qt6ct in place of qt5ct ?

That was the solution in the other case.
I’d give it a whirl … the desktop is crashing as is anyways, right?

My system has not crashed since my last boot, even when I launch okular :person_shrugging: The crash is less reproducible than it had appeared to be initially…

Thanks a lot for your suggestions anyway. I will close the thread if I do not encounter more crashes in the following days.

Well, two more crashes yesterday. And I wasn’t using any Qt-based app as far as I can tell. Still nothing in the system journal…

This time, I was scrolling down my music library in quod libet. Could this be my integrated graphics having a hard time handling the rendering of lots of album covers images ?

For the record, I’m currently using my mobo HDMI port. It is a little old and can only display 3840*2160 at 30Hz so I get a not-so-smooth experience with a bit of screen tearing. I ordered a little GPU card that is able to handle this resolution at 60Hz, so I hope this will solve the problem at the same time if this is indeed a graphics problem.