Weird complete system freezes

Hello there good fellows from the community,

I have a particular freezing issue with Manjaro GNOME, and I would really appreciate it if you could spare a few minutes to help me out.

Warning: I am a Linux noob. I came to Manjaro from Ubuntu Budgie.

The gist of the problem:

Manjaro generally works fine. However, on particular days, typically right after booting to the desktop (GNOME), the whole system freezes completely. Nothing helps: CTRL+ALT+DEL, CTRL+F1, ALT+F1, ALT+Print Screen+REISUB, etc. The system stays completely frozen, even if I leave it alone for tens of minutes.

The only thing I kinda feel might be the cause is my laptop being cold. But then again, it is not so cold that it should result in a system freeze.

What I’d like to know is whether there’s some way to determine what exactly the problem is. How can I turn on logging for everything, and then interpret it, so that I can find the culprit?

My system is Intel + NVIDIA GTX 1650 (proprietary driver; I can’t check which version right now because the system froze yet again), just a plain laptop.

The thing is that most of the time (95%+) the system works just fine without any issues whatsoever. But when it freezes, it freezes completely and only a reboot helps.

Thanks,
A

P.S. Besides this issue, I want to express my gratitude to the Manjaro community for delivering this amazing OS. I am truly amazed at how a community can create something that looks and works so well. Respect.

Hello,

When your laptop is not frozen, try to gather and share the information as described here: How to provide good information

OK, will try. In the meantime, it seems that freezing happens when my laptop is unplugged from AC power.

OK, I finally got around to providing the info. My computer was fine for a while, but yesterday, and especially today, it started freezing again all of a sudden. It was always plugged in, so that’s not the issue now. I was just opening websites, opening the NVIDIA control panel, regular stuff like that.

Here’s the output of inxi:

System:
  Kernel: 5.14.18-1-MANJARO x86_64 bits: 64 compiler: gcc v: 11.1.0
  parameters: BOOT_IMAGE=/boot/vmlinuz-5.14-x86_64
  root=UUID=a3af1144-0e10-4f8a-8976-02dafbc41ef5 rw quiet splash apparmor=1
  security=apparmor udev.log_priority=3
  Desktop: GNOME 41.1 tk: GTK 3.24.30 wm: gnome-shell dm: GDM 41.0
  Distro: Manjaro Linux base: Arch Linux
Machine:
  Type: Laptop System: LENOVO product: 81LL v: IdeaPad L340-17IRH Gaming
  serial: <superuser required> Chassis: type: 10 v: IdeaPad L340-17IRH Gaming
  serial: <superuser required>
  Mobo: LENOVO model: LNVNB161216 v: NO DPK serial: <superuser required>
  UEFI: LENOVO v: BGCN24WW date: 08/19/2019
Battery:
  ID-1: BAT0 charge: 39.3 Wh (100.0%) condition: 39.3/45.0 Wh (87.4%)
  volts: 12.7 min: 11.4 model: SMP L18M3PF1 type: Li-poly serial: <filter>
  status: Full cycles: 331
Memory:
  RAM: total: 7.65 GiB used: 2.47 GiB (32.2%)
  RAM Report: permissions: Unable to run dmidecode. Root privileges required.
CPU:
  Info: 6-Core model: Intel Core i7-9750H bits: 64 type: MT MCP
  arch: Kaby Lake note: check family: 6 model-id: 9E (158) stepping: A (10)
  microcode: EA cache: L1: 384 KiB L2: 1.5 MiB L3: 12 MiB bogomips: 62431
  Speed: 935 MHz min/max: 800/4500 MHz Core speeds (MHz): 1: 900 2: 907 3: 863
  4: 900 5: 897 6: 895 7: 832 8: 869 9: 897 10: 900 11: 900 12: 900
  Flags: 3dnowprefetch abm acpi adx aes aperfmperf apic arat arch_perfmon art
  avx avx2 bmi1 bmi2 bts clflush clflushopt cmov constant_tsc cpuid
  cpuid_fault cx16 cx8 de ds_cpl dtes64 dtherm dts epb ept ept_ad erms est
  f16c flexpriority flush_l1d fma fpu fsgsbase fxsr ht hwp hwp_act_window
  hwp_epp hwp_notify ibpb ibrs ida intel_pt invpcid invpcid_single lahf_lm lm
  mca mce md_clear mmx monitor movbe mpx msr mtrr nonstop_tsc nopl nx pae pat
  pbe pcid pclmulqdq pdcm pdpe1gb pebs pge pln pni popcnt pse pse36 pti pts
  rdrand rdseed rdtscp rep_good sdbg sep smap smep ss ssbd sse sse2 sse4_1
  sse4_2 ssse3 stibp syscall tm tm2 tpr_shadow tsc tsc_adjust
  tsc_deadline_timer vme vmx vnmi vpid x2apic xgetbv1 xsave xsavec xsaveopt
  xsaves xtopology xtpr
  Vulnerabilities: Type: itlb_multihit status: KVM: VMX disabled
  Type: l1tf
  mitigation: PTE Inversion; VMX: conditional cache flushes, SMT vulnerable
  Type: mds mitigation: Clear CPU buffers; SMT vulnerable
  Type: meltdown mitigation: PTI
  Type: spec_store_bypass
  mitigation: Speculative Store Bypass disabled via prctl and seccomp
  Type: spectre_v1
  mitigation: usercopy/swapgs barriers and __user pointer sanitization
  Type: spectre_v2 mitigation: Full generic retpoline, IBPB: conditional,
  IBRS_FW, STIBP: conditional, RSB filling
  Type: srbds mitigation: Microcode
  Type: tsx_async_abort status: Not affected
Graphics:
  Device-1: Intel CoffeeLake-H GT2 [UHD Graphics 630] vendor: Lenovo
  driver: i915 v: kernel bus-ID: 00:02.0 chip-ID: 8086:3e9b class-ID: 0300
  Device-2: NVIDIA TU117M [GeForce GTX 1650 Mobile / Max-Q] vendor: Lenovo
  driver: nvidia v: 495.44 alternate: nouveau,nvidia_drm bus-ID: 01:00.0
  chip-ID: 10de:1f91 class-ID: 0300
  Device-3: Lite-On Integrated Camera type: USB driver: uvcvideo bus-ID: 1-8:3
  chip-ID: 04ca:7070 class-ID: 0e02
  Display: x11 server: X.org 1.21.1.1 compositor: gnome-shell driver:
  loaded: modesetting,nvidia resolution: <missing: xdpyinfo>
  OpenGL: renderer: NVIDIA GeForce GTX 1650/PCIe/SSE2 v: 4.6.0 NVIDIA 495.44
  direct render: Yes
Audio:
  Device-1: Intel Cannon Lake PCH cAVS vendor: Lenovo driver: snd_hda_intel
  v: kernel alternate: snd_soc_skl,snd_sof_pci_intel_cnl bus-ID: 00:1f.3
  chip-ID: 8086:a348 class-ID: 0403
  Device-2: NVIDIA driver: snd_hda_intel v: kernel bus-ID: 01:00.1
  chip-ID: 10de:10fa class-ID: 0403
  Sound Server-1: ALSA v: k5.14.18-1-MANJARO running: yes
  Sound Server-2: JACK v: 1.9.19 running: no
  Sound Server-3: PulseAudio v: 15.0 running: yes
  Sound Server-4: PipeWire v: 0.3.40 running: yes
Network:
  Device-1: Realtek RTL8821CE 802.11ac PCIe Wireless Network Adapter
  vendor: Lenovo driver: rtw_8821ce v: N/A modules: rtw88_8821ce port: 4000
  bus-ID: 07:00.0 chip-ID: 10ec:c821 class-ID: 0280
  IF: wlp7s0 state: up mac: <filter>
  IP v4: <filter> type: dynamic noprefixroute scope: global
  broadcast: <filter>
  Device-2: Realtek RTL8111/8168/8411 PCI Express Gigabit Ethernet
  vendor: Lenovo driver: r8169 v: kernel port: 3000 bus-ID: 08:00.0
  chip-ID: 10ec:8168 class-ID: 0200
  IF: enp8s0 state: down mac: <filter>
  WAN IP: No WAN IP found. Connected to web? SSL issues? Try enabling dig
Bluetooth:
  Device-1: Realtek Bluetooth Radio type: USB driver: btusb v: 0.8
  bus-ID: 1-14:4 chip-ID: 0bda:c024 class-ID: e001 serial: <filter>
  Report: rfkill ID: hci0 rfk-id: 3 state: down bt-service: enabled,running
  rfk-block: hardware: no software: yes address: see --recommends
Logical:
  Message: No logical block device data found.
RAID:
  Message: No RAID data found.
Drives:
  Local Storage: total: 1.38 TiB used: 376.15 GiB (26.7%)
  SMART Message: Required tool smartctl not installed. Check --recommends
  ID-1: /dev/nvme0n1 maj-min: 259:0 vendor: Intel model: SSDPEKNW512G8L
  size: 476.94 GiB block-size: physical: 512 B logical: 512 B speed: 31.6 Gb/s
  lanes: 4 type: SSD serial: <filter> rev: L02C temp: 29.9 C scheme: MBR
  ID-2: /dev/sda maj-min: 8:0 vendor: Western Digital model: WD10JPVX-22JC3T0
  size: 931.51 GiB block-size: physical: 4096 B logical: 512 B speed: 6.0 Gb/s
  type: HDD rpm: 5400 serial: <filter> rev: 1A01 scheme: MBR
  Message: No optical or floppy data found.
Partition:
  ID-1: / raw-size: 37.68 GiB size: 36.79 GiB (97.64%) used: 29.23 GiB (79.5%)
  fs: ext4 dev: /dev/nvme0n1p3 maj-min: 259:3 label: N/A
  uuid: a3af1144-0e10-4f8a-8976-02dafbc41ef5
  ID-2: /boot/efi raw-size: 548 MiB size: 546.9 MiB (99.80%)
  used: 280 KiB (0.0%) fs: vfat dev: /dev/nvme0n1p1 maj-min: 259:1
  label: BOOT-EFI uuid: 3B4A-90A5
  ID-3: /home raw-size: 427.45 GiB size: 419.74 GiB (98.20%)
  used: 346.91 GiB (82.7%) fs: ext4 dev: /dev/nvme0n1p2 maj-min: 259:2
  label: N/A uuid: 936d3b36-18d3-4d66-9d7c-af46af9a9248
Swap:
  Alert: No swap data was found.
Unmounted:
  ID-1: /dev/sda1 maj-min: 8:1 size: 350 MiB fs: ntfs label: System Reserved
  uuid: C27A42567A4246FF
  ID-2: /dev/sda2 maj-min: 8:2 size: 116.85 GiB fs: ntfs label: N/A
  uuid: 86B012E9B012E00B
  ID-3: /dev/sda3 maj-min: 8:3 size: 814.32 GiB fs: ntfs label: N/A
  uuid: 60620E0B620DE69E
USB:
  Hub-1: 1-0:1 info: Hi-speed hub with single TT ports: 16 rev: 2.0
  speed: 480 Mb/s chip-ID: 1d6b:0002 class-ID: 0900
  Device-1: 1-1:2 info: Logitech M105 Optical Mouse type: Mouse
  driver: hid-generic,usbhid interfaces: 1 rev: 2.0 speed: 1.5 Mb/s
  power: 100mA chip-ID: 046d:c077 class-ID: 0301
  Device-2: 1-8:3 info: Lite-On Integrated Camera type: Video driver: uvcvideo
  interfaces: 2 rev: 2.0 speed: 480 Mb/s power: 500mA chip-ID: 04ca:7070
  class-ID: 0e02
  Device-3: 1-14:4 info: Realtek Bluetooth Radio type: Bluetooth driver: btusb
  interfaces: 2 rev: 1.1 speed: 12 Mb/s power: 500mA chip-ID: 0bda:c024
  class-ID: e001 serial: <filter>
  Hub-2: 2-0:1 info: Super-speed hub ports: 8 rev: 3.1 speed: 10 Gb/s
  chip-ID: 1d6b:0003 class-ID: 0900
Sensors:
  System Temperatures: cpu: 45.0 C pch: 45.0 C mobo: N/A gpu: nvidia
  temp: 40 C
  Fan Speeds (RPM): N/A
Info:
  Processes: 332 Uptime: 10m wakeups: 1 Init: systemd v: 249 tool: systemctl
  Compilers: gcc: 11.1.0 Packages: 1485 pacman: 1454 lib: 393 flatpak: 21
  snap: 10 Shell: Zsh v: 5.8 running-in: gnome-terminal inxi: 3.3.09

Do you guys see anything suspicious here?

Please guys, can anyone help me out with this or at least point me where I should start reading? My computer is almost unusable because of these random freezes.

Alright, after a bit of troubleshooting, I figured out that there’s some issue with NetworkManager.

First, I turned off WiFi and pulled the logs with journalctl -o short-precise -k -b -1 through -b -5. Here they are:

nov 30 17:01:00.394244 [username] kernel: audit: type=1130 audit(1638288060.387:134): pid=1 uid=0 auid=4294967295 ses=4294967295 subj==unconfined msg='unit=NetworkManager-dispatcher comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'

nov 30 17:00:06.063119 [username] kernel: audit: type=1131 audit(1638288006.056:252): pid=1 uid=0 auid=4294967295 ses=4294967295 subj==unconfined msg='unit=NetworkManager-dispatcher comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'

nov 30 16:40:20.053956 [username] kernel: audit: type=1130 audit(1638286820.047:236): pid=1 uid=0 auid=4294967295 ses=4294967295 subj==unconfined msg='unit=NetworkManager-dispatcher comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'

nov 30 16:40:30.070550 [username] kernel: audit: type=1131 audit(1638286830.064:237): pid=1 uid=0 auid=4294967295 ses=4294967295 subj==unconfined msg='unit=NetworkManager-dispatcher comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'

nov 30 16:17:34.067196 [username] kernel: audit: type=1131 audit(1638285454.060:213): pid=1 uid=0 auid=4294967295 ses=4294967295 subj==unconfined msg='unit=NetworkManager-dispatcher comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'

As you can see, they all end with the same kind of entry.
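For anyone repeating this check, here is a rough sketch of how the per-boot log pull could be scripted. The function names drop_audit_noise and kernel_log_last_boots are made up for illustration, and the idea of filtering out the audit records is an assumption on my part: type=1130 and type=1131 with res=success look like routine service start/stop records, so hiding them may make driver messages easier to spot.

```shell
#!/bin/sh
# Hypothetical helper: hide kernel audit records read from stdin
# so that other kernel messages stand out.
drop_audit_noise() {
    grep -v 'kernel: audit:'
}

# Hypothetical helper: print the kernel log of the last N boots
# (default 5), one boot at a time, with audit records removed.
kernel_log_last_boots() {
    count="${1:-5}"
    n=1
    while [ "$n" -le "$count" ]; do
        echo "=== boot -$n ==="
        journalctl -o short-precise -k -b "-$n" --no-pager | drop_audit_noise
        n=$((n + 1))
    done
}
```

After sourcing the file, kernel_log_last_boots 5 would cover the same five boots as above. If fewer boots are stored in the journal, journalctl simply reports an error for the missing ones.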

Then I turned on WiFi, and my system immediately froze. After a hard reboot, I pulled journalctl -o short-precise -k -b -1 again, and this is what I got:

nov 30 17:57:44.610232 [username] kernel: audit: type=1334 audit(1638291464.604:230): prog-id=40 op=LOAD
nov 30 17:57:44.610282 [username] kernel: audit: type=1334 audit(1638291464.604:231): prog-id=41 op=LOAD
nov 30 17:57:44.610306 [username] kernel: audit: type=1334 audit(1638291464.604:232): prog-id=42 op=LOAD
nov 30 17:57:44.693622 [username] kernel: audit: type=1130 audit(1638291464.687:233): pid=1 uid=0 auid=4294967295 ses=4294967295 subj==unconfined msg='unit=systemd-hostnamed comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
nov 30 17:58:14.745047 [username] kernel: audit: type=1131 audit(1638291494.734:234): pid=1 uid=0 auid=4294967295 ses=4294967295 subj==unconfined msg='unit=systemd-hostnamed comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
nov 30 17:58:15.023794 [username] kernel: audit: type=1334 audit(1638291495.017:235): prog-id=42 op=UNLOAD
nov 30 17:58:15.024028 [username] kernel: audit: type=1334 audit(1638291495.017:236): prog-id=41 op=UNLOAD
nov 30 17:58:15.024114 [username] kernel: audit: type=1334 audit(1638291495.017:237): prog-id=40 op=UNLOAD
nov 30 18:02:42.400204 [username] kernel: audit: type=1334 audit(1638291762.394:238): prog-id=43 op=LOAD
nov 30 18:02:42.400258 [username] kernel: audit: type=1334 audit(1638291762.394:239): prog-id=44 op=LOAD
nov 30 18:02:42.400285 [username] kernel: audit: type=1334 audit(1638291762.394:240): prog-id=45 op=LOAD
nov 30 18:02:42.480211 [username] kernel: audit: type=1130 audit(1638291762.474:241): pid=1 uid=0 auid=4294967295 ses=4294967295 subj==unconfined msg='unit=systemd-localed comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
nov 30 18:03:12.516958 [username] kernel: audit: type=1131 audit(1638291792.510:242): pid=1 uid=0 auid=4294967295 ses=4294967295 subj==unconfined msg='unit=systemd-localed comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
nov 30 18:03:12.703574 [username] kernel: audit: type=1334 audit(1638291792.697:243): prog-id=45 op=UNLOAD
nov 30 18:03:12.703631 [username] kernel: audit: type=1334 audit(1638291792.697:244): prog-id=44 op=UNLOAD
nov 30 18:03:12.703665 [username] kernel: audit: type=1334 audit(1638291792.697:245): prog-id=43 op=UNLOAD

So, again, something similar.

What do you guys suggest? Should I downgrade/upgrade the kernel? Or can I somehow repair this NetworkManager module or whatever it is? Username is replaced with [username] above.

Thanks,
A

OK, did some further digging. It seems that my RTL8821CE wireless card is sh*t. :smiley:

I downgraded my kernel to 5.10.79 LTS and had no issues for some hours. But then, when I put a sudden heavy load on my internet connection, the system froze again. This time, though, by some miracle it unfroze after some combination of CTRL+ALT+DEL, CTRL+ALT+Backspace, CTRL+ALT+?, ALT+PrtSc+REISUB, CTRL+ALT+F1 to F7, etc.

I was dropped to a console, where I luckily remembered the command to see the log: journalctl -o short-precise -k -b -0 (meaning the current boot).

Immediately after the freeze, this comes up:

nov 30 19:36:53.332315 [username] kernel: audit: type=1130 audit(1638297413.326:224): pid=1 uid=0 auid=4294967295 ses=4294967295 subj==unconfined msg='unit=NetworkManager-dispatcher comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
nov 30 19:36:53.585616 [username] kernel: ------------[ cut here ]------------
nov 30 19:36:53.585717 [username] kernel: purge skb(s) not reported by firmware
nov 30 19:36:53.585745 [username] kernel: WARNING: CPU: 8 PID: 0 at drivers/net/wireless/realtek/rtw88/tx.c:161 rtw_tx_report_purge_timer+0x20/0x50 [rtw88_core]

and then further down below:

nov 30 19:36:53.585876 [username] kernel: CPU: 8 PID: 0 Comm: swapper/8 Tainted: P W OE 5.10.79-1-MANJARO #1
nov 30 19:36:53.585899 [username] kernel: Hardware name: LENOVO [redacted], BIOS [redacted] 08/19/2019
nov 30 19:36:53.585924 [username] kernel: RIP: 0010:rtw_tx_report_purge_timer+0x20/0x50 [rtw88_core]

This seems to be a known problem with either the wireless card or with the driver. For example, people at Debian also reported this problem.
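In case it helps others find the same trace, here is a minimal sketch for pulling only the kernel warning blocks out of a log. It assumes each block starts at a "cut here" line (as above) and ends at an "end trace" line, which is how the kernel usually closes such reports. The function name extract_kernel_warnings is hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: read a kernel log on stdin and print only the
# blocks that run from a "cut here" marker to the next "end trace" marker.
# If no "end trace" line follows, sed prints through the end of the input.
extract_kernel_warnings() {
    sed -n '/cut here/,/end trace/p'
}
```

Usage would be something like journalctl -o short-precise -k -b -0 --no-pager | extract_kernel_warnings, which should leave just the rtw88 warning and its call trace if that is the only report in the log.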

The question now is how to fix this. Is there an alternative driver to install? Manjaro Settings doesn’t show any available, unfortunately. :confused:

You can try the one in the AUR, rtl8821ce-dkms-git.
If that doesn’t work, you can try rtw88-dkms-git instead.
You may also need to blacklist any conflicting modules. To see which rtw modules are currently loaded:

lsmod | grep 'rtw'
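A sketch of what the blacklisting step could look like, assuming the conflicting module turns out to be the in-kernel rtw88_8821ce that inxi lists above. Check the lsmod output first, since which module conflicts depends on which driver you install. The function name blacklist_stanza and the file name blacklist-rtw88.conf are made up for this example; only the "blacklist <module>" line format is standard modprobe.d syntax.

```shell
#!/bin/sh
# Hypothetical helper: print one modprobe.d "blacklist" line
# for each module name passed as an argument.
blacklist_stanza() {
    for mod in "$@"; do
        printf 'blacklist %s\n' "$mod"
    done
}
```

It could then be used as blacklist_stanza rtw88_8821ce | sudo tee /etc/modprobe.d/blacklist-rtw88.conf, followed by a reboot so the module is no longer loaded.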