Kernel 6.6.19-1 Installed updates released since 2024-02-21 cause system freezes & Kernel bug errors

Problem

What happens:

I have recently installed updates, which have started causing issues where after some time of using it, the system would become unresponsive. I’d need to hold the laptop’s power button to cut the power to exit this frozen DE state.

Frequency:

The system seems to halt what feels like every few hours.

Not a complete freeze:

The first occurrence of it I encountered, was when everything seen on the screen would become un’interact’able with. I could move the mouse, but nothing I would click on would do anything, or would give any visual feedback. Also I could still hear the sound of the work meeting going on.

Vague circumstances:

I’m unsure of whether I can provide any patterns of when the issues would happen. However, I noticed it starting whenever I would be closing a browser window, spectacle screenshot app, or would be turning off the system. I don’t recall whether these are the only circumstance around which the issue has manifested itself. Plus, trying to reproduce the issue by doing the above actions does not seem to work, however, there may be a some sort of correlation there.

Also not 100% sure on this one, but I believe the laptop fan starts blowing harder as well whenever an issue is happening.

What were the updates?

According to /var/log/pacman.log, the last time I installed updates before these ones was: 2024-02-15. So the updates I installed were the stack that has piled up between 2024-02-15 and 2024-03-10.
I’m guessing it’s the 2024-02-21 and 2024-03-06 [Stable update] releases.

Also a few days later, the 2024-03-13 updates would come out, which I have installed in hopes these issues would be gone, however, it was not the case. So that’s my current system state when it comes to updates.

Logs:

Since I’ve got to cut the power to the laptop to exit the freeze, I don’t think I have any system crash logs. However, the last logs before the power cut tends to yield stuff like this:

Mar 13 11:37:19 user123 kernel: BUG: unable to handle page fault for address: 0000000000002501
Mar 13 11:37:19 user123 kernel: #PF: supervisor read access in kernel mode
Mar 13 11:37:19 user123 kernel: #PF: error_code(0x0000) - not-present page
Mar 13 11:37:19 user123 kernel: PGD 0 P4D 0 
Mar 13 11:37:19 user123 kernel: Oops: 0000 [#1] PREEMPT SMP PTI
Mar 13 11:37:19 user123 kernel: CPU: 0 PID: 4606 Comm: brave Tainted: P           OE      6.6.19-1-MANJARO #1 76c482e512047110118a77981ac42e42c9746e1c
Mar 13 11:37:19 user123 kernel: Hardware name: LENOVO 20EQS0VV07/20EQS0VV07, BIOS N1EET76W (1.49 ) 02/21/2018
Mar 13 11:37:19 user123 kernel: RIP: 0010:refill_obj_stock+0x4f/0x180
Mar 13 11:37:19 user123 kernel: Code: c7 40 08 03 00 65 4c 03 3d 9e f0 43 54 49 8b 47 10 48 39 f8 0f 84 9c 00 00 00 4c 89 ff e8 69 f2 ff ff 49 89 c6 e8 a1 22 d9 ff <48> 8b 45 00 a8 03 0f 85 c9 00 00 00 65 48 ff 00 e8 1c 55 d9 ff 49
Mar 13 11:37:19 user123 kernel: RSP: 0000:ffffa975c3f5bbb8 EFLAGS: 00010002

and this:

Mar 13 16:21:53 user123 kernel: BUG: kernel NULL pointer dereference, address: 0000000000000ce2
Mar 13 16:21:53 user123 kernel: #PF: supervisor read access in kernel mode
Mar 13 16:21:53 user123 kernel: #PF: error_code(0x0000) - not-present page
Mar 13 16:21:53 user123 kernel: PGD 0 P4D 0 
Mar 13 16:21:53 user123 kernel: Oops: 0000 [#1] PREEMPT SMP PTI
Mar 13 16:21:53 user123 kernel: CPU: 1 PID: 684 Comm: kwin_x11 Tainted: P    B      OE      6.6.19-1-MANJARO #1 76c482e512047110118a77981ac42e42c9746e1c
Mar 13 16:21:53 user123 kernel: Hardware name: LENOVO 20EQS0VV07/20EQS0VV07, BIOS N1EET76W (1.49 ) 02/21/2018

and this:

Mar 14 09:47:14 user123: ==================================================================
Mar 14 09:47:14 user123: BUG: KFENCE: memory corruption in acpi_os_release_object+0xe/0x20
Mar 14 09:47:14 user123: Corrupted memory at 0x000000001dc1c5a1 [ ! ! ! ! ! ! ! ! . . . . . . . . ] (in kfence-#113):
Mar 14 09:47:14 user123:  acpi_os_release_object+0xe/0x20
Mar 14 09:47:14 user123:  acpi_os_execute_deferred+0x17/0x30
Mar 14 09:47:14 user123:  process_one_work+0x171/0x340
Mar 14 09:47:14 user123:  worker_thread+0x27b/0x3a0
Mar 14 09:47:14 user123:  kthread+0xe5/0x120
Mar 14 09:47:14 user123:  ret_from_fork+0x31/0x50
Mar 14 09:47:14 user123:  ret_from_fork_asm+0x1b/0x30
Mar 14 09:47:14 user123:
Mar 14 09:47:14 user123: kfence-#113: 0x00000000bd4d0c1d-0x0000000076e71976, size=80, cache=Acpi-State
Mar 14 09:47:14 user123: allocated by task 33 on cpu 2 at 295.802004s:
Mar 14 09:47:14 user123:  acpi_ut_create_generic_state+0x37/0x50
Mar 14 09:47:14 user123:  acpi_ev_queue_notify_request+0x72/0x1e0
Mar 14 09:47:14 user123:  acpi_ex_opcode_2A_0T_0R+0xb0/0xe0
Mar 14 09:47:14 user123:  acpi_ds_exec_end_op+0x1f6/0x860
Mar 14 09:47:14 user123:  acpi_ps_parse_loop+0x265/0xa30
Mar 14 09:47:14 user123:  acpi_ps_parse_aml+0x221/0x5e0
Mar 14 09:47:14 user123:  acpi_ps_execute_method+0x171/0x3e0
Mar 14 09:47:14 user123:  acpi_ns_evaluate+0x174/0x5d0
Mar 14 09:47:14 user123:  acpi_evaluate_object+0x16f/0x450
Mar 14 09:47:14 user123:  acpi_ec_event_processor+0xa8/0x100
Mar 14 09:47:14 user123:  process_one_work+0x171/0x340
Mar 14 09:47:14 user123:  worker_thread+0x27b/0x3a0
Mar 14 09:47:14 user123:  kthread+0xe5/0x120
Mar 14 09:47:14 user123:  ret_from_fork+0x31/0x50
Mar 14 09:47:14 user123:  ret_from_fork_asm+0x1b/0x30
Mar 14 09:47:14 user123:
Mar 14 09:47:14 user123: freed by task 168 on cpu 0 at 295.809306s:
Mar 14 09:47:14 user123:  acpi_os_release_object+0xe/0x20
Mar 14 09:47:14 user123:  acpi_os_execute_deferred+0x17/0x30
Mar 14 09:47:14 user123:  process_one_work+0x171/0x340
Mar 14 09:47:14 user123:  worker_thread+0x27b/0x3a0
Mar 14 09:47:14 user123:  kthread+0xe5/0x120
Mar 14 09:47:14 user123:  ret_from_fork+0x31/0x50
Mar 14 09:47:14 user123:  ret_from_fork_asm+0x1b/0x30
Mar 14 09:47:14 user123:
Mar 14 09:47:14 user123: CPU: 0 PID: 168 Comm: kworker/0:2 Tainted: P           OE      6.6.19-1-MANJARO #1 76c482e512047110118a77981ac42e42c9746e1c
Mar 14 09:47:14 user123: Hardware name: LENOVO 20EQS0VV07/20EQS0VV07, BIOS N1EET76W (1.49 ) 02/21/2018
Mar 14 09:47:14 user123: Workqueue: kacpi_notify acpi_os_execute_deferred
Mar 14 09:47:14 user123: ==================================================================
Mar 14 09:48:13 user123: input: WH-1000XM4 (AVRCP) as /devices/virtual/input/input21
Mar 14 10:15:40 user123: perf: interrupt took too long (2514 > 2500), lowering kernel.perf_event_max_sample_rate to 79500
Mar 14 10:31:32 user123: BUG: kernel NULL pointer dereference, address: 0000000000000acb
Mar 14 10:31:32 user123: #PF: supervisor read access in kernel mode
Mar 14 10:31:32 user123: #PF: error_code(0x0000) - not-present page
Mar 14 10:31:32 user123: PGD 0 P4D 0
Mar 14 10:31:32 user123: Oops: 0000 [#1] PREEMPT SMP PTI
Mar 14 10:31:32 user123: CPU: 4 PID: 2013 Comm: brave Tainted: P    B      OE      6.6.19-1-MANJARO #1 76c482e512047110118a77981ac42e42c9746e1c
Mar 14 10:31:32 user123: Hardware name: LENOVO 20EQS0VV07/20EQS0VV07, BIOS N1EET76W (1.49 ) 02/21/2018
Mar 14 10:31:32 user123: RIP: 0010:zswap_load+0x30e/0x490
Mar 14 10:31:32 user123: Code: 34 1b 00 00 01 65 ff 0d 48 25 ea 5a 0f 84 74 01 00 00 65 48 ff 05 0a ef e9 5a 4d 8b 67 38 4d 85 e4 74 1b 66 90 e8 c2 b8 dd ff <49> 8b 7c 24 10 be 6f 00 00 00 e8 73 d8 ff ff e8 3e eb dd ff 48 89
Mar 14 10:31:32 user123: RSP: 0000:ffffa66c85997be0 EFLAGS: 00010202
Mar 14 10:31:32 user123: RAX: 0000000000000001 RBX: ffffe8614977d5c0 RCX: 0000000000e33c04
Mar 14 10:31:32 user123: RDX: ffff895e6ea98000 RSI: 0000000000e33a04 RDI: 0000000000000000
Mar 14 10:31:32 user123: RBP: ffff895d8f77ec68 R08: 0000000000000000 R09: 0000000000039160
Mar 14 10:31:32 user123: R10: ffffa66c806bc000 R11: 0000000000000018 R12: 0000000000000abb
Mar 14 10:31:32 user123: R13: ffff895d8119a6f0 R14: ffff895d8f77ec60 R15: ffff895ef92187d0
Mar 14 10:31:32 user123: FS:  00007f87359555c0(0000) GS:ffff895f17700000(0000) knlGS:0000000000000000
Mar 14 10:31:32 user123: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar 14 10:31:32 user123: CR2: 0000000000000acb CR3: 00000001f06d4006 CR4: 00000000003706e0
Mar 14 10:31:32 user123: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Mar 14 10:31:32 user123: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Mar 14 10:31:32 user123: Call Trace:
Mar 14 10:31:32 user123:  <TASK>
Mar 14 10:31:32 user123:  ? __die+0x23/0x70
Mar 14 10:31:32 user123:  ? page_fault_oops+0x171/0x4e0
Mar 14 10:31:32 user123:  ? exc_page_fault+0x7f/0x180
Mar 14 10:31:32 user123:  ? asm_exc_page_fault+0x26/0x30
Mar 14 10:31:32 user123:  ? zswap_load+0x30e/0x490
Mar 14 10:31:32 user123:  swap_readpage+0x81/0x460
Mar 14 10:31:32 user123:  swapin_readahead+0x1e2/0x4e0
Mar 14 10:31:32 user123:  do_swap_page+0x1b8/0xd30
Mar 14 10:31:32 user123:  ? do_wp_page+0x711/0xb80
Mar 14 10:31:32 user123:  ? __pte_offset_map+0x1b/0x180
Mar 14 10:31:32 user123:  __handle_mm_fault+0x7fb/0xd90
Mar 14 10:31:32 user123:  handle_mm_fault+0x17f/0x360
Mar 14 10:31:32 user123:  do_user_addr_fault+0x15b/0x660
Mar 14 10:31:32 user123:  exc_page_fault+0x7f/0x180
Mar 14 10:31:32 user123:  asm_exc_page_fault+0x26/0x30
Mar 14 10:31:32 user123: RIP: 0033:0x559ce2ce2ca5
Mar 14 10:31:32 user123: Code: e9 b0 68 d7 02 cc 0f 0b cc 0f 0b cc cc cc cc cc cc cc cc cc cc 55 48 89 e5 41 57 41 56 41 55 41 54 53 50 48 89 d3 4c 8b 6f 08 <0f> b7 42 fc a8 01 0f 84 d3 00 00 00 49 89 ce 0f b7 43 fe 89 c1 83
Mar 14 10:31:32 user123: RSP: 002b:00007ffda002a1f0 EFLAGS: 00010246
Mar 14 10:31:32 user123: RAX: 0000559cec7d1958 RBX: 0000199b007d53b8 RCX: 0000559ce03ffe70
Mar 14 10:31:32 user123: RDX: 0000199b007d53b8 RSI: 0000199b007d53b8 RDI: 00007ffda002a2e8
Mar 14 10:31:32 user123: RBP: 00007ffda002a220 R08: 00007ffda002a230 R09: 00000000000f0000
Mar 14 10:31:32 user123: R10: 00007ffda002a2e8 R11: 00007ffda00ed080 R12: 000000000000003d
Mar 14 10:31:32 user123: R13: 00000d68080f7b30 R14: 00000000000005f0 R15: 0000000000000003
Mar 14 10:31:32 user123:  </TASK>
Mar 14 10:31:32 user123: Modules linked in: rfcomm ccm qrtr cmac algif_hash algif_skcipher af_alg bnep joydev uinput nvidia_drm(POE) intel_rapl_msr intel_rapl_common nvidia_modeset(POE) intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp snd_ctl_led coretemp snd_hda_codec_realtek crct10dif_pclmul crc32_pclmul snd_hda_codec_generic polyval_clmulni polyval_generic gf128mul snd_soc_avs ghash_clmulni_intel rmi_smbus nvidia_uvm(POE) iwlmvm rmi_core snd_soc_hda_codec sha512_ssse3 nvidia(POE) sha256_ssse3 mac80211 snd_hda_ext_core sha1_ssse3 snd_soc_core libarc4 i915 snd_compress vfat snd_hda_codec_hdmi aesni_intel fat ac97_bus snd_pcm_dmaengine crypto_simd uvcvideo snd_hda_intel videobuf2_vmalloc cryptd snd_intel_dspcfg btusb uvc iwlwifi btrtl snd_intel_sdw_acpi videobuf2_memops drm_buddy btintel snd_hda_codec iTCO_wdt rapl i2c_algo_bit btbcm intel_pmc_bxt snd_hda_core ttm intel_cstate think_lmi snd_hwdep mei_hdcp mei_pxp btmtk mei_wdt ee1004 iTCO_vendor_support intel_wmi_thunderbolt videobuf2_v4l2 wmi_bmof thinkpad_acpi
Mar 14 10:31:32 user123:  firmware_attributes_class drm_display_helper snd_pcm intel_uncore videodev cfg80211 ledtrig_audio bluetooth mei_me snd_timer i2c_i801 platform_profile cec videobuf2_common intel_gtt snd ecdh_generic psmouse pcspkr i2c_smbus mc soundcore rfkill e1000e intel_pch_thermal video mei wmi mousedev mac_hid i2c_dev crypto_user fuse dm_mod loop nfnetlink bpf_preload ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 usbhid serio_raw rtsx_pci_sdmmc atkbd libps2 mmc_core vivaldi_fmap crc32c_intel rtsx_pci xhci_pci xhci_pci_renesas i8042 serio
Mar 14 10:31:32 user123: CR2: 0000000000000acb
Mar 14 10:31:32 user123: ---[ end trace 0000000000000000 ]---
Mar 14 10:31:32 user123: RIP: 0010:zswap_load+0x30e/0x490
Mar 14 10:31:32 user123: Code: 34 1b 00 00 01 65 ff 0d 48 25 ea 5a 0f 84 74 01 00 00 65 48 ff 05 0a ef e9 5a 4d 8b 67 38 4d 85 e4 74 1b 66 90 e8 c2 b8 dd ff <49> 8b 7c 24 10 be 6f 00 00 00 e8 73 d8 ff ff e8 3e eb dd ff 48 89
Mar 14 10:31:32 user123: RSP: 0000:ffffa66c85997be0 EFLAGS: 00010202
Mar 14 10:31:32 user123: RAX: 0000000000000001 RBX: ffffe8614977d5c0 RCX: 0000000000e33c04
Mar 14 10:31:32 user123: RDX: ffff895e6ea98000 RSI: 0000000000e33a04 RDI: 0000000000000000
Mar 14 10:31:32 user123: RBP: ffff895d8f77ec68 R08: 0000000000000000 R09: 0000000000039160
Mar 14 10:31:32 user123: R10: ffffa66c806bc000 R11: 0000000000000018 R12: 0000000000000abb
Mar 14 10:31:32 user123: R13: ffff895d8119a6f0 R14: ffff895d8f77ec60 R15: ffff895ef92187d0
Mar 14 10:31:32 user123: FS:  00007f87359555c0(0000) GS:ffff895f17700000(0000) knlGS:0000000000000000
Mar 14 10:31:32 user123: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar 14 10:31:32 user123: CR2: 0000000000000acb CR3: 00000001f06d4006 CR4: 00000000003706e0
Mar 14 10:31:32 user123: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Mar 14 10:31:32 user123: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Mar 14 10:31:32 user123: note: brave[2013] exited with irqs disabled
Mar 14 10:50:25 user123: BUG: unable to handle page fault for address: 000000000000129b
Mar 14 10:50:25 user123: #PF: supervisor read access in kernel mode
Mar 14 10:50:25 user123: #PF: error_code(0x0000) - not-present page

How do I know it’s updates related?

I’ve ran the command: …

journalctl --since '2024-02-16 10:46:00' --output='short-iso' --no-pager | grep -i -A 3 'BUG:' | grep -i 'kernel'

…to list just the kernel logs for the time range that starts way before the 2024-02-21 and later updates were installed. The output results start with timestamps matching the day of updates were installed.

Some of this command’s output:

2024-03-10T21:20:37+00:00 user123: BUG: kernel NULL pointer dereference, address: 0000000000000b33
2024-03-10T21:20:37+00:00 user123: #PF: supervisor read access in kernel mode
2024-03-10T21:20:37+00:00 user123: #PF: error_code(0x0000) - not-present page
2024-03-10T21:20:37+00:00 user123: PGD 0 P4D 0 
2024-03-10T21:21:33+00:00 user123: watchdog: BUG: soft lockup - CPU#0 stuck for 26s! [kwin_x11:681]
2024-03-10T21:21:33+00:00 user123: Modules linked in: rfcomm ccm qrtr cmac algif_hash algif_skcipher af_alg bnep snd_ctl_led snd_hda_codec_realtek snd_hda_codec_generic joydev uinput nvidia_uvm(POE) intel_rapl_msr intel_rapl_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp nvidia_drm(POE) coretemp snd_soc_avs crct10dif_pclmul crc32_pclmul snd_soc_hda_codec snd_hda_ext_core polyval_clmulni polyval_generic snd_soc_core gf128mul rmi_smbus nvidia_modeset(POE) rmi_core ghash_clmulni_intel iwlmvm snd_compress sha512_ssse3 i915 snd_hda_codec_hdmi sha256_ssse3 ac97_bus snd_pcm_dmaengine mac80211 sha1_ssse3 drm_buddy iTCO_wdt aesni_intel btusb i2c_algo_bit uvcvideo snd_hda_intel ttm btrtl intel_pmc_bxt videobuf2_vmalloc crypto_simd uvc cryptd libarc4 ee1004 intel_wmi_thunderbolt mousedev videobuf2_memops drm_display_helper thinkpad_acpi rapl btintel snd_intel_dspcfg iTCO_vendor_support videobuf2_v4l2 snd_intel_sdw_acpi cec ledtrig_audio iwlwifi intel_cstate btbcm vfat platform_profile videodev snd_hda_codec fat think_lmi intel_gtt
2024-03-10T21:21:33+00:00 user123:  btmtk wmi_bmof mei_hdcp mei_wdt mei_pxp firmware_attributes_class videobuf2_common video nvidia(POE) intel_uncore snd_hda_core mc snd_hwdep cfg80211 bluetooth snd_pcm usbhid snd_timer psmouse e1000e pcspkr i2c_i801 ecdh_generic rfkill i2c_smbus snd soundcore mei_me mei intel_pch_thermal wmi mac_hid i2c_dev crypto_user fuse loop dm_mod nfnetlink bpf_preload ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 serio_raw rtsx_pci_sdmmc atkbd libps2 mmc_core vivaldi_fmap crc32c_intel xhci_pci rtsx_pci xhci_pci_renesas i8042 serio
2024-03-10T21:21:33+00:00 user123: CPU: 0 PID: 681 Comm: kwin_x11 Tainted: P      D    OE      6.6.19-1-MANJARO #1 76c482e512047110118a77981ac42e42c9746e1c
2024-03-10T21:21:57+00:00 user123: watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [dbus-broker:613]
2024-03-10T21:21:57+00:00 user123: Modules linked in: rfcomm ccm qrtr cmac algif_hash algif_skcipher af_alg bnep snd_ctl_led snd_hda_codec_realtek snd_hda_codec_generic joydev uinput nvidia_uvm(POE) intel_rapl_msr intel_rapl_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp nvidia_drm(POE) coretemp snd_soc_avs crct10dif_pclmul crc32_pclmul snd_soc_hda_codec snd_hda_ext_core polyval_clmulni polyval_generic snd_soc_core gf128mul rmi_smbus nvidia_modeset(POE) rmi_core ghash_clmulni_intel iwlmvm snd_compress sha512_ssse3 i915 snd_hda_codec_hdmi sha256_ssse3 ac97_bus snd_pcm_dmaengine mac80211 sha1_ssse3 drm_buddy iTCO_wdt aesni_intel btusb i2c_algo_bit uvcvideo snd_hda_intel ttm btrtl intel_pmc_bxt videobuf2_vmalloc crypto_simd uvc cryptd libarc4 ee1004 intel_wmi_thunderbolt mousedev videobuf2_memops drm_display_helper thinkpad_acpi rapl btintel snd_intel_dspcfg iTCO_vendor_support videobuf2_v4l2 snd_intel_sdw_acpi cec ledtrig_audio iwlwifi intel_cstate btbcm vfat platform_profile videodev snd_hda_codec fat think_lmi intel_gtt
2024-03-10T21:21:57+00:00 user123:  btmtk wmi_bmof mei_hdcp mei_wdt mei_pxp firmware_attributes_class videobuf2_common video nvidia(POE) intel_uncore snd_hda_core mc snd_hwdep cfg80211 bluetooth snd_pcm usbhid snd_timer psmouse e1000e pcspkr i2c_i801 ecdh_generic rfkill i2c_smbus snd soundcore mei_me mei intel_pch_thermal wmi mac_hid i2c_dev crypto_user fuse loop dm_mod nfnetlink bpf_preload ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 serio_raw rtsx_pci_sdmmc atkbd libps2 mmc_core vivaldi_fmap crc32c_intel xhci_pci rtsx_pci xhci_pci_renesas i8042 serio
2024-03-10T21:21:57+00:00 user123: CPU: 3 PID: 613 Comm: dbus-broker Tainted: P      D    OEL     6.6.19-1-MANJARO #1 76c482e512047110118a77981ac42e42c9746e1c
2024-03-10T21:22:01+00:00 user123: watchdog: BUG: soft lockup - CPU#0 stuck for 52s! [kwin_x11:681]
2024-03-10T21:22:38+00:00 user123: microcode: updated early: 0xc2 -> 0xf0, date = 2021-11-12
2024-03-10T21:22:38+00:00 user123: Linux version 6.6.19-1-MANJARO (builduser@fv-az1021-257) (gcc (GCC) 13.2.1 20230801, GNU ld (GNU Binutils) 2.42.0) #1 SMP PREEMPT_DYNAMIC Fri Mar  1 18:16:16 UTC 2024
2024-03-11T10:10:03+00:00 user123: BUG: unable to handle page fault for address: 000000000000190d
2024-03-11T10:10:03+00:00 user123: #PF: supervisor read access in kernel mode
2024-03-11T10:10:03+00:00 user123: #PF: error_code(0x0000) - not-present page
2024-03-11T10:10:03+00:00 user123: PGD 0 P4D 0 
2024-03-11T10:13:37+00:00 user123: BUG: unable to handle page fault for address: 00000000000014f8
2024-03-11T10:13:37+00:00 user123: #PF: supervisor read access in kernel mode
2024-03-11T10:13:37+00:00 user123: #PF: error_code(0x0000) - not-present page
2024-03-11T10:13:37+00:00 user123: PGD 0 P4D 0 
2024-03-11T12:12:10+00:00 user123: BUG: KFENCE: memory corruption in acpi_os_release_object+0xe/0x20
2024-03-11T12:12:10+00:00 user123: Corrupted memory at 0x000000004a00ecd6 [ ! ! ! ! ! ! ! ! . . . . . . . . ] (in kfence-#252):
2024-03-11T12:12:10+00:00 user123:  acpi_os_release_object+0xe/0x20
2024-03-11T12:12:10+00:00 user123:  acpi_os_execute_deferred+0x17/0x30
2024-03-11T12:14:12+00:00 user123: BUG: kernel NULL pointer dereference, address: 000000000000001a
2024-03-11T12:14:12+00:00 user123: #PF: supervisor read access in kernel mode
2024-03-11T12:14:12+00:00 user123: #PF: error_code(0x0000) - not-present page
2024-03-11T12:14:12+00:00 user123: PGD 0 P4D 0 
2024-03-11T22:03:44+00:00 user123: BUG: kernel NULL pointer dereference, address: 000000000000001a
2024-03-11T22:03:44+00:00 user123: #PF: supervisor read access in kernel mode
2024-03-11T22:03:44+00:00 user123: #PF: error_code(0x0000) - not-present page
2024-03-11T22:03:44+00:00 user123: PGD 0 P4D 0 
2024-03-13T11:37:19+00:00 user123: BUG: unable to handle page fault for address: 0000000000002501
2024-03-13T11:37:19+00:00 user123: #PF: supervisor read access in kernel mode
2024-03-13T11:37:19+00:00 user123: #PF: error_code(0x0000) - not-present page
2024-03-13T11:37:19+00:00 user123: PGD 0 P4D 0 
2024-03-13T14:42:03+00:00 user123: BUG: KFENCE: out-of-bounds write in _nv044009rm+0x10/0x30 [nvidia]
2024-03-13T14:42:03+00:00 user123: Out-of-bounds write at 0x00000000eb7cfb8d (24B left of kfence-#105):
2024-03-13T14:42:03+00:00 user123:  _nv044009rm+0x10/0x30 [nvidia]
2024-03-13T14:42:03+00:00 user123:  _nv014559rm+0x4d/0x90 [nvidia]
2024-03-13T14:42:04+00:00 user123: BUG: KFENCE: memory corruption in acpi_os_release_object+0xe/0x20
2024-03-13T14:42:04+00:00 user123: Corrupted memory at 0x0000000065651713 [ ! ! ! ! ! ! ! ! . . . . . . . . ] (in kfence-#250):
2024-03-13T14:42:04+00:00 user123:  acpi_os_release_object+0xe/0x20
2024-03-13T14:42:04+00:00 user123:  acpi_os_execute_deferred+0x17/0x30
2024-03-13T16:21:53+00:00 user123: BUG: kernel NULL pointer dereference, address: 0000000000000ce2
2024-03-13T16:21:53+00:00 user123: #PF: supervisor read access in kernel mode
2024-03-13T16:21:53+00:00 user123: #PF: error_code(0x0000) - not-present page
2024-03-13T16:21:53+00:00 user123: PGD 0 P4D 0 
2024-03-14T08:39:49+00:00 user123: BUG: KFENCE: memory corruption in acpi_os_release_object+0xe/0x20
2024-03-14T08:39:49+00:00 user123: Corrupted memory at 0x00000000c77f737a [ ! ! ! ! ! ! ! ! . . . . . . . . ] (in kfence-#174):
2024-03-14T08:39:49+00:00 user123:  acpi_os_release_object+0xe/0x20
2024-03-14T08:39:49+00:00 user123:  acpi_os_execute_deferred+0x17/0x30
2024-03-14T09:41:42+00:00 user123: BUG: kernel NULL pointer dereference, address: 0000000000000624
2024-03-14T09:41:42+00:00 user123: #PF: supervisor read access in kernel mode
2024-03-14T09:41:42+00:00 user123: #PF: error_code(0x0000) - not-present page
2024-03-14T09:47:14+00:00 user123: BUG: KFENCE: memory corruption in acpi_os_release_object+0xe/0x20
2024-03-14T09:47:14+00:00 user123: Corrupted memory at 0x000000001dc1c5a1 [ ! ! ! ! ! ! ! ! . . . . . . . . ] (in kfence-#113):
2024-03-14T09:47:14+00:00 user123:  acpi_os_release_object+0xe/0x20
2024-03-14T09:47:14+00:00 user123:  acpi_os_execute_deferred+0x17/0x30
2024-03-14T10:31:32+00:00 user123: BUG: kernel NULL pointer dereference, address: 0000000000000acb
2024-03-14T10:31:32+00:00 user123: #PF: supervisor read access in kernel mode
2024-03-14T10:31:32+00:00 user123: #PF: error_code(0x0000) - not-present page
2024-03-14T10:31:32+00:00 user123: PGD 0 P4D 0 
2024-03-14T10:50:25+00:00 user123: BUG: unable to handle page fault for address: 000000000000129b
2024-03-14T10:50:25+00:00 user123: #PF: supervisor read access in kernel mode
2024-03-14T10:50:25+00:00 user123: #PF: error_code(0x0000) - not-present page
2024-03-14T11:46:15+00:00 user123: BUG: kernel NULL pointer dereference, address: 000000000000016b
2024-03-14T11:46:15+00:00 user123: #PF: supervisor read access in kernel mode
2024-03-14T11:46:15+00:00 user123: #PF: error_code(0x0000) - not-present page
2024-03-14T11:46:15+00:00 user123: PGD 0 P4D 0 
2024-03-14T11:46:15+00:00 user123: BUG: scheduling while atomic: baloo_file_extr/4653/0x00000000
... etc.

Extra Info:

Operating System: Manjaro Linux 
KDE Plasma Version: 5.27.11
KDE Frameworks Version: 5.115.0
Qt Version: 5.15.12
Kernel Version: 6.6.19-1-MANJARO (64-bit)
Graphics Platform: X11
Processors: 8 × Intel® Core™ i7-6820HQ CPU @ 2.70GHz
Memory: 7.6 GiB of RAM
Graphics Processor: Mesa Intel® HD Graphics 530
Manufacturer: LENOVO
Product Name: -
System Version: ThinkPad P50

Question:

Is there any way I can resolve this? (Preferably without having to reinstall the OS.)
(Can’t use the Timeshift to rollback because it’s not set up :facepalm:)

If anyone has got any idea what is happening, please don’t hesitate to tell:
I’d love to learn more about what’s bringing down my system to its knees.

Please provide systeminfo using inxi

inxi -Fazy -c0
  • Check your swap allocation
  • Boot a live ISO and run memtest+
  • Sudden changes after updates could be caused by different methods of allocating memory

I am fairly certain it is hardware related - the logs point to that

Requested System Info:

System:
  Kernel: 6.6.19-1-MANJARO arch: x86_64 bits: 64 compiler: gcc v: 13.2.1
    clocksource: tsc avail: hpet,acpi_pm
    parameters: BOOT_IMAGE=/boot/vmlinuz-6.6-x86_64
    root=UUID=cbdf1a8b-406b-433c-9528-a87b111401b3 rw quiet
    resume=UUID=43412e1e-1f33-4bf6-a039-ea2d38607166 udev.log_priority=3
  Desktop: KDE Plasma v: 5.27.11 tk: Qt v: 5.15.12 info: frameworks
    v: 5.115.0 wm: kwin_x11 vt: 2 dm: SDDM Distro: Manjaro base: Arch Linux
Machine:
  Type: Laptop System: LENOVO product: 20EQS0VV07 v: ThinkPad P50
    serial: <superuser required> Chassis: type: 10 serial: <superuser required>
  Mobo: LENOVO model: 20EQS0VV07 v: SDK0J40705 WIN
    serial: <superuser required> part-nu: LENOVO_MT_20EQ_BU_Think_FM_ThinkPad P50
    uuid: <superuser required> UEFI: LENOVO v: N1EET76W (1.49 )
    date: 02/21/2018
Battery:
  ID-1: BAT0 charge: 48.0 Wh (100.0%) condition: 48.0/90.1 Wh (53.3%)
    volts: 12.7 min: 11.4 model: LGC 00NY492 type: Li-poly serial: <filter>
    status: not charging
CPU:
  Info: model: Intel Core i7-6820HQ bits: 64 type: MT MCP arch: Skylake-S
    gen: core 6 level: v3 note: check built: 2015 process: Intel 14nm family: 6
    model-id: 0x5E (94) stepping: 3 microcode: 0xF0
  Topology: cpus: 1x cores: 4 tpc: 2 threads: 8 smt: enabled cache:
    L1: 256 KiB desc: d-4x32 KiB; i-4x32 KiB L2: 1024 KiB desc: 4x256 KiB
    L3: 8 MiB desc: 1x8 MiB
  Speed (MHz): avg: 800 min/max: 800/3600 scaling: driver: intel_pstate
    governor: powersave cores: 1: 800 2: 800 3: 800 4: 800 5: 800 6: 800 7: 800
    8: 800 bogomips: 43214
  Flags: avx avx2 ht lm nx pae sse sse2 sse3 sse4_1 sse4_2 ssse3
  Vulnerabilities:
  Type: gather_data_sampling status: Vulnerable: No microcode
  Type: itlb_multihit status: KVM: VMX unsupported
  Type: l1tf mitigation: PTE Inversion
  Type: mds mitigation: Clear CPU buffers; SMT vulnerable
  Type: meltdown mitigation: PTI
  Type: mmio_stale_data mitigation: Clear CPU buffers; SMT vulnerable
  Type: retbleed mitigation: IBRS
  Type: spec_rstack_overflow status: Not affected
  Type: spec_store_bypass mitigation: Speculative Store Bypass disabled via
    prctl
  Type: spectre_v1 mitigation: usercopy/swapgs barriers and __user pointer
    sanitization
  Type: spectre_v2 mitigation: IBRS, IBPB: conditional, STIBP: conditional,
    RSB filling, PBRSB-eIBRS: Not affected
  Type: srbds mitigation: Microcode
  Type: tsx_async_abort mitigation: TSX disabled
Graphics:
  Device-1: Intel HD Graphics 530 vendor: Lenovo driver: i915 v: kernel
    arch: Gen-9 process: Intel 14n built: 2015-16 ports: active: eDP-1
    empty: DP-1, DP-2, HDMI-A-1, HDMI-A-2 bus-ID: 00:02.0 chip-ID: 8086:191b
    class-ID: 0300
  Device-2: NVIDIA GM107GLM [Quadro M1000M] vendor: Lenovo driver: nvidia
    v: 550.54.14 alternate: nouveau,nvidia_drm non-free: 545.xx+ status: current
    (as of 2024-02; EOL~2026-12-xx) arch: Maxwell code: GMxxx
    process: TSMC 28nm built: 2014-2019 pcie: gen: 1 speed: 2.5 GT/s lanes: 16
    link-max: gen: 3 speed: 8 GT/s bus-ID: 01:00.0 chip-ID: 10de:13b1
    class-ID: 0300
  Device-3: Chicony Integrated Camera driver: uvcvideo type: USB rev: 2.0
    speed: 480 Mb/s lanes: 1 mode: 2.0 bus-ID: 1-8:3 chip-ID: 04f2:b52c
    class-ID: 0e02 serial: <filter>
  Display: x11 server: X.Org v: 21.1.11 compositor: kwin_x11 driver: X:
    loaded: modesetting,nvidia alternate: fbdev,nouveau,nv,vesa dri: iris
    gpu: i915 display-ID: :0 screens: 1
  Screen-1: 0 s-res: 1920x1080 s-dpi: 96 s-size: 508x285mm (20.00x11.22")
    s-diag: 582mm (22.93")
  Monitor-1: eDP-1 model: LG Display 0x04a7 built: 2015 res: 1920x1080 hz: 60
    dpi: 142 gamma: 1.2 size: 344x194mm (13.54x7.64") diag: 395mm (15.5")
    ratio: 16:9 modes: 1920x1080
  API: EGL v: 1.5 hw: drv: intel iris drv: nvidia platforms: device: 0
    drv: nvidia device: 2 drv: iris device: 3 drv: swrast gbm: drv: kms_swrast
    surfaceless: drv: nvidia x11: drv: iris inactive: wayland,device-1
  API: OpenGL v: 4.6.0 compat-v: 4.5 vendor: intel mesa v: 24.0.2-manjaro1.1
    glx-v: 1.4 direct-render: yes renderer: Mesa Intel HD Graphics 530 (SKL GT2)
    device-ID: 8086:191b memory: 7.41 GiB unified: yes
  API: Vulkan v: 1.3.279 layers: 5 device: 0 type: discrete-gpu
    name: Quadro M1000M driver: nvidia v: 550.54.14 device-ID: 10de:13b1
    surfaces: xcb,xlib
Audio:
  Device-1: Intel 100 Series/C230 Series Family HD Audio vendor: Lenovo
    driver: snd_hda_intel v: kernel alternate: snd_soc_avs bus-ID: 00:1f.3
    chip-ID: 8086:a170 class-ID: 0403
  Device-2: NVIDIA GM107 High Definition Audio [GeForce 940MX]
    driver: snd_hda_intel v: kernel pcie: gen: 3 speed: 8 GT/s lanes: 16
    bus-ID: 01:00.1 chip-ID: 10de:0fbc class-ID: 0403
  API: ALSA v: k6.6.19-1-MANJARO status: kernel-api with: aoss
    type: oss-emulator tools: alsactl,alsamixer,amixer
  Server-1: JACK v: 1.9.22 status: off tools: N/A
  Server-2: PipeWire v: 1.0.3 status: off with: pipewire-media-session
    status: active tools: pw-cli
  Server-3: PulseAudio v: 17.0 status: active with: pulseaudio-alsa
    type: plugin tools: pacat,pactl
Network:
  Device-1: Intel Ethernet I219-LM vendor: Lenovo driver: e1000e v: kernel
    port: N/A bus-ID: 00:1f.6 chip-ID: 8086:15b7 class-ID: 0200
  IF: enp0s31f6 state: down mac: <filter>
  Device-2: Intel Wireless 8260 driver: iwlwifi v: kernel pcie: gen: 1
    speed: 2.5 GT/s lanes: 1 bus-ID: 04:00.0 chip-ID: 8086:24f3 class-ID: 0280
  IF: wlp4s0 state: up mac: <filter>
  Info: services: NetworkManager, systemd-timesyncd, wpa_supplicant
Bluetooth:
  Device-1: Intel Bluetooth wireless interface driver: btusb v: 0.8 type: USB
    rev: 2.0 speed: 12 Mb/s lanes: 1 mode: 1.1 bus-ID: 1-14:5 chip-ID: 8087:0a2b
    class-ID: e001
  Report: rfkill ID: hci0 rfk-id: 1 state: up address: see --recommends
Drives:
  Local Storage: total: 238.47 GiB used: 179.07 GiB (75.1%)
  SMART Message: Unable to run smartctl. Root privileges required.
  ID-1: /dev/sda maj-min: 8:0 vendor: Toshiba model: THNSFJ256GDNU A
    size: 238.47 GiB block-size: physical: 512 B logical: 512 B speed: 6.0 Gb/s
    tech: SSD serial: <filter> fw-rev: 1102 scheme: GPT
Partition:
  ID-1: / raw-size: 229.37 GiB size: 224.71 GiB (97.97%)
    used: 178.81 GiB (79.6%) fs: ext4 dev: /dev/sda2 maj-min: 8:2
  ID-2: /boot/efi raw-size: 300 MiB size: 299.4 MiB (99.80%)
    used: 312 KiB (0.1%) fs: vfat dev: /dev/sda1 maj-min: 8:1
Swap:
  Kernel: swappiness: 60 (default) cache-pressure: 100 (default) zswap: yes
    compressor: zstd max-pool: 20%
  ID-1: swap-1 type: partition size: 8.8 GiB used: 264.2 MiB (2.9%)
    priority: -2 dev: /dev/sda3 maj-min: 8:3
Sensors:
  System Temperatures: cpu: 40.0 C pch: 45.0 C mobo: N/A
  Fan Speeds (rpm): fan-1: 0 fan-2: 0
Info:
  Memory: total: 8 GiB note: est. available: 7.59 GiB used: 5.41 GiB (71.3%)
  Processes: 281 Power: uptime: 1h 3m states: freeze,mem,disk suspend: deep
    avail: s2idle wakeups: 0 hibernate: platform avail: shutdown, reboot,
    suspend, test_resume image: 3.03 GiB services: org_kde_powerdevil,upowerd
    Init: systemd v: 255 default: graphical tool: systemctl
  Packages: 1573 pm: pacman pkgs: 1491 libs: 400 tools: pamac pm: flatpak
    pkgs: 82 Compilers: clang: 16.0.6 gcc: 13.2.1 Shell: Zsh v: 5.9 default: Bash
    v: 5.2.26 running-in: konsole inxi: 3.3.33

That BIOS is 6 years old, the first thing I’d try is updating it. 1.74 was released 28 Nov 2023.

I Couldn’t run the memtest+ from the Manjaro ISO as my BIOS is set to UEFI only mode.
However, I ran Lenovo Memory Extended Diagnostics, which has passed on every test there was.

Not sure what exactly to look for when checking the swap allocation, however, I did some testing of pushing the swap near the limits of 7.9 GB out of 8.8GB. So it doesn’t seem like there’s any usage threshold there that would trigger my issue.

Perhaps, it could be related to hardware in a way that the new, updated software is using it differently enough for usage to be unexpected?

I’m thinking so because I didn’t not have any Kernel BUG errors for an entire month (that’s how far I checked) before installing the mentioned Updates, but as soon as installed them, I started getting multiple of these Kernel BUG errors per day. Also I haven’t changed anything hardware related at all on this machine.

On a separate note, it may be worth adding that I doubt the issue is related to running out of memory. As most these issues happened close to ~4-5GB of RAM usage out of total 8GB available. Plus it happened a few times while having barely anything open at all.

Perhaps, there’s a way to track down what exactly ends up causing these kernel errors?
Something has changed within the updated packages that’s causing these issues.
Perhaps, it’s possible to track down which one of those is causing issues by narrowing the package list down based on what impact their changes have? (Like which one is capable freezing the UI elements, but not the audio or mouse)?
Or perhaps, there’s a way to enable more verbose logs or something of the sort to get the answers to some of these questions?

I compiled a list of warnings and errors I found scrolling though the logs - most or all may not be related at all, but…
Maybe there’s a chance of finding something useful in there? :thinking:

Like, one of those baloo file errors that’s next to the kernel bug, mentions taintedness… which is also mentioned during nvidia module load. I don’t know, I’m just grasping for straws now.

...
08:32:33 kernel: x86/cpu: VMX (outside TXT) disabled by BIOS
08:32:33 kernel: x86/cpu: SGX disabled by BIOS.
...
08:32:34 kernel: nvidia: loading out-of-tree module taints kernel.
08:32:34 kernel: nvidia: module license 'NVIDIA' taints kernel.
08:32:34 kernel: Disabling lock debugging due to kernel taint
08:32:34 kernel: nvidia: module verification failed: signature and/or required key missing - tainting kernel
08:32:34 kernel: nvidia: module license taints kernel.
08:32:35 kernel: nvidia_uvm: module uses symbols nvUvmInterfaceDisableAccessCntr from proprietary module nvidia, inheriting taint.
...
08:32:43 pkgfile[575]: BUG: unhandled repo->dl_result=0
...
08:39:49 kernel: CPU: 0 PID: 182 Comm: kworker/0:2 Tainted: P           OE      6.6.19-1-MANJARO #1 76c482e512047110118a77981ac42e42c9746e1c
...
11:44:10 kate[4550]: kf.sonnet.core: No language dictionaries for the language: "en_US"
11:44:15 wpa_supplicant[499]: wlp4s0: CTRL-EVENT-SIGNAL-CHANGE above=0 signal=-65 noise=9999 txrate=650000
11:44:23 systemd[586]: app-org.kde.kate-5a2710b7456247bcbf98b0be154be6d8.scope: Consumed 1.387s CPU time.
...
11:46:15 kernel: Fixing recursive fault but reboot is needed!
11:46:15 kernel: BUG: scheduling while atomic: baloo_file_extr/4653/0x00000000
...
11:46:15 kernel: CPU: 2 PID: 4653 Comm: baloo_file_extr Tainted: P      D W  OE      6.6.19-1-MANJARO #1 76c482e512047110118a77981ac42e42c9746e1c
...
11:47:02 kernel: CPU: 5 PID: 642 Comm: baloo_file Tainted: P      D W  OE      6.6.19-1-MANJARO #1 76c482e512047110118a77981ac42e42c9746e1c
...
: Starting Update XDG user dir configuration...
19:09:53 systemd[1]: Started Session c1 of User sddm.
19:09:53 sddm-helper[542]: Writing cookie to "/tmp/xauth_mzoCqf"
19:09:53 sddm-helper[542]: Starting X11 session: "" "/usr/bin/sddm-greeter --socket /tmp/sddm-:0-DJgNRV --theme /usr/share/sddm/themes/breath"
19:09:53 sddm[519]: Greeter session started successfully
19:09:53 xdg-user-dirs-update[558]: Can't save user-dirs.dirs
19:09:53 systemd[546]: xdg-user-dirs-update.service: Main process exited, code=exited, status=1/FAILURE
19:09:53 systemd[546]: xdg-user-dirs-update.service: Failed with result 'exit-code'.
19:09:53 systemd[546]: Failed to start Update XDG user dir configuration.
...
19:09:55 sddm-greeter[559]: file:///usr/share/sddm/themes/breath/Main.qml:465:13: Unable to assign [undefined] to QUrl
...
19:09:53 sddm-greeter[559]: QObject: Cannot create children for a parent that is in a different thread.
                                                    (Parent is QGuiApplication(0x7fffa4cb8f00), parent's thread is QThread(0x5569c70ceb90), current thread is QThread(0x5569c71f27e0)
19:09:53 sddm-greeter[559]: QObject::installEventFilter(): Cannot filter events for objects in a different thread.
...
19:09:55 sddm-greeter[559]: QDBusConnection: name 'org.freedesktop.UDisks2' had owner '' but we thought it was ':1.24'
19:09:55 sddm-greeter[559]: QDBusConnection: name 'org.freedesktop.UPower' had owner '' but we thought it was ':1.25'
...
19:10:02 ksmserver[696]: QDBusConnection: name 'org.kde.kglobalaccel' had owner '' but we thought it was ':1.15'
...
19:10:03 org_kde_powerdevil[780]: org.kde.powerdevil: Handle button events action could not check for screen configuration
...
19:10:03 kded5[698]: kscreen.kded: PowerDevil SuspendSession action not available!
...
19:10:03 kded5[698]: kf.bluezqt: PendingCall Error: "The name is not activatable"
19:10:03 kded5[698]: kf.networkmanagerqt: void NetworkManager::ConnectionPrivate::onPropertiesChanged(const QVariantMap&) Unhandled property "VersionId"
...
19:10:03 kded5[698]: QDBusAbstractAdaptor: Cannot relay signal KDEDModule::moduleDeleted(KDEDModule*): Pointers are not supported: KDEDModule*
...
19:10:03 plasmashell[745]: Aborting shell load: The activity manager daemon (kactivitymanagerd) is not running.
19:10:03 plasmashell[745]: If this Plasma has been installed into a custom prefix, verify that its D-Bus services dir is known to the system for the daemon to be activatable.
...
19:10:04 plasmashell[745]: file:///usr/lib/qt/qml/org/kde/kirigami.2/templates/InlineMessage.qml:265:13: QML SelectableLabel: Binding loop detected for property "implicitWidth"
...
19:10:04 plasmashell[745]: Trying to use rootObject before initialization is completed, whilst using setInitializationDelayed. Forcing completion
...
19:10:05 plasmashell[745]: Cyclic dependency detected between "file:///usr/share/plasma/plasmoids/org.kde.plasma.notifications/contents/ui/global/Globals.qml" and "file:///usr/share/plasma/plasmoids/org.kde.plasma.notifications/contents/ui/ThumbnailStrip.qml"
19:10:05 plasmashell[745]: file:///usr/share/plasma/plasmoids/org.kde.plasma.networkmanagement/contents/ui/main.qml:95: TypeError: Cannot read property 'airplaneModeAvailable' of null
19:10:05 plasmashell[745]: file:///usr/share/plasma/plasmoids/org.kde.plasma.networkmanagement/contents/ui/main.qml:95: TypeError: Cannot read property 'airplaneModeAvailable' of null
19:10:05 plasmashell[745]: file:///usr/share/plasma/plasmoids/org.kde.plasma.private.systemtray/contents/ui/main.qml:18:1: QML MouseArea (parent or ancestor of QQuickLayoutAttached): Binding loop detected for property "minimumWidth"
19:10:05 plasmashell[745]: Both point size and pixel size set. Using pixel size.
19:10:05 plasmashell[745]: file:///usr/share/plasma/plasmoids/org.kde.plasma.digitalclock/contents/ui/Tooltip.qml:78:9: QML GridLayout (parent or ancestor of QQuickLayoutAttached): Binding loop detected for property "minimumWidth"
...
19:10:07 kioslave5[1097]: QObject::connect: No such slot DesktopProtocol::_k_slotRedirection(KIO::Job *, QUrl)
...
19:10:07 plasmashell[745]: file:///usr/lib/qt/qml/org/kde/plasma/extras/PlaceholderMessage.qml:238:5: QML Heading: Binding loop detected for property "verticalAlignment"
19:10:07 plasmashell[745]: file:///usr/lib/qt/qml/org/kde/plasma/extras/PlaceholderMessage.qml:238:5: QML Heading: Binding loop detected for property "verticalAlignment"
19:10:09 plasmashell[745]: Could not find the Plasmoid for Plasma::FrameSvgItem(0x5638bd15b3e0) QQmlContext(0x5638b9374460) QUrl("file:///usr/share/plasma/plasmoids/org.kde.plasma.notifications/contents/ui/global/Globals.qml")
...
19:10:09 plasmashell[745]: QFont::setPointSizeF: Point size <= 0 (0.000000), must be greater than 0
...
19:10:26 plasmashell[745]: kf.sonnet.core: No language dictionaries for the language: "en_US"
19:10:27 plasmashell[745]: Connecting to deprecated signal QDBusConnectionInterface::serviceOwnerChanged(QString,QString,QString)

...
19:12:47 kate[2376]: kf.sonnet.core: No language dictionaries for the language: "en_US"

-----
14:27:15 systemd[585]: kde-baloo.service: Failed with result 'core-dump'.
14:27:16 kded5[680]: ag_manager_list: assertion 'AG_IS_MANAGER (manager)' failed
14:27:17 dbus-broker-launch[432]: Activation request for 'org.freedesktop.resolve1' failed: The systemd unit 'dbus-org.freedesktop.resolve1.service' could not be found.
14:27:17 kded5[680]: kscreen.kded:         Failed to find a matching output in the current info data - this means that our info is corrupted or a different device with the same serial number has been connected (very unlikely).
14:27:17 kded5[680]: kscreen.kded:         Failed to find a matching mode - this means that our config is corrupted or a different device with the same serial number has been connected (very unlikely). Falling back to preferred modes.

Logs with matching timestamps do not necessarily go one one after another. I copied them from all over the place. Also some logs close by, may actually be separated with more logs in between. If any of these logs is of interest, let me know - I can get everything around them time-wise.

Thanks for the suggestion.
Will probably not tend to this now, but It’s on my to do list.
Thus far couldn’t find the time to figure out the hoops of doing this on Linux. :sweat_smile:
I had a rough BIOS upgrade experience with my previous laptop, where I bricked it. :persevere:
So I feel I need to be extra sure of that whatever I’m doing is alright when it comes to this.

I wonder if your baloo index database is corrupt.

Disable File Search in System Settings → Search → File Search, click Apply, then click “Delete Index Data” when asked. Reboot.

Test it like that for a while to see if your issues are still happening. Afterwards you can either leave it disabled if you don’t use that feature, or turn it back on which will reindex everything.

maybe try a different kernel series.

Tested it for a couple of days.

I think your suggestion has fixed the Baloo issue. Thanks!

The system freezing issue appears to be a separate issue though. It still persists.

I think I may just give it a go soon.

Sorry for late reply - I was holding out so far on your suggestion in hopes of testing MrLavender's suggested fix first & in hopes of finding some other clues for what’s happening if that wouldn’t work.

I think I’ll try upping the logs verbosity slightly in the coming days. If I get nothing still, I’ll give it a go with kernel downgrade.

New observations

Kernel bugs aren’t killing the system immediatelly

Across these couple days I noticed that those memory kernel bugs were not causing an immediate system freeze. The system can run just fine for hours past encountering these bugs.

Cannot tell the cause of the real issue yet because it’s not logged

Whenever the whatever critical failure does happen, the system does not produce any logs at that point or after. I let my system run for 40 minutes past it entering a complete halt (nothing was functional except the keyboard lights toggle) - it produced absolutely no logs.

Memory usage before the system freeze

Minutes before the system halting my system was sitting at around ~5.5 out of 8 GB RAM usage, and ~2.2 out of 8 GB swap usage.

I’m not sure what it was using all that memory for as I had a browser window open with like 8 tabs, Konsole terminate with 4 tabs that only displayed my logs search results, and I had a System Monitor up.

More details on what happened before the halt

The system did encounter maybe a 2nd or 3rd memory kernel bug during that session 5-10 minutes before the it halting. However, it was usable past it.

What triggered the system to start getting into freezing state, was clicking the “Windows” key whilst attempting to open the “Start menu” to launch “Spectacle” app. The UI froze on that moment as it drew the edges of the “Start menu”. From that point on those edges were stuck there with the main menu area being completely transparent.

I could still use Alt+Tab to switch between the application windows, however, the system went into a complete halt as tried launching “Spectacle” via terminal to screenshot … “plasmashell” (or a process with a similar name) being reported as a zombie process.

VM Size (probs red herring)

I’m aware that based on System Monitor’s description, the VM Size is nearly meaningless number, but is it normal for its size to be reported to be larger than the available system RAM or Storage?

For instance, the Brave browser reports VM Size of 1.1TB. I can assure you, there’s nothing 1TB sized on my system.

Edit: another halt while closing an app.

Just had another system freeze, I think they get triggered via UI actions.

I was about to shut off my machine. So clicked “x” to close Brave browser window - and then it happened - complete halt.

Even seconds before doing that, I could do whatever I wanted - switch windows, tabs, etc. But as soon as I clicked close browser window - it halts. This has happened multiple times - mostly with browser applications. But happened with at least one other app I don’t remember.

This also happens whilst attempting to Shut Down a system.

Unfortunately, I never got to changing the kernel version.

My laptop ended up dying from, I assume, me cutting power to it to exit the freeze every couple of hours. Completely no reaction to pressing the power button now. At the repair shop now, hoping it will be economical to give it another lease of life. If you’re suffering from this issue, I don’t recommend waiting for too long given my experience.

In relation to this post, I don’t think I’ll be able to post any updates as to whether anything changes with the kernel update. However, judging from the Manjaro Discord group & the newer posts on this forum, it seems I’m not the only one with these system freeze and kernel bug issues.

Hopefully, the issue will be found and fixed with the new data points incoming about this.

This post can probably be closed if the information on it is no longer relevant to anyone.

I also encountered some crashing issues when upgrading to the 6.6 kernel; my computer would automatically reboot and there were no relevant error logs. My solution was to install the 6.1 kernel, which runs well and is very stable on my computer.