Hi all, Been a happy Manjaro user for many years now - running on a Thinkpad X1 Carbon Gen6. Many thanks to all those who work on this distro!
However, since the most recent kernel updates (I have tried both 6.8.5-1 and 6.6.26-1 LTS), I have been having unstable behavior. I have experienced several kernel panics and other kernel issues that eventually require a reboot or hard shutdown to resume normal system operation. I have not changed anything on the system recently and have not experienced a single such issue over the years.
Issues in the log are usually along the lines of: BUG: scheduling while atomic: kworker/6:0/171277/0x00000002
or watchdog: BUG: soft lockup - CPU#6 stuck for 160s! [kworker/6:1:168392] (multiple repeated)
Other strange symptons I have experienced since this update are NetworkManager seg-faulting (dumping core), and Bluetooth connections can sometimes not be managed via gui (neither Plasma nor blueman).
My first thought was a hardware failure. I have run all of Lenovo’s hardware tests and they all pass. I have also updated the Bios to the latest version.
I realise this is all quite vague. However, the issues appear random. I guess I’m wondering whether:
there have been any recent changes that could lead to such issues; and
anyone can offer suggestions on how to troubleshoot further? (Since I don’t see other topic in the forum, it’s more likely something specific on my end.)
I’ve had networkmanager crash multiple times today, but system itself has stayed functional. I’ve now tried disabling the associated systemd service and running directly from terminal via networkmanager --no-daemon --debug in case that leads to more useful information.
Update: This setup has remained stable for the entire day. No kernel errors such as those in the original post.
However, trying to restart NetworkManager.service via systemd resulted in multiple errors as shown, and finally a kernel panic (system completely unresponsive and flashing Caps-Lock key):
Apr 16 18:47:04 kernel: BUG: scheduling while atomic: NetworkManager/201980/0x00000002
Apr 16 18:47:04 kernel: BUG: scheduling while atomic: NetworkManager/201980/0x00000000
Apr 16 18:47:05 systemd-coredump[202009]: Process 201980 (NetworkManager) of user 0 dumped core.
Apr 16 18:47:05 systemd[1]: Failed to start Network Manager.
Apr 16 18:47:05 kernel: BUG: scheduling while atomic: NetworkManager/202045/0x00000002
Apr 16 18:47:05 kernel: BUG: scheduling while atomic: NetworkManager/202045/0x00000000
Apr 16 18:47:05 NetworkManager[202045]: <error> [1713257225.3343] platform-linux: netlink[rtnl]: read: failed to retrieve incoming events: Bad address (-14)
Apr 16 18:47:05 NetworkManager[202045]: <error> [1713257225.3344] platform-linux: netlink[rtnl]: read: failed to retrieve incoming events: Bad address (-14)
Apr 16 18:47:05 systemd-coredump[202052]: Process 202045 (NetworkManager) of user 0 dumped core.
Apr 16 18:47:05 systemd[1]: Failed to start Network Manager.
Apr 16 18:47:05 NetworkManager[202061]: <error> [1713257225.8595] platform-linux: netlink[rtnl]: read: failed to retrieve incoming events: Bad address (-14)
Apr 16 18:47:05 NetworkManager[202061]: <error> [1713257225.8596] platform-linux: netlink[rtnl]: read: failed to retrieve incoming events: Bad address (-14)
Apr 16 18:47:05 kernel: BUG: scheduling while atomic: NetworkManager/202061/0x00000002
Apr 16 18:47:05 kernel: BUG: scheduling while atomic: NetworkManager/202061/0x00000000
Apr 16 18:47:05 kernel: BUG: scheduling while atomic: Link Monitor/1903/0x00000002
Apr 16 18:47:05 kernel: BUG: scheduling while atomic: Link Monitor/1903/0x00000000
Apr 16 18:47:06 kernel: BUG: scheduling while atomic: conky/111712/0x00000002
Apr 16 18:47:06 kernel: BUG: scheduling while atomic: conky/111712/0x00000000
Apr 16 18:47:06 kernel: BUG: scheduling while atomic: Qt bearer threa/1934/0x00000002
Apr 16 18:47:06 kernel: BUG: scheduling while atomic: Qt bearer threa/1877/0x00000002
Apr 16 18:47:06 kernel: BUG: scheduling while atomic: Qt bearer threa/1934/0x00000000
Apr 16 18:47:06 kernel: BUG: scheduling while atomic: Qt bearer threa/1934/0x00000002
Apr 16 18:47:06 kernel: BUG: scheduling while atomic: Qt bearer threa/1877/0x00000000
Apr 16 18:47:06 kernel: BUG: scheduling while atomic: Qt bearer threa/1934/0x00000000
Apr 16 18:47:06 systemd-coredump[202066]: Process 202061 (NetworkManager) of user 0 dumped core.
Apr 16 18:47:06 systemd[1]: Failed to start Network Manager.
Apr 16 18:47:06 systemd-coredump[202073]: Process 111704 (conky) of user 1000 dumped core.
Apr 16 18:47:06 kernel: BUG: scheduling while atomic: NetworkManager/202083/0x00000002
Apr 16 18:47:06 kernel: BUG: scheduling while atomic: NetworkManager/202083/0x00000000
Apr 16 18:47:06 NetworkManager[202083]: <error> [1713257226.5571] platform-linux: netlink[rtnl]: read: failed to retrieve incoming events: Bad address (-14)
Apr 16 18:47:06 NetworkManager[202083]: <error> [1713257226.5571] platform-linux: netlink[rtnl]: read: failed to retrieve incoming events: Bad address (-14)
Apr 16 18:47:06 systemd-coredump[202094]: Process 202083 (NetworkManager) of user 0 dumped core.
Apr 16 18:47:06 systemd[1]: Failed to start Network Manager.
Apr 16 18:47:08 kernel: BUG: scheduling while atomic: kworker/1:4/194948/0x00000002
Apr 16 18:47:08 kernel: BUG: workqueue leaked lock or atomic: kworker/1:4/0x7fffffff/194948
Apr 16 18:47:08 kernel: BUG: scheduling while atomic: kworker/1:4/194948/0x00000000
Apr 16 18:47:08 kernel: BUG: scheduling while atomic: Link Monitor/1903/0x00000002
Apr 16 18:47:08 kernel: BUG: scheduling while atomic: NetworkManager/202109/0x00000002
Apr 16 18:47:08 kernel: BUG: scheduling while atomic: Link Monitor/1903/0x00000000
Apr 16 18:47:08 kernel: BUG: scheduling while atomic: NetworkManager/202109/0x00000000
Apr 16 18:47:09 kernel: BUG: scheduling while atomic: Qt bearer threa/1522/0x00000002
Apr 16 18:47:09 kernel: BUG: scheduling while atomic: Qt bearer threa/1522/0x00000000
Apr 16 18:47:09 systemd-coredump[202144]: Process 202109 (NetworkManager) of user 0 dumped core.
Apr 16 18:47:09 systemd[1]: Failed to start Network Manager.
Apr 16 18:47:09 systemd[1]: Failed to start Network Manager.
Today’s experience leads me to believe that this in not a kernel topic. It is something systemd and/or NetworkManager.service related. I’ve edited the original title to reflect this.
Any ideas why networkmanager would behave so badly when run via systemd but totally fine when run via networkmanager --no-daemon --debug ?
Question: Could this kind of strange intermittent behaviour be explained by intermittent connection to the wifi card? I am always on wifi (not ethernet), and it always seems to be NetworkManager crashing that leads to the subsequent kernel errors.
I’ve had the laptop since new (approx 6 years) and, whilst I look after it, it has experienced a few accidental bumps in its time. Wondering it is worth opening and re-seating the card? (Would rather not open it up if it’s completely infeasible for this to be the cause.)