Unable to shut down or reboot: system hangs or gives watchdog error

My system will no longer shut down or reboot. I was originally using kernel 5.4 and upgraded to 5.10, but that did not resolve it. On shutdown, the system hangs on a black screen and never powers off. On reboot, I get the error:

[ 168.212102] watchdog: watchdog0: watchdog did not stop!

Sometimes the above message is preceded by:

[ 167.731541] kauditd_printk_skb: 45 callbacks suppressed
[ 168.212102] watchdog: watchdog0: watchdog did not stop!

I added nowatchdog to GRUB_CMDLINE_LINUX_DEFAULT= in /etc/default/grub as described here, but I still see the watchdog message on reboot. For both shutdown and reboot, I have to hold the power button to turn off the computer. I’ve read that shutdown/reboot problems can be tied to nvidia. Since the latest update, I can only use the open-source nvidia driver; otherwise I get a black screen on boot.
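One thing worth checking at this point is whether nowatchdog actually reached the running kernel: editing /etc/default/grub has no effect until the GRUB configuration is regenerated, and the parameters: line in the inxi output below does not list it. A minimal sketch of that check follows; has_kernel_param is a hypothetical helper name, and its second argument exists only so the function can be pointed at a test file instead of /proc/cmdline.

```shell
# Hypothetical helper: succeed if the given parameter appears as a whole word
# in a kernel command line file (defaults to the running kernel's /proc/cmdline).
has_kernel_param() {
    param="$1"
    cmdline_file="${2:-/proc/cmdline}"
    # Put each space-separated word on its own line, then require an exact line match
    tr ' ' '\n' < "$cmdline_file" | grep -qx -- "$param"
}

if has_kernel_param nowatchdog; then
    echo "nowatchdog is on the running kernel's command line"
else
    echo "nowatchdog is NOT on the running kernel's command line"
fi
```

If the second message is printed, the edit to /etc/default/grub has probably not been applied to the generated GRUB configuration yet.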

Below is the output of inxi -Fza:

System:
  Kernel: 5.10.7-3-MANJARO x86_64 bits: 64 compiler: gcc v: 10.2.1 
  parameters: BOOT_IMAGE=/boot/vmlinuz-5.10-x86_64 
  root=UUID=03797b50-8d13-441b-8fe0-7801ff62f632 rw quiet 
  cryptdevice=UUID=8e0a0035-265a-4f38-940d-a7eb0cb3614c:luks-8e0a0035-265a-4f38-940d-a7eb0cb3614c 
  root=/dev/mapper/luks-8e0a0035-265a-4f38-940d-a7eb0cb3614c 
  resume=/dev/mapper/luks-8e0a0035-265a-4f38-940d-a7eb0cb3614c 
  Desktop: Cinnamon 4.8.6 tk: GTK 3.24.24 dm: LightDM 1.30.0 
  Distro: Manjaro Linux 
Machine:
  Type: Portable System: Dell product: Latitude E6500 v: N/A 
  serial: <filter> Chassis: type: 8 serial: <filter> 
  Mobo: Dell model: 0PP476 serial: <filter> BIOS: Dell v: A01 
  date: 07/22/2008 
CPU:
  Info: Dual Core model: Intel Core2 Duo P8400 bits: 64 type: MCP 
  arch: Penryn family: 6 model-id: 17 (23) stepping: 6 microcode: 60F 
  L2 cache: 3 MiB 
  flags: lm nx pae sse sse2 sse3 sse4_1 ssse3 bogomips: 9046 
  Speed: 1215 MHz min/max: 800/2268 MHz Core speeds (MHz): 1: 1215 2: 1182 
  Vulnerabilities: Type: itlb_multihit status: KVM: VMX unsupported 
  Type: l1tf mitigation: PTE Inversion 
  Type: mds 
  status: Vulnerable: Clear CPU buffers attempted, no microcode; SMT disabled 
  Type: meltdown mitigation: PTI 
  Type: spec_store_bypass status: Vulnerable 
  Type: spectre_v1 
  mitigation: usercopy/swapgs barriers and __user pointer sanitization 
  Type: spectre_v2 
  mitigation: Full generic retpoline, STIBP: disabled, RSB filling 
  Type: srbds status: Not affected 
  Type: tsx_async_abort status: Not affected 
Graphics:
  Device-1: NVIDIA G98M [Quadro NVS 160M] vendor: Dell driver: nouveau 
  v: kernel bus ID: 01:00.0 chip ID: 10de:06eb 
  Display: x11 server: X.Org 1.20.10 driver: loaded: nouveau 
  unloaded: modesetting alternate: fbdev,nv,vesa display ID: :0 screens: 1 
  Screen-1: 0 s-res: 1440x900 s-dpi: 96 s-size: 381x238mm (15.0x9.4") 
  s-diag: 449mm (17.7") 
  OpenGL: renderer: NV98 v: 3.3 Mesa 20.3.3 direct render: Yes 
Audio:
  Device-1: Intel 82801I HD Audio vendor: Dell driver: snd_hda_intel 
  v: kernel bus ID: 00:1b.0 chip ID: 8086:293e 
  Sound Server: ALSA v: k5.10.7-3-MANJARO 
Network:
  Device-1: Intel 82567LM Gigabit Network vendor: Dell driver: e1000e 
  v: kernel port: efe0 bus ID: 00:19.0 chip ID: 8086:10f5 
  IF: enp0s25 state: down mac: <filter> 
  Device-2: Intel WiFi Link 5100 driver: iwlwifi v: kernel port: df00 
  bus ID: 0c:00.0 chip ID: 8086:4232 
  IF: wlp12s0 state: up mac: <filter> 
RAID:
  Hardware-1: Intel 82801 Mobile SATA Controller [RAID mode] driver: ahci 
  v: 3.0 port: 6ea0 bus ID: 00:1f.2 chip ID: 8086.282a rev: 03 
Drives:
  Local Storage: total: 149.05 GiB used: 20.71 GiB (13.9%) 
  SMART Message: Required tool smartctl not installed. Check --recommends 
  ID-1: /dev/sda maj-min: 8:0 vendor: Fujitsu model: MHW2160BJ FFS G2 
  size: 149.05 GiB block size: physical: 512 B logical: 512 B 
  speed: 3.0 Gb/s serial: <filter> rev: 001C 
Partition:
  ID-1: / raw size: 140.6 GiB size: 137.39 GiB (97.72%) 
  used: 20.71 GiB (15.1%) fs: ext4 dev: /dev/dm-0 maj-min: 254:0 
  mapped: luks-8e0a0035-265a-4f38-940d-a7eb0cb3614c 
Swap:
  Kernel: swappiness: 60 (default) cache pressure: 100 (default) 
  ID-1: swap-1 type: partition size: 8.45 GiB used: 0 KiB (0.0%) 
  priority: -2 dev: /dev/dm-1 maj-min: 254:1 
  mapped: luks-fcf6c9a1-42d5-4c64-b363-42f5c1e43ce3 
Sensors:
  System Temperatures: cpu: 39.0 C mobo: N/A sodimm: SODIMM C gpu: nouveau 
  temp: 50.0 C 
  Fan Speeds (RPM): cpu: 2837 
Info:
  Processes: 186 Uptime: 1h 10m wakeups: 1 Memory: 3.83 GiB 
  used: 1.85 GiB (48.3%) Init: systemd v: 247 Compilers: gcc: 10.2.0 
  Packages: pacman: 1261 lib: 394 Shell: Bash v: 5.1.0 
  running in: gnome-terminal inxi: 3.2.02 

Thank you for your help troubleshooting!

You can safely turn off the watchdog - you can do this using the maxperfwiz script by @cscs

Hi @linux-aarhus. I ran through the script and applied all recommended changes as prompted. However, the computer still behaves as described in the first post and fails to restart or shut down.

Just went over your inxi output - try reinstalling the intel-ucode package.

Why is your controller in RAID mode? I don’t know whether it makes a difference - it just stands out on a laptop.

The system age stands out too - have you tried using the 5.4 LTS kernel?

Hi @linux-aarhus. Yes, it is a very old machine, but unfortunately it is all that I have for now. I am not sure why the controller is in RAID mode. I was originally using the 5.4 kernel and only upgraded to 5.10 to test whether it would fix the shutdown/reboot problems. I have since downgraded back to 5.4. Below are the results of the intel-ucode reinstall and the updated inxi output. The laptop still fails in the same way on shutdown and reboot.

sudo pacman -Syyu intel-ucode
[sudo] password for name: 
:: Synchronizing package databases...
 core                  167.6 KiB  1067 KiB/s 00:00 [######################] 100%
 extra                1993.3 KiB  6.28 MiB/s 00:00 [######################] 100%
 community               6.4 MiB  8.91 MiB/s 00:01 [######################] 100%
 multilib              181.8 KiB  4.44 MiB/s 00:00 [######################] 100%
warning: intel-ucode-20201118-1 is up to date -- reinstalling
:: Starting full system upgrade...
resolving dependencies...
looking for conflicting packages...

Packages (1) intel-ucode-20201118-1

Total Installed Size:  3.46 MiB
Net Upgrade Size:      0.00 MiB

:: Proceed with installation? [Y/n] y
(1/1) checking keys in keyring                     [######################] 100%
(1/1) checking package integrity                   [######################] 100%
(1/1) loading package files                        [######################] 100%
(1/1) checking for file conflicts                  [######################] 100%
(1/1) checking available disk space                [######################] 100%
:: Processing package changes...
(1/1) reinstalling intel-ucode                     [######################] 100%
:: Running post-transaction hooks...
(1/1) Arming ConditionNeedsUpdate...

inxi -Fza

System:    Kernel: 5.4.89-1-MANJARO x86_64 bits: 64 compiler: gcc v: 10.2.1 
           parameters: BOOT_IMAGE=/boot/vmlinuz-5.4-x86_64 root=UUID=03797b50-8d13-441b-8fe0-7801ff62f632 rw quiet 
           cryptdevice=UUID=8e0a0035-265a-4f38-940d-a7eb0cb3614c:luks-8e0a0035-265a-4f38-940d-a7eb0cb3614c 
           root=/dev/mapper/luks-8e0a0035-265a-4f38-940d-a7eb0cb3614c 
           resume=/dev/mapper/luks-8e0a0035-265a-4f38-940d-a7eb0cb3614c 
           Desktop: Cinnamon 4.8.6 tk: GTK 3.24.24 dm: LightDM 1.30.0 Distro: Manjaro Linux 
Machine:   Type: Portable System: Dell product: Latitude E6500 v: N/A serial: <filter> Chassis: type: 8 serial: <filter> 
           Mobo: Dell model: 0PP476 serial: <filter> BIOS: Dell v: A01 date: 07/22/2008 
CPU:       Info: Dual Core model: Intel Core2 Duo P8400 bits: 64 type: MCP arch: Penryn family: 6 model-id: 17 (23) 
           stepping: 6 microcode: 60F L2 cache: 3 MiB 
           flags: lm nx pae sse sse2 sse3 sse4_1 ssse3 vmx bogomips: 9047 
           Speed: 798 MHz min/max: 800/2268 MHz Core speeds (MHz): 1: 798 2: 798 
           Vulnerabilities: Type: itlb_multihit status: KVM: Vulnerable 
           Type: l1tf mitigation: PTE Inversion 
           Type: mds status: Vulnerable: Clear CPU buffers attempted, no microcode; SMT disabled 
           Type: meltdown mitigation: PTI 
           Type: spec_store_bypass status: Vulnerable 
           Type: spectre_v1 mitigation: usercopy/swapgs barriers and __user pointer sanitization 
           Type: spectre_v2 mitigation: Full generic retpoline, STIBP: disabled, RSB filling 
           Type: srbds status: Not affected 
           Type: tsx_async_abort status: Not affected 
Graphics:  Device-1: NVIDIA G98M [Quadro NVS 160M] vendor: Dell driver: nouveau v: kernel bus ID: 01:00.0 chip ID: 10de:06eb 
           Display: x11 server: X.Org 1.20.10 driver: loaded: nouveau unloaded: modesetting alternate: fbdev,nv,vesa 
           display ID: :0 screens: 1 
           Screen-1: 0 s-res: 1440x900 s-dpi: 96 s-size: 381x238mm (15.0x9.4") s-diag: 449mm (17.7") 
           OpenGL: renderer: NV98 v: 3.3 Mesa 20.3.3 direct render: Yes 
Audio:     Device-1: Intel 82801I HD Audio vendor: Dell driver: snd_hda_intel v: kernel bus ID: 00:1b.0 chip ID: 8086:293e 
           Sound Server: ALSA v: k5.4.89-1-MANJARO 
Network:   Device-1: Intel 82567LM Gigabit Network vendor: Dell driver: e1000e v: 3.2.6-k port: efe0 bus ID: 00:19.0 
           chip ID: 8086:10f5 
           IF: enp0s25 state: down mac: <filter> 
           Device-2: Intel WiFi Link 5100 driver: iwlwifi v: kernel port: df00 bus ID: 0c:00.0 chip ID: 8086:4232 
           IF: wlp12s0 state: up mac: <filter> 
RAID:      Hardware-1: Intel 82801 Mobile SATA Controller [RAID mode] driver: ahci v: 3.0 port: 6ea0 bus ID: 00:1f.2 
           chip ID: 8086.282a rev: 03 
Drives:    Local Storage: total: 149.05 GiB used: 20.79 GiB (13.9%) 
           SMART Message: Required tool smartctl not installed. Check --recommends 
           ID-1: /dev/sda maj-min: 8:0 vendor: Fujitsu model: MHW2160BJ FFS G2 size: 149.05 GiB block size: physical: 512 B 
           logical: 512 B speed: 3.0 Gb/s serial: <filter> rev: 001C 
Partition: ID-1: / raw size: 140.6 GiB size: 137.39 GiB (97.72%) used: 20.79 GiB (15.1%) fs: ext4 dev: /dev/dm-0 
           maj-min: 254:0 mapped: luks-8e0a0035-265a-4f38-940d-a7eb0cb3614c 
Swap:      Kernel: swappiness: 10 (default 60) cache pressure: 75 (default 100) 
           ID-1: swap-1 type: partition size: 8.45 GiB used: 0 KiB (0.0%) priority: -2 dev: /dev/dm-1 maj-min: 254:1 
           mapped: luks-fcf6c9a1-42d5-4c64-b363-42f5c1e43ce3 
Sensors:   System Temperatures: cpu: 42.0 C mobo: N/A sodimm: SODIMM C gpu: nouveau temp: 52.0 C 
           Fan Speeds (RPM): cpu: 2798 
Info:      Processes: 174 Uptime: 7m wakeups: 1 Memory: 3.83 GiB used: 1.47 GiB (38.3%) Init: systemd v: 247 Compilers: 
           gcc: 10.2.0 Packages: pacman: 1269 lib: 395 Shell: Bash v: 5.1.0 running in: gnome-terminal inxi: 3.2.02

Thanks!

There is nothing wrong with having an old machine - if it works and fulfills its purpose then :+1: - I was just throwing out ideas - and as I said - I am not sure those particular settings make a difference.

I did a search on your system and found a wiki entry on Centos.org.

It appears to reference a system comparable to yours.

I also found an article on arch wiki on performance improvements mentioning watchdog module.

I don’t know how much of this is new to you - in any case, apologies if you have already read them.

The laptop has worked great for my purposes up until the most recent update. I’d like to continue using it rather than disposing of it.

Thank you for doing the research. I had already seen the ArchWiki link and added nowatchdog to GRUB_CMDLINE_LINUX_DEFAULT= in /etc/default/grub as described here, but that didn’t make a difference. The wiki says “to disable watchdog timers (both software and hardware), append nowatchdog to your boot parameters.” Besides the aforementioned change and running the maxperfwiz script, I am not sure how else to add nowatchdog to the boot parameters.
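For reference, a kernel parameter added to /etc/default/grub only takes effect after /boot/grub/grub.cfg is regenerated (on Manjaro, with update-grub) and the machine is rebooted. The sketch below shows one way to append a parameter; add_grub_param is a hypothetical helper, and it assumes the GRUB_CMDLINE_LINUX_DEFAULT value is wrapped in double quotes on a single line.

```shell
# Hypothetical helper: append a kernel parameter to GRUB_CMDLINE_LINUX_DEFAULT
# in the given file, unless it is already there.
# Assumption: the value is enclosed in double quotes on one line.
add_grub_param() {
    param="$1"
    grub_file="$2"

    # Nothing to do if the parameter is already present as a whole word
    if grep -Eq "^GRUB_CMDLINE_LINUX_DEFAULT=.*[\" ]${param}[\" ]" "$grub_file"; then
        return 0
    fi

    # Insert the parameter just before the closing double quote,
    # writing through a temporary file so the original keeps its permissions
    tmp_file=$(mktemp)
    sed "s/^\(GRUB_CMDLINE_LINUX_DEFAULT=\"[^\"]*\)\"/\1 ${param}\"/" "$grub_file" > "$tmp_file" \
        && cat "$tmp_file" > "$grub_file"
    rm -f "$tmp_file"
}

# Usage on the real system (as root, after making a backup):
#   cp /etc/default/grub /etc/default/grub.bak
#   add_grub_param nowatchdog /etc/default/grub
#   update-grub    # regenerates /boot/grub/grub.cfg on Manjaro
#   reboot
```

After the reboot, cat /proc/cmdline should list nowatchdog if the change was picked up.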

The centos link is interesting. While I have used centos a little in the past, I prefer manjaro and am hoping to keep it. Are you suggesting I switch to centos? Do you have any additional suggestions for correcting the reboot/shutdown failure?

Thanks.

No - I am not suggesting you should switch. But the centos link mentioned a setting which made the sleep function possible without the Nvidia driver.

You mentioned Nvidia as a possible cause of the issue.

The arch wiki article also includes this note:

Note: Some users reported the nowatchdog parameter does not work as expected but they have successfully disabled the watchdog (at least the hardware one) by blacklisting the above-mentioned module.

Which leads to the suggestion of blacklisting the watchdog module as described.

The centos wiki mentions adding a password to the BIOS for suspend to work. Would this affect shutdown/reboot as well, or just suspend? I am using the open-source drivers for nvidia and have assumed that nvidia is the problem, since historically it has been the cause of many issues.

Do you know how to “blacklist” iTCO_wdt? I believe I do so within a .conf file in /etc/modprobe.d. Do I need to modify an existing file or should I create one specifically for this? Such as /etc/modprobe.d/myfile.conf. Within the conf file, I believe all it takes is just to add blacklist iTCO_wdt?
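As a sketch of what that could look like: modprobe reads every file ending in .conf under /etc/modprobe.d, so a separate file is enough and no existing file needs to be modified. The helper name blacklist_module and the file name blacklist-<module>.conf are illustrative assumptions; the demo writes into a temporary directory so it can be tried without root.

```shell
# Hypothetical helper: write a one-line blacklist file for a kernel module.
# The second argument (target directory) defaults to /etc/modprobe.d.
blacklist_module() {
    module="$1"
    conf_dir="${2:-/etc/modprobe.d}"
    conf_file="$conf_dir/blacklist-$module.conf"   # illustrative file name

    # A single "blacklist <module>" line is all the file needs
    printf 'blacklist %s\n' "$module" > "$conf_file"
    echo "$conf_file"
}

# Demo against a temporary directory instead of the real /etc/modprobe.d
demo_dir=$(mktemp -d)
created=$(blacklist_module iTCO_wdt "$demo_dir")
cat "$created"    # prints: blacklist iTCO_wdt
```

On the real system the function would be run as root without the second argument, followed by a reboot; lsmod | grep iTCO_wdt should then print nothing if the module stayed unloaded.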

Thanks

Are there any further suggestions from the Manjaro community? I have run out of ideas and am hoping that it is not time to scrap this laptop. Thanks.