System sometimes refuses to reboot or shut down

Uhm… I’m experiencing some weird behavior after the latest update. My system sometimes refuses to reboot or shut down. Running shutdown, systemctl poweroff, or systemctl reboot (both with and without --force) just results in a timeout. I can’t pin down the reason; it only happens some of the time. Switching to a TTY doesn’t help either: I just get a blinking cursor and no login prompt.

System:
  Host: jonas Kernel: 5.13.4-1-MANJARO x86_64 bits: 64 Desktop: GNOME 40.3 
  Distro: Manjaro Linux 
Machine:
  Type: Desktop Mobo: Gigabyte model: B550 AORUS PRO v: x.x 
  serial: <superuser required> BIOS: American Megatrends v: F10 
  date: 09/18/2020 
CPU:
  Info: 8-Core model: AMD Ryzen 7 3800XT bits: 64 type: MT MCP cache: 
  L2: 4 MiB 
  Speed: 2179 MHz min/max: 2200/3900 MHz Core speeds (MHz): 1: 2179 2: 2080 
  3: 2100 4: 2101 5: 2061 6: 2176 7: 2053 8: 3593 9: 2100 10: 2067 11: 2053 
  12: 2053 13: 2053 14: 3593 15: 2867 16: 2212 
Graphics:
  Device-1: NVIDIA GA104 [GeForce RTX 3060 Ti] driver: nvidia v: 470.57.02 
  Device-2: Logitech HD Webcam C510 type: USB driver: snd-usb-audio,uvcvideo 
  Display: x11 server: X.org 1.20.11 driver: loaded: nvidia 
  resolution: <missing: xdpyinfo> 
  OpenGL: renderer: NVIDIA GeForce RTX 3060 Ti/PCIe/SSE2 
  v: 4.6.0 NVIDIA 470.57.02 
Audio:
  Device-1: NVIDIA GA104 High Definition Audio driver: snd_hda_intel 
  Device-2: AMD Starship/Matisse HD Audio driver: snd_hda_intel 
  Device-3: Logitech HD Webcam C510 type: USB driver: snd-usb-audio,uvcvideo 
  Sound Server-1: ALSA v: k5.13.4-1-MANJARO running: yes 
  Sound Server-2: PulseAudio v: 14.2 running: yes 
  Sound Server-3: PipeWire v: 0.3.32 running: yes 
Network:
  Device-1: Realtek RTL8125 2.5GbE driver: r8169 
  IF: eno1 state: up speed: 1000 Mbps duplex: full mac: 18:c0:4d:3d:86:9a 
  IF-ID-1: br-51f354fd0888 state: up speed: 10000 Mbps duplex: unknown 
  mac: 02:42:dc:6a:3f:a1 
  IF-ID-2: br-e621a92c68ee state: up speed: 10000 Mbps duplex: unknown 
  mac: 02:42:30:61:9d:94 
  IF-ID-3: docker0 state: down mac: 02:42:6c:f3:f5:04 
  IF-ID-4: veth096aef6 state: up speed: 10000 Mbps duplex: full 
  mac: ce:23:de:37:c2:0d 
  IF-ID-5: veth30bb934 state: up speed: 10000 Mbps duplex: full 
  mac: 8a:69:73:a9:41:0a 
  IF-ID-6: veth900d55a state: up speed: 10000 Mbps duplex: full 
  mac: 62:6d:c0:4b:56:8a 
  IF-ID-7: vethb2c8f15 state: up speed: 10000 Mbps duplex: full 
  mac: 1a:29:c6:49:27:f3 
  IF-ID-8: vethb3088cf state: up speed: 10000 Mbps duplex: full 
  mac: be:ef:e8:e8:2d:72 
  IF-ID-9: vethf1119d5 state: up speed: 10000 Mbps duplex: full 
  mac: 46:74:09:dd:5a:53 
  IF-ID-10: vethfefa666 state: up speed: 10000 Mbps duplex: full 
  mac: f6:14:ec:1c:b4:7e 
Drives:
  Local Storage: total: 5.46 TiB used: 1.16 TiB (21.2%) 
  ID-1: /dev/nvme0n1 vendor: Crucial model: CT1000P5SSD8 size: 931.51 GiB 
  ID-2: /dev/sda vendor: Western Digital model: WD40EZRZ-00GXCB0 
  size: 3.64 TiB 
  ID-3: /dev/sdb vendor: Samsung model: SSD 860 QVO 1TB size: 931.51 GiB 
Partition:
  ID-1: / size: 915.81 GiB used: 374.67 GiB (40.9%) fs: ext4 
  dev: /dev/nvme0n1p1 
Swap:
  ID-1: swap-1 type: file size: 512 MiB used: 0 KiB (0.0%) file: /swapfile 
Sensors:
  System Temperatures: cpu: 53.5 C mobo: 36.0 C gpu: nvidia temp: 53 C 
  Fan Speeds (RPM): fan-1: 0 fan-2: 0 fan-3: 543 gpu: nvidia fan: 0% 
Info:
  Processes: 407 Uptime: 21m Memory: 31.3 GiB used: 4.85 GiB (15.5%) 
  Shell: Zsh inxi: 3.3.06

This is probably Nvidia related (as weird as that may sound). To confirm it, remove the Nvidia drivers, reboot, and then test reboot/shutdown a few times to see whether the issue still occurs without them. Be aware that I don’t know whether you will reboot properly or end up on a black screen afterwards; the nouveau driver may not work for you.

I had the issue a long time ago and currently it doesn’t hang anymore. The issue is random.

This will not solve the issue, but it lets you confirm whether it is Nvidia related.

Before removing the Nvidia driver I would test other kernels like 5.12 or 5.10 first; that is less likely to leave you with an unbootable system.


I also want to point out that you should update your BIOS: B550 AORUS PRO (rev. 1.0) Gallery | Motherboard - GIGABYTE Global

Ok… I did a clean reinstall. I upgraded my BIOS and, as it turns out, my system was using legacy boot. As a result I wasn’t able to use systemctl reboot --firmware-setup, and I also had Fast-Boot enabled… Flashing my BIOS using Flashback did work, though.
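
For anyone wanting to check the same thing: a minimal sketch for telling whether the running system was booted via UEFI or legacy BIOS. It relies on the kernel exposing /sys/firmware/efi only on UEFI boots. The function name boot_mode and its optional path argument are made up for illustration and are not part of any tool.

```shell
#!/bin/sh
# Report whether the running system was booted via UEFI or legacy BIOS.
# The kernel creates /sys/firmware/efi only when it was started by UEFI firmware.
# boot_mode is a hypothetical helper; the optional first argument overrides
# the directory to check and exists only to make the function easy to test.
boot_mode() {
    efi_dir="${1:-/sys/firmware/efi}"
    if [ -d "$efi_dir" ]; then
        echo "UEFI"
    else
        echo "legacy BIOS"
    fi
}

boot_mode
```

If this prints "legacy BIOS", systemctl reboot --firmware-setup is expected to be refused, since that option needs EFI support.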

I’m back on kernel 5.10 and EFI is now enabled. Moving to Nouveau is not an option for me, as I need CUDA for work.

I’ll let you know if I encounter that bug again. Thank you all for your kind words and support. Please let me know if there is any kind of diagnostic I can run in case things go bad again. My shell did seem to be responsive.
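
In case the hang comes back, here is a rough sketch of what could be collected from a shell that is still responsive. It is only one possible starting point, not a definitive procedure. The helper name collect_shutdown_diag and the output file names are invented for illustration. It assumes the journal is stored persistently, since otherwise journalctl has no previous boot to show.

```shell
#!/bin/sh
# Collect logs that may help explain a hung reboot or shutdown.
# collect_shutdown_diag is a hypothetical helper; the output directory
# (first argument, default ./shutdown-diag) and file names are illustrative.
collect_shutdown_diag() {
    out_dir="${1:-./shutdown-diag}"
    mkdir -p "$out_dir" || return 1

    if command -v journalctl >/dev/null 2>&1; then
        # Messages of priority "warning" or worse from the previous boot (-b -1),
        # which includes whatever was logged while that boot was shutting down.
        journalctl -b -1 -p warning --no-pager > "$out_dir/previous-boot.log" 2>&1 || true
    else
        echo "journalctl not found" > "$out_dir/previous-boot.log"
    fi

    if command -v systemctl >/dev/null 2>&1; then
        # Jobs systemd still has queued; during a hanging shutdown this shows
        # which units it is waiting on.
        systemctl list-jobs --no-pager > "$out_dir/jobs.log" 2>&1 || true
        # Units currently in a failed state.
        systemctl --failed --no-pager > "$out_dir/failed-units.log" 2>&1 || true
    else
        echo "systemctl not found" > "$out_dir/jobs.log"
        echo "systemctl not found" > "$out_dir/failed-units.log"
    fi

    # Print where the logs were written.
    echo "$out_dir"
}
```

Running collect_shutdown_diag ~/diag while a shutdown is stuck, and again after the next successful boot, should give two sets of logs to compare or post here.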