Optimus-manager coredumps after upgrade

Hi,

I updated my system today. Last time I updated was about 1 month ago. Nvidia drivers and optimus-manager were working ok before the update.

After the update, I can no longer launch optimus-manager. The error I get is

 john@ja-clevo ~]$ optimus-manager
> ERROR: the latest GPU setup attempt failed at Xorg pre-start hook.
> Log at /var/log/optimus-manager/switch/switch-20211203T172638.log
> Cannot execute command because of previous errors
john@ja-clevo ~]$ optimus-manager-qt 
Unable to read current mode from Optimus Manager state file
Aborted (core dumped)

When I view the optimus manager log, I see:

optimus-manager log
8] INFO: # Xorg pre-start hook
[8] INFO: Previous state was: {'type': 'pending_pre_xorg_start', 'requested_mode': 'hybrid', 'current_mode': None}
[8] INFO: Requested mode is: hybrid
[442] INFO: Available modules: ['nouveau', 'nvidia', 'nvidia_drm', 'nvidia_modeset', 'nvidia_uvm']
[442] INFO: Unloading modules ['nouveau'] (if loaded)
[445] INFO: switching=none, nothing to do
[485] INFO: Loading module nvidia
[662] ERROR: Xorg pre-start setup error
Traceback (most recent call last):
  File "/usr/lib/python3.9/site-packages/optimus_manager/kernel.py", line 245, in _load_module
    subprocess.check_call(
  File "/usr/lib/python3.9/subprocess.py", line 373, in check_call
    raise CalledProcessError(retcode, cmd)
subprocess.CalledProcessError: Command 'modprobe nvidia NVreg_UsePageAttributeTable=1' returned non-zero exit status 1.

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "/usr/lib/python3.9/site-packages/optimus_manager/hooks/pre_xorg_start.py", line 51, in main
    setup_kernel_state(config, prev_state, requested_mode)
  File "/usr/lib/python3.9/site-packages/optimus_manager/kernel.py", line 22, in setup_kernel_state
    _nvidia_up(config, hybrid=(requested_mode == "hybrid"))
  File "/usr/lib/python3.9/site-packages/optimus_manager/kernel.py", line 95, in _nvidia_up
    _load_nvidia_modules(config, available_modules)
  File "/usr/lib/python3.9/site-packages/optimus_manager/kernel.py", line 164, in _load_nvidia_modules
    _load_module(available_modules, "nvidia", options=nvidia_options)
  File "/usr/lib/python3.9/site-packages/optimus_manager/kernel.py", line 249, in _load_module
    raise KernelSetupError(f"Error running modprobe for {module}: {e.stderr}") from e
optimus_manager.kernel.KernelSetupError: Error running modprobe for nvidia: None
[663] INFO: Removing /etc/X11/xorg.conf.d/10-optimus-manager.conf (if present)
[663] INFO: Writing state {'type': 'pre_xorg_start_failed', 'switch_id': '20211203T172638', 'requested_mode': 'hybrid'}

I tried stopping and starting the optimus manager service but it still gives me an error exit code:

optimus-manager.service
● optimus-manager.service - Optimus Manager Commands Daemon
     Loaded: loaded (/usr/lib/systemd/system/optimus-manager.service; enabled; vendor preset: disabled)
     Active: active (running) since Fri 2021-12-03 17:50:42 EST; 4s ago
    Process: 4802 ExecStartPre=/usr/bin/python3 -u -m optimus_manager.hooks.pre_daemon_start (code=exited, status=0/SUCCESS)
    Process: 4803 ExecStartPre=/usr/bin/python3 -u -m optimus_manager.hooks.pre_xorg_start (code=exited, status=1/FAILURE)
   Main PID: 4818 (python3)
      Tasks: 1 (limit: 38044)
     Memory: 6.3M
        CPU: 728ms
     CGroup: /system.slice/optimus-manager.service
             └─4818 /usr/bin/python3 -u -m optimus_manager.daemon

Dec 03 17:50:42 ja-clevo python3[4803]:     _load_module(available_modules, "nvidia", options=nvidia_options)
Dec 03 17:50:42 ja-clevo python3[4803]:   File "/usr/lib/python3.9/site-packages/optimus_manager/kernel.py", line 249, in _load_module
Dec 03 17:50:42 ja-clevo python3[4803]:     raise KernelSetupError(f"Error running modprobe for {module}: {e.stderr}") from e
Dec 03 17:50:42 ja-clevo python3[4803]: optimus_manager.kernel.KernelSetupError: Error running modprobe for nvidia: None
Dec 03 17:50:42 ja-clevo python3[4803]: [617] INFO: Removing /etc/X11/xorg.conf.d/10-optimus-manager.conf (if present)
Dec 03 17:50:42 ja-clevo python3[4803]: [617] INFO: Writing state {'type': 'pre_xorg_start_failed', 'switch_id': '20211203T175041', 'requested_mode': 'hybrid'}
Dec 03 17:50:42 ja-clevo systemd[1]: Started Optimus Manager Commands Daemon.
Dec 03 17:50:42 ja-clevo python3[4818]: [1] INFO: # Commands daemon
Dec 03 17:50:42 ja-clevo python3[4818]: [1] INFO: Opening UNIX socket
Dec 03 17:50:42 ja-clevo python3[4818]: [1] INFO: Awaiting commands
inxi -Fza
System:    Kernel: 5.14.10-1-MANJARO x86_64 bits: 64 compiler: gcc v: 11.1.0
           parameters: initrd=\amd-ucode.img initrd=\Manjaro\amd-ucode.img initrd=\intel-ucode.img
           initrd=\initramfs-5.14-x86_64.img root=UUID=c4cf79bc-b324-4058-83ba-bead81dc6c08 rw
           Desktop: KDE Plasma 5.23.3 tk: Qt 5.15.2 wm: kwin_x11 vt: 1 dm: SDDM Distro: Manjaro Linux base: Arch Linux
Graphics:  Device-1: Intel CometLake-H GT2 [UHD Graphics] vendor: CLEVO/KAPOK driver: i915 v: kernel bus-ID: 00:02.0
           chip-ID: 8086:9bc4 class-ID: 0300
           Device-2: NVIDIA TU106M [GeForce RTX 2060 Mobile] vendor: CLEVO/KAPOK driver: N/A
           alternate: nouveau, nvidia_drm, nvidia bus-ID: 01:00.0 chip-ID: 10de:1f15 class-ID: 0300
           Device-3: Acer BisonCam NB Pro type: USB driver: uvcvideo bus-ID: 1-8:2 chip-ID: 5986:9102 class-ID: 0e02
           Display: x11 server: X.Org 1.21.1.1 compositor: kwin_x11 driver: loaded: modesetting alternate: fbdev,vesa
           display-ID: :0 screens: 1
           Screen-1: 0 s-res: 1920x1080 s-dpi: 96 s-size: 508x285mm (20.0x11.2") s-diag: 582mm (22.9")
           Monitor-1: eDP-1 res: 1920x1080 hz: 144 dpi: 142 size: 344x194mm (13.5x7.6") diag: 395mm (15.5")
           OpenGL: renderer: Mesa Intel UHD Graphics (CML GT2) v: 4.6 Mesa 21.2.5 direct render: Yes

I’ve been through the troubleshooting guide on https://github.com/Askannz/optimus-manager/blob/master/README.md#important–manjaro-kde-users but my system check out.

I’d appreciate any tips or advise to help resolve.

Thanks,

So it looks like only my nouveau module is loaded:

[john@ja-clevo ~]$ sudo modprobe nouveau
john@ja-clevo ~]$ sudo modprobe nvidia -vvv
modprobe: INFO: custom logging function 0x55d554168310 registered
insmod /lib/modules/5.14.10-1-MANJARO/extramodules/nvidia.ko.gz "NVreg_DynamicPowerManagement=0x02" 
modprobe: INFO: Failed to insert module '/lib/modules/5.14.10-1-MANJARO/extramodules/nvidia.ko.gz': Exec format error
modprobe: ERROR: could not insert 'nvidia': Exec format error
modprobe: INFO: context 0x55d554a4d4d0 released

I was able to resolve this!

I noticed that my /efi partition was 100% full. I reasoned that perhaps the update did not complete successfully because of a lack of filesystem space. I had initramfs-5.10 and initramfs-5.14 normal and fallback .img files in /efi and /efi/Manjaro; the initramfs files should only be in /efi.

So I dropped into a chroot session.
Deleted all initramfs*.img files from /efi/Manjaro.
ran mkinitcpio -P
exit
I then rebooted

All now ok.

This topic was automatically closed 2 days after the last reply. New replies are no longer allowed.