NVIDIA Card Fails to Suspend

Hi all,

I have a problem suspending my laptop from time to time.
Some times when I suspend my laptop it awakes after a couple of seconds with a locked screen. Every time I can find the following excerpt in the suspend journal:

Aug 16 22:06:28 machwx9 systemd-logind[475]: The system will suspend now!
Aug 16 22:06:28 machwx9 ModemManager[531]: <info>  [sleep-monitor-systemd] system is about to suspend
Aug 16 22:06:29 machwx9 systemd[1]: Starting System Suspend...
Aug 16 22:06:29 machwx9 systemd-sleep[4784]: Entering sleep state 'suspend'...
Aug 16 22:06:29 machwx9 kernel: PM: suspend entry (deep)
Aug 16 22:06:30 machwx9 kernel: printk: Suspending console(s) (use no_console_suspend to debug)
Aug 16 22:06:30 machwx9 kernel: nvidia 0000:01:00.0: PM: pci_pm_suspend(): nv_pmops_suspend+0x0/0x30 [nvidia] returns -5
Aug 16 22:06:30 machwx9 kernel: nvidia 0000:01:00.0: PM: dpm_run_callback(): pci_pm_suspend+0x0/0x170 returns -5
Aug 16 22:06:30 machwx9 kernel: nvidia 0000:01:00.0: PM: failed to suspend async: error -5
Aug 16 22:06:30 machwx9 kernel: PM: Some devices failed to suspend, or early wake event detected
Aug 16 22:06:30 machwx9 kernel: PM: suspend exit
Aug 16 22:06:30 machwx9 kernel: PM: suspend entry (s2idle)
Aug 16 22:06:31 machwx9 kernel: printk: Suspending console(s) (use no_console_suspend to debug)
Aug 16 22:06:31 machwx9 kernel: nvidia 0000:01:00.0: PM: pci_pm_suspend(): nv_pmops_suspend+0x0/0x30 [nvidia] returns -5
Aug 16 22:06:31 machwx9 kernel: nvidia 0000:01:00.0: PM: dpm_run_callback(): pci_pm_suspend+0x0/0x170 returns -5
Aug 16 22:06:31 machwx9 kernel: nvidia 0000:01:00.0: PM: failed to suspend async: error -5
Aug 16 22:06:31 machwx9 kernel: PM: Some devices failed to suspend, or early wake event detected
Aug 16 22:06:31 machwx9 kernel: PM: suspend exit
Aug 16 22:06:31 machwx9 systemd[1]: systemd-suspend.service: Main process exited, code=exited, status=1/FAILURE
Aug 16 22:06:31 machwx9 systemd[1]: systemd-suspend.service: Failed with result 'exit-code'.
Aug 16 22:06:31 machwx9 systemd[1]: Failed to start System Suspend.
Aug 16 22:06:31 machwx9 systemd[1]: Dependency failed for Suspend.
Aug 16 22:06:31 machwx9 systemd[1]: suspend.target: Job suspend.target/start failed with result 'dependency'.

I can see that the nvidia 0000:01:00.0: PM: failed to suspend async: error -5 seems to be the root-cause of the problem since it fails to suspend but I have no idea how to fix it.

Any ideas are appreciated!

Thanks and cheers
Ledex

Have you tried all this?

https://wiki.archlinux.org/title/Power_management/Suspend_and_hibernate#Suspend/hibernate_does_not_work,_or_does_not_work_consistently

And

https://wiki.archlinux.org/title/NVIDIA/Tips_and_tricks#Preserve_video_memory_after_suspend

Hi bedna,

thanks for the links.

  • First I tried disabling kernal-module nvidiafb, didn’t work.
  • Then I checked if there are any i2c devices that trigger the wakeup ($ ls /sys/bus/i2c/devices/*/power/wakeup), there were none.
  • After that I followed your second link to also preserve the video-memory when suspending. Doing that lead to the journal-entry below. However, since the issue still persisted I disabled that feature again.
Aug 17 13:00:06 machwx9 kernel: NVRM: GPU 0000:01:00.0: PreserveVideoMemoryAllocations module parameter is set. System Power Management attempted without driver procfs suspend interface. Please refer to the 'Configuring Power Management Support' section in the driver README.
  • Last thing I tried was to change the suspend-method from deep to s2idle, but sadly agiain without any change.

I’ll investigate a bit further this afternoon.

Cheers
Karsten

Yeah, you are further in the research than I am. I gave up on trying to get it to work. Sorry…

Well I feel you… I removed the video-hybrid-intel-nvidia-prime-driver and with the video-linux-driver the suspend works.

Still that is a workaround rather than a solution…

provide full system info:
inxi -zav7
and the log output already shows that you were using s2idle:

so are these logs after you changed from deep to s2idle?

Hi sorry for not sharing it originally:

System:
  Kernel: 6.1.44-1-MANJARO arch: x86_64 bits: 64 compiler: gcc v: 13.2.1
    clocksource: tsc available: acpi_pm
    parameters: BOOT_IMAGE=/boot/vmlinuz-6.1-x86_64
    root=UUID=914d8524-8547-4bfc-b1f3-bbaf0b28e77f rw quiet splash apparmor=1
    security=apparmor udev.log_priority=3
  Desktop: GNOME v: 44.3 tk: GTK v: 3.24.38 wm: gnome-shell dm: GDM v: 44.1
    Distro: Manjaro Linux base: Arch Linux
Machine:
  Type: Laptop System: HUAWEI product: MACH-WX9 v: M1A
    serial: <superuser required>
  Mobo: HUAWEI model: MACH-WX9 v: M1A serial: <superuser required>
    UEFI: HUAWEI v: 1.37 date: 05/12/2021
Battery:
  ID-1: BAT0 charge: 15.0 Wh (29.7%) condition: 50.5/56.3 Wh (89.7%)
    power: 3.9 W volts: 7.6 min: 7.6 model: DYNAPACK HB4593R1ECW-22T0
    type: Li-ion serial: <filter> status: charging cycles: 217
Memory:
  System RAM: total: 16 GiB note: est. available: 15.38 GiB
    used: 3.77 GiB (24.5%)
  RAM Report: permissions: Unable to run dmidecode. Root privileges
    required.
CPU:
  Info: model: Intel Core i7-8550U bits: 64 type: MT MCP arch: Coffee Lake
    gen: core 8 level: v3 note: check built: 2017 process: Intel 14nm family: 6
    model-id: 0x8E (142) stepping: 0xA (10) microcode: 0xF4
  Topology: cpus: 1x cores: 4 tpc: 2 threads: 8 smt: enabled cache:
    L1: 256 KiB desc: d-4x32 KiB; i-4x32 KiB L2: 1024 KiB desc: 4x256 KiB
    L3: 8 MiB desc: 1x8 MiB
  Speed (MHz): avg: 1250 high: 2000 min/max: 400/4000 scaling:
    driver: intel_pstate governor: powersave cores: 1: 800 2: 2000 3: 2000
    4: 2000 5: 800 6: 800 7: 800 8: 800 bogomips: 32012
  Flags: 3dnowprefetch abm acpi adx aes aperfmperf apic arat
    arch_capabilities arch_perfmon art avx avx2 bmi1 bmi2 bts clflush
    clflushopt cmov constant_tsc cpuid cpuid_fault cx16 cx8 de ds_cpl dtes64
    dtherm dts epb ept ept_ad erms est f16c flexpriority flush_l1d fma fpu
    fsgsbase fxsr ht hwp hwp_act_window hwp_epp hwp_notify ibpb ibrs ida
    intel_pt invpcid invpcid_single lahf_lm lm mca mce md_clear mmx monitor
    movbe mpx msr mtrr nonstop_tsc nopl nx pae pat pbe pcid pclmulqdq pdcm
    pdpe1gb pebs pge pln pni popcnt pse pse36 pti pts rdrand rdseed rdtscp
    rep_good sdbg sep sgx smap smep ss ssbd sse sse2 sse4_1 sse4_2 ssse3
    stibp syscall tm tm2 tpr_shadow tsc tsc_adjust tsc_deadline_timer vme vmx
    vnmi vpid x2apic xgetbv1 xsave xsavec xsaveopt xsaves xtopology xtpr
  Vulnerabilities:
  Type: gather_data_sampling mitigation: Microcode
  Type: itlb_multihit status: KVM: VMX disabled
  Type: l1tf mitigation: PTE Inversion; VMX: conditional cache flushes, SMT
    vulnerable
  Type: mds mitigation: Clear CPU buffers; SMT vulnerable
  Type: meltdown mitigation: PTI
  Type: mmio_stale_data mitigation: Clear CPU buffers; SMT vulnerable
  Type: retbleed mitigation: IBRS
  Type: spec_rstack_overflow status: Not affected
  Type: spec_store_bypass mitigation: Speculative Store Bypass disabled via
    prctl
  Type: spectre_v1 mitigation: usercopy/swapgs barriers and __user pointer
    sanitization
  Type: spectre_v2 mitigation: IBRS, IBPB: conditional, STIBP: conditional,
    RSB filling, PBRSB-eIBRS: Not affected
  Type: srbds mitigation: Microcode
  Type: tsx_async_abort status: Not affected
Graphics:
  Device-1: Intel UHD Graphics 620 vendor: Huawei driver: i915 v: kernel
    arch: Gen-9.5 process: Intel 14nm built: 2016-20 ports: active: eDP-1
    empty: DP-1,DP-2,HDMI-A-1 bus-ID: 00:02.0 chip-ID: 8086:5917
    class-ID: 0300
  Device-2: NVIDIA GP108M [GeForce MX150] vendor: Huawei driver: N/A
    alternate: nouveau non-free: 535.xx+ status: current (as of 2023-08)
    arch: Maxwell code: GMxxx process: TSMC 28nm built: 2014-19 pcie: gen: 3
    speed: 8 GT/s lanes: 4 bus-ID: 01:00.0 chip-ID: 10de:1d12 class-ID: 0302
  Device-3: Cheng Uei Precision Industry (Foxlink) HD Camera
    driver: uvcvideo type: USB rev: 2.0 speed: 480 Mb/s lanes: 1 mode: 2.0
    bus-ID: 1-7:3 chip-ID: 05c8:03c0 class-ID: 0e02
  Display: wayland server: X.org v: 1.21.1.8 with: Xwayland v: 23.1.2
    compositor: gnome-shell driver: X: loaded: modesetting,nvidia
    unloaded: nouveau alternate: fbdev,nv,vesa dri: iris gpu: i915
    display-ID: 0
  Monitor-1: eDP-1 model-id: JDI 0x422a built: 2017 res: 3000x2000 dpi: 260
    gamma: 1.2 size: 293x196mm (11.54x7.72") diag: 353mm (13.9") ratio: 3:2
    modes: 3000x2000
  API: OpenGL v: 4.6 Mesa 23.1.5 renderer: Mesa Intel UHD Graphics 620 (KBL
    GT2) direct-render: Yes
Audio:
  Device-1: Intel Sunrise Point-LP HD Audio vendor: Huawei
    driver: snd_hda_intel v: kernel alternate: snd_soc_skl, snd_soc_avs,
    snd_sof_pci_intel_skl bus-ID: 00:1f.3 chip-ID: 8086:9d71 class-ID: 0403
  API: ALSA v: k6.1.44-1-MANJARO status: kernel-api with: aoss
    type: oss-emulator tools: alsactl,alsamixer,amixer
  Server-1: JACK v: 1.9.22 status: off tools: N/A
  Server-2: PipeWire v: 0.3.77 status: off tools: pw-cli
  Server-3: PulseAudio v: 16.1 status: active with: pulseaudio-alsa
    type: plugin tools: pacat,pactl
Network:
  Device-1: Intel Wireless 8265 / 8275 vendor: Huawei driver: iwlwifi
    v: kernel pcie: gen: 1 speed: 2.5 GT/s lanes: 1 bus-ID: 3c:00.0
    chip-ID: 8086:24fd class-ID: 0280
  IF: wlp60s0 state: up mac: <filter>
  IP v4: <filter> type: dynamic noprefixroute scope: global
    broadcast: <filter>
  IP v6: <filter> type: noprefixroute scope: link
  WAN IP: <filter>
Bluetooth:
  Device-1: Intel Bluetooth wireless interface driver: btusb v: 0.8 type: USB
    rev: 2.0 speed: 12 Mb/s lanes: 1 mode: 1.1 bus-ID: 1-5:2 chip-ID: 8087:0a2b
    class-ID: e001
  Report: rfkill ID: hci0 rfk-id: 4 state: up address: see --recommends
Logical:
  Message: No logical block device data found.
RAID:
  Message: No RAID data found.
Drives:
  Local Storage: total: 476.94 GiB used: 49.88 GiB (10.5%)
  SMART Message: Required tool smartctl not installed. Check --recommends
  ID-1: /dev/nvme0n1 maj-min: 259:0 vendor: LITE-ON model: CA3-8D512
    size: 476.94 GiB block-size: physical: 512 B logical: 512 B speed: 31.6 Gb/s
    lanes: 4 tech: SSD serial: <filter> fw-rev: C49640A temp: 51.9 C
    scheme: GPT
  Message: No optical or floppy data found.
Partition:
  ID-1: / raw-size: 476.64 GiB size: 468.09 GiB (98.21%)
    used: 49.88 GiB (10.7%) fs: ext4 dev: /dev/nvme0n1p2 maj-min: 259:2
    label: N/A uuid: 914d8524-8547-4bfc-b1f3-bbaf0b28e77f
  ID-2: /boot/efi raw-size: 300 MiB size: 299.4 MiB (99.80%)
    used: 288 KiB (0.1%) fs: vfat dev: /dev/nvme0n1p1 maj-min: 259:1 label: N/A
    uuid: 0720-E3BC
Swap:
  Kernel: swappiness: 60 (default) cache-pressure: 100 (default) zswap: yes
    compressor: zstd max-pool: 20%
  ID-1: swap-1 type: file size: 26 GiB used: 0 KiB (0.0%) priority: -2
    file: /swapfile
Unmounted:
  Message: No unmounted partitions found.
USB:
  Hub-1: 1-0:1 info: hi-speed hub with single TT ports: 12 rev: 2.0
    speed: 480 Mb/s (57.2 MiB/s) lanes: 1 mode: 2.0 chip-ID: 1d6b:0002
    class-ID: 0900
  Device-1: 1-5:2 info: Intel Bluetooth wireless interface type: bluetooth
    driver: btusb interfaces: 2 rev: 2.0 speed: 12 Mb/s (1.4 MiB/s) lanes: 1
    mode: 1.1 power: 100mA chip-ID: 8087:0a2b class-ID: e001
  Device-2: 1-7:3 info: Cheng Uei Precision Industry (Foxlink) HD Camera
    type: video driver: uvcvideo interfaces: 2 rev: 2.0
    speed: 480 Mb/s (57.2 MiB/s) lanes: 1 mode: 2.0 power: 500mA
    chip-ID: 05c8:03c0 class-ID: 0e02
  Hub-2: 2-0:1 info: super-speed hub ports: 6 rev: 3.0
    speed: 5 Gb/s (596.0 MiB/s) lanes: 1 mode: 3.2 gen-1x1 chip-ID: 1d6b:0003
    class-ID: 0900
Sensors:
  System Temperatures: cpu: 58.0 C pch: 55.0 C mobo: N/A
  Fan Speeds (rpm): N/A
Info:
  Processes: 255 Uptime: 1h 26m wakeups: 26 Init: systemd v: 253
  default: graphical tool: systemctl Compilers: gcc: 13.2.1 clang: 15.0.7
  Packages: pm: pacman pkgs: 1340 libs: 429 tools: gnome-software,pamac
  pm: flatpak pkgs: 0 Shell: Zsh v: 5.9 running-in: gnome-terminal
  inxi: 3.3.29

Actually the logs were taken when the suspend-mode was still set to deep, a few lines above you can also see the same line with (deep).

I’m not really sure why the are both logs for deep and s2idle. Seems like the system is trying to suspend with deep-mode first and then with s2idle…?

Device-2: NVIDIA GP108M [GeForce MX150] vendor: Huawei driver: N/A

there is no driver installed and therefore cannot work properly.

ok, so you are also running on wayland, we will test with xorg later…
but as olli points out you dont have nvidia drivers loaded, that means that your gpu is not being managed, so it can overheat and suffer hardware damage…
and write here exactly what you did from the links above, so that it is easier to assist you…

it looks like it, so we will also try with deep kernel parameter, but first explain all the ‘tinkering’ you did…

Sorry guys I uninstalled the NVIDIA drivers to test if the suspend works then (which it did). I’ve now reinstalled the NVIDIA-driver:

Graphics:
  Device-1: Intel UHD Graphics 620 vendor: Huawei driver: i915 v: kernel
    arch: Gen-9.5 process: Intel 14nm built: 2016-20 ports: active: eDP-1
    empty: DP-1,DP-2,HDMI-A-1 bus-ID: 00:02.0 chip-ID: 8086:5917
    class-ID: 0300
  Device-2: NVIDIA GP108M [GeForce MX150] vendor: Huawei driver: nvidia
    v: 535.98 alternate: nouveau,nvidia_drm non-free: 535.xx+
    status: current (as of 2023-08) arch: Maxwell code: GMxxx
    process: TSMC 28nm built: 2014-19 pcie: gen: 1 speed: 2.5 GT/s lanes: 4
    link-max: gen: 3 speed: 8 GT/s bus-ID: 01:00.0 chip-ID: 10de:1d12
    class-ID: 0302
  Device-3: Cheng Uei Precision Industry (Foxlink) HD Camera
    driver: uvcvideo type: USB rev: 2.0 speed: 480 Mb/s lanes: 1 mode: 2.0
    bus-ID: 1-7:3 chip-ID: 05c8:03c0 class-ID: 0e02
  Display: wayland server: X.org v: 1.21.1.8 with: Xwayland v: 23.1.2
    compositor: gnome-shell driver: X: loaded: modesetting,nvidia
    unloaded: nouveau alternate: fbdev,nv,vesa dri: iris gpu: i915
    display-ID: 0
  Monitor-1: eDP-1 model-id: JDI 0x422a built: 2017 res: 3000x2000 dpi: 260
    gamma: 1.2 size: 293x196mm (11.54x7.72") diag: 353mm (13.9") ratio: 3:2
    modes: 3000x2000
  API: OpenGL v: 4.6 Mesa 23.1.5 renderer: Mesa Intel UHD Graphics 620 (KBL
    GT2) direct-render: Yes

Somehow the suspend works now after a re-install of the NVIDIA-driver…

Aug 18 00:48:00 machwx9 systemd-logind[507]: The system will suspend now!
Aug 18 00:48:00 machwx9 ModemManager[565]: <info>  [sleep-monitor-systemd] system is about to suspend
Aug 18 00:48:02 machwx9 systemd[1]: Starting NVIDIA system suspend actions...
Aug 18 00:48:02 machwx9 suspend[4424]: nvidia-suspend.service
Aug 18 00:48:02 machwx9 logger[4424]: <13>Aug 18 00:48:02 suspend: nvidia-suspend.service
Aug 18 00:48:02 machwx9 systemd[1]: nvidia-suspend.service: Deactivated successfully.
Aug 18 00:48:02 machwx9 systemd[1]: Finished NVIDIA system suspend actions.
Aug 18 00:48:02 machwx9 systemd[1]: Starting System Suspend...
Aug 18 00:48:02 machwx9 kernel: PM: suspend entry (deep)
Aug 18 00:48:02 machwx9 systemd-sleep[4430]: Entering sleep state 'suspend'...
Aug 18 00:48:07 machwx9 kernel: printk: Suspending console(s) (use no_console_suspend to debug)
Aug 18 00:48:07 machwx9 kernel: PM: suspend exit
Aug 18 00:48:07 machwx9 systemd[1]: systemd-suspend.service: Deactivated successfully.
Aug 18 00:48:07 machwx9 systemd[1]: Finished System Suspend.
Aug 18 00:48:07 machwx9 systemd[1]: Reached target Suspend.
Aug 18 00:48:07 machwx9 systemd[1]: Stopped target Suspend.
Aug 18 00:48:07 machwx9 logger[4492]: <13>Aug 18 00:48:07 suspend: nvidia-resume.service

so it’s solved if the suspend was your only issue.
i recommend to install optimus-manager and proceed as described here:

Yes thank you very much for your help!

I generally dislike optimus-manager and optimus-switch … partially because they are buggy and partially because people often confuse their usefulness.
(ie - not realizing how prime-run works in the first place and that ‘hybrid’ is already in place)
And optimus-manager is extra special (read extra bad).
(note the multiple extra sections at github where you have to hack at gdm or sddm … ew)

If one actually needs such a thing … then I might suggest one that at least appears slightly less ludicrous.

envycontrol

So sadly the saga isn’t over yet…

I’ve installed optimus-manager now, replaced gdm with gdm-prime, and forced Xorg-sessions. Optimus-manager is currently set to hybrid (seems like the best battery-performance-tradeoff).
Know when I suspend the laptop, it awakes again shortly after, but the keyboard and track-pad are not working. Only the display turns on again. This forces me to hard-reset my system. In the journal you can see that all device are removed, probaply in preperation for the suspend (?).

Aug 18 13:51:57 machwx9 /usr/lib/gdm-x-session[1187]: (**) Option "fd" "54"
Aug 18 13:51:57 machwx9 /usr/lib/gdm-x-session[1187]: (II) event9  - SYNA1D31:00 06CB:CD48 Touchpad: device removed

However, the error from the begining is back again, but now one can also see the trace that seems to be thrown in the nvidia-sleep.sh. Maybe I should move the topic to https://forums.developer.nvidia.com?

Aug 18 13:52:27 machwx9 kernel: ------------[ cut here ]------------
Aug 18 13:52:27 machwx9 kernel: WARNING: CPU: 3 PID: 2485 at /build/linux61-nvidia/src/NVIDIA-Linux-x86_64-535.98-no-compat32/kernel/nvidia/nv.c:3993 nv_set_system_power_state+0x2f2/0x470 [nvidia]
Aug 18 13:52:27 machwx9 kernel: Modules linked in: rfcomm nvidia_drm(POE) nvidia_modeset(POE) nvidia(POE) cmac algif_hash algif_skcipher af_alg snd_sof_pci_intel_skl snd_sof_intel_hda_common soundwire_intel soundwire_generic_allocation soundwire_cadence snd_sof_intel_hda snd_sof_pci snd_sof_xtensa_dsp snd_sof snd_sof_utils soundwire_bus snd_hda_codec_hdmi snd_soc_avs intel_tcc_cooling snd_soc_hda_codec x86_pkg_temp_thermal snd_ctl_led intel_powerclamp snd_hda_codec_realtek snd_soc_skl coretemp snd_hda_codec_generic snd_soc_hdac_hda snd_hda_ext_core kvm_intel snd_soc_sst_ipc qrtr snd_soc_sst_dsp snd_soc_acpi_intel_match kvm snd_soc_acpi irqbypass crct10dif_pclmul snd_soc_core crc32_pclmul joydev polyval_clmulni snd_compress polyval_generic bnep gf128mul ac97_bus ghash_clmulni_intel sha512_ssse3 snd_pcm_dmaengine snd_hda_intel mousedev iwlmvm snd_intel_dspcfg aesni_intel snd_intel_sdw_acpi uvcvideo btusb i915 crypto_simd videobuf2_vmalloc mac80211 processor_thermal_device_pci_legacy videobuf2_memops btrtl
Aug 18 13:52:27 machwx9 kernel:  processor_thermal_device videobuf2_v4l2 cryptd hid_multitouch videobuf2_common spi_pxa2xx_platform iTCO_wdt btbcm snd_hda_codec 8250_dw libarc4 rapl dw_dmac btintel videodev processor_thermal_rfim huawei_wmi iwlwifi snd_hda_core processor_thermal_mbox intel_pmc_bxt ledtrig_audio btmtk intel_cstate snd_hwdep mei_pxp mei_hdcp intel_rapl_msr iTCO_vendor_support wmi_bmof sparse_keymap intel_wmi_thunderbolt mc intel_uncore bluetooth pcspkr drm_buddy snd_pcm cfg80211 ttm snd_timer processor_thermal_rapl ecdh_generic drm_display_helper snd intel_xhci_usb_role_switch intel_lpss_pci mei_me i2c_i801 cec acpi_als intel_rapl_common intel_lpss rfkill soundcore i2c_smbus roles mei i2c_hid_acpi idma64 intel_soc_dts_iosf intel_gtt industrialio_triggered_buffer intel_pch_thermal vfat video kfifo_buf i2c_hid fat int3403_thermal int3400_thermal industrialio int340x_thermal_zone wmi acpi_pad acpi_thermal_rel dptf_power mac_hid uinput crypto_user fuse dm_mod loop bpf_preload ip_tables x_tables
Aug 18 13:52:27 machwx9 kernel:  ext4 crc32c_generic crc16 mbcache jbd2 serio_raw atkbd libps2 vivaldi_fmap nvme nvme_core crc32c_intel xhci_pci nvme_common xhci_pci_renesas i8042 serio
Aug 18 13:52:27 machwx9 kernel: CPU: 3 PID: 2485 Comm: nvidia-sleep.sh Tainted: P           OE      6.1.44-1-MANJARO #1 a5ae25468b7d8bc523fd3843f165bfe956aae550
Aug 18 13:52:27 machwx9 kernel: Hardware name: HUAWEI MACH-WX9/MACH-WX9, BIOS 1.37 05/12/2021
Aug 18 13:52:27 machwx9 kernel: RIP: 0010:nv_set_system_power_state+0x2f2/0x470 [nvidia]
Aug 18 13:52:27 machwx9 kernel: Code: ad e8 05 00 00 48 85 ed 75 d3 e9 d9 fd ff ff 0f 0b e9 13 fe ff ff 48 8b 3d 83 a3 87 03 4c 89 f6 e8 e3 dd 37 d1 e9 12 ff ff ff <0f> 0b 4c 89 ef e8 d4 1d ca d1 4d 85 f6 74 0d e8 1a 73 a8 00 84 c0
Aug 18 13:52:27 machwx9 kernel: RSP: 0018:ffffae40c494bd70 EFLAGS: 00010206
Aug 18 13:52:27 machwx9 kernel: RAX: 000000000000000f RBX: 0000000000000001 RCX: ffffae40c494bcf0
Aug 18 13:52:27 machwx9 kernel: RDX: 0000000000000000 RSI: 0000000000000246 RDI: ffffae40c494bca0
Aug 18 13:52:27 machwx9 kernel: RBP: 000000000000000f R08: 0000000000000000 R09: ffff94af121c5f60
Aug 18 13:52:27 machwx9 kernel: R10: ffffffffc4b4f250 R11: ffffffffc4b4f290 R12: ffff94af1907e000
Aug 18 13:52:27 machwx9 kernel: R13: ffff94af1907e5c0 R14: ffff94af121c3000 R15: ffff94af121c3000
Aug 18 13:52:27 machwx9 kernel: FS:  00007f5640f16000(0000) GS:ffff94b271ec0000(0000) knlGS:0000000000000000
Aug 18 13:52:27 machwx9 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Aug 18 13:52:27 machwx9 kernel: CR2: 000055753be610a8 CR3: 0000000110b4c003 CR4: 00000000003706e0
Aug 18 13:52:27 machwx9 kernel: Call Trace:
Aug 18 13:52:27 machwx9 kernel:  <TASK>
Aug 18 13:52:27 machwx9 kernel:  ? nv_set_system_power_state+0x2f2/0x470 [nvidia c58a6b9ea5529332a979e79b5eff438b4f038a6d]
Aug 18 13:52:27 machwx9 kernel:  ? __warn+0x7d/0xd0
Aug 18 13:52:27 machwx9 kernel:  ? nv_set_system_power_state+0x2f2/0x470 [nvidia c58a6b9ea5529332a979e79b5eff438b4f038a6d]
Aug 18 13:52:27 machwx9 kernel:  ? report_bug+0xe6/0x150
Aug 18 13:52:27 machwx9 kernel:  ? handle_bug+0x3c/0x80
Aug 18 13:52:27 machwx9 kernel:  ? exc_invalid_op+0x17/0x70
Aug 18 13:52:27 machwx9 kernel:  ? asm_exc_invalid_op+0x1a/0x20
Aug 18 13:52:27 machwx9 kernel:  ? nv_set_system_power_state+0x2f2/0x470 [nvidia c58a6b9ea5529332a979e79b5eff438b4f038a6d]
Aug 18 13:52:27 machwx9 kernel:  nv_procfs_write_suspend+0xe8/0x160 [nvidia c58a6b9ea5529332a979e79b5eff438b4f038a6d]
Aug 18 13:52:27 machwx9 kernel:  proc_reg_write+0x57/0xa0
Aug 18 13:52:27 machwx9 kernel:  vfs_write+0xc8/0x3f0
Aug 18 13:52:27 machwx9 kernel:  ksys_write+0x6f/0xf0
Aug 18 13:52:27 machwx9 kernel:  do_syscall_64+0x5d/0x90
Aug 18 13:52:27 machwx9 kernel:  ? do_user_addr_fault+0x237/0x580
Aug 18 13:52:27 machwx9 kernel:  ? exc_page_fault+0x7c/0x180
Aug 18 13:52:27 machwx9 kernel:  entry_SYSCALL_64_after_hwframe+0x69/0xd3
Aug 18 13:52:27 machwx9 kernel: RIP: 0033:0x7f5640d04724
Aug 18 13:52:27 machwx9 kernel: Code: c7 00 16 00 00 00 b8 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 f3 0f 1e fa 80 3d 45 1c 14 00 00 74 13 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 54 c3 0f 1f 00 48 83 ec 28 48 89 54 24 18 48
Aug 18 13:52:27 machwx9 kernel: RSP: 002b:00007ffc8ec4ac58 EFLAGS: 00000202 ORIG_RAX: 0000000000000001
Aug 18 13:52:27 machwx9 kernel: RAX: ffffffffffffffda RBX: 0000000000000008 RCX: 00007f5640d04724
Aug 18 13:52:27 machwx9 kernel: RDX: 0000000000000008 RSI: 000055753be60ca0 RDI: 0000000000000001
Aug 18 13:52:27 machwx9 kernel: RBP: 000055753be60ca0 R08: 0000000000000410 R09: 0000000000000001
Aug 18 13:52:27 machwx9 kernel: R10: 0000000000000004 R11: 0000000000000202 R12: 0000000000000008
Aug 18 13:52:27 machwx9 kernel: R13: 00007f5640e3f5c0 R14: 00007f5640ddd0e0 R15: 0000000000000000
Aug 18 13:52:27 machwx9 kernel:  </TASK>
Aug 18 13:52:27 machwx9 kernel: ---[ end trace 0000000000000000 ]---
[...]
Aug 18 13:52:27 machwx9 systemd[1]: nvidia-suspend.service: Failed with result 'exit-code'.
Aug 18 13:52:27 machwx9 systemd[1]: Failed to start NVIDIA system suspend actions.

Anyway thanks again for your help!

Here is the complete journal:

Aug 18 13:51:52 machwx9 systemd-logind[441]: Power key pressed short.
[...]
Aug 18 13:51:57 machwx9 systemd[1]: Reached target Sleep.
Aug 18 13:51:57 machwx9 systemd[1]: Starting NVIDIA system suspend actions...
Aug 18 13:51:57 machwx9 suspend[2483]: nvidia-suspend.service
Aug 18 13:51:57 machwx9 logger[2483]: <13>Aug 18 13:51:57 suspend: nvidia-suspend.service
Aug 18 13:51:57 machwx9 /usr/lib/gdm-x-session[1187]: (**) Option "fd" "45"
Aug 18 13:51:57 machwx9 /usr/lib/gdm-x-session[1187]: (II) event2  - Power Button: device removed
Aug 18 13:51:57 machwx9 /usr/lib/gdm-x-session[1187]: (**) Option "fd" "48"
Aug 18 13:51:57 machwx9 /usr/lib/gdm-x-session[1187]: (II) event10 - Video Bus: device removed
Aug 18 13:51:57 machwx9 /usr/lib/gdm-x-session[1187]: (**) Option "fd" "49"
Aug 18 13:51:57 machwx9 /usr/lib/gdm-x-session[1187]: (II) event11 - Video Bus: device removed
Aug 18 13:51:57 machwx9 /usr/lib/gdm-x-session[1187]: (**) Option "fd" "50"
Aug 18 13:51:57 machwx9 /usr/lib/gdm-x-session[1187]: (II) event1  - Power Button: device removed
Aug 18 13:51:57 machwx9 /usr/lib/gdm-x-session[1187]: (**) Option "fd" "51"
Aug 18 13:51:57 machwx9 /usr/lib/gdm-x-session[1187]: (II) event6  - HD Camera: HD Camera: device removed
Aug 18 13:51:57 machwx9 /usr/lib/gdm-x-session[1187]: (**) Option "fd" "52"
Aug 18 13:51:57 machwx9 /usr/lib/gdm-x-session[1187]: (II) event5  - SYNA2393:00 06CB:19AC: device removed
Aug 18 13:51:57 machwx9 /usr/lib/gdm-x-session[1187]: (**) Option "fd" "53"
Aug 18 13:51:57 machwx9 /usr/lib/gdm-x-session[1187]: (II) event7  - SYNA1D31:00 06CB:CD48 Mouse: device removed
Aug 18 13:51:57 machwx9 /usr/lib/gdm-x-session[1187]: (**) Option "fd" "54"
Aug 18 13:51:57 machwx9 /usr/lib/gdm-x-session[1187]: (II) event9  - SYNA1D31:00 06CB:CD48 Touchpad: device removed
Aug 18 13:51:57 machwx9 /usr/lib/gdm-x-session[1187]: (**) Option "fd" "55"
Aug 18 13:51:57 machwx9 /usr/lib/gdm-x-session[1187]: (II) event8  - Huawei WMI hotkeys: device removed
Aug 18 13:51:57 machwx9 /usr/lib/gdm-x-session[1187]: (**) Option "fd" "56"
Aug 18 13:51:57 machwx9 /usr/lib/gdm-x-session[1187]: (II) event3  - AT Translated Set 2 keyboard: device removed
Aug 18 13:51:57 machwx9 /usr/lib/gdm-x-session[1187]: (II) AIGLX: Suspending AIGLX clients for VT switch
Aug 18 13:52:00 machwx9 /usr/lib/gdm-x-session[1187]: (EE) NVIDIA(GPU-0): WAIT (2, 8, 0x8000, 0x00000898, 0x000008b4)
Aug 18 13:52:02 machwx9 systemd[1]: NetworkManager-dispatcher.service: Deactivated successfully.
Aug 18 13:52:07 machwx9 /usr/lib/gdm-x-session[1187]: (EE) NVIDIA(GPU-0): WAIT (1, 8, 0x8000, 0x00000898, 0x000008b4)
Aug 18 13:52:10 machwx9 /usr/lib/gdm-x-session[1187]: (EE) NVIDIA(GPU-0): WAIT (2, 8, 0x8000, 0x00000898, 0x000008f8)
Aug 18 13:52:13 machwx9 dbus-daemon[1182]: [session uid=1000 pid=1182] Failed to activate service 'org.gnome.Shell.Screencast': timed out (service_start_timeout=120000ms)
Aug 18 13:52:14 machwx9 dbus-daemon[1182]: [session uid=1000 pid=1182] Activating service name='org.gnome.DejaDup' requested by ':1.64' (uid=1000 pid=1693 comm="/usr/lib/deja-dup/deja-dup-monitor")
Aug 18 13:52:15 machwx9 dbus-daemon[1182]: [session uid=1000 pid=1182] Successfully activated service 'org.gnome.DejaDup'
Aug 18 13:52:17 machwx9 /usr/lib/gdm-x-session[1187]: (EE) NVIDIA(GPU-0): WAIT (1, 8, 0x8000, 0x00000898, 0x000008f8)
Aug 18 13:52:20 machwx9 /usr/lib/gdm-x-session[1187]: (EE) NVIDIA(GPU-0): WAIT (2, 8, 0x8000, 0x00000898, 0x00000900)
Aug 18 13:52:27 machwx9 /usr/lib/gdm-x-session[1187]: (EE) NVIDIA(GPU-0): WAIT (1, 8, 0x8000, 0x00000898, 0x00000900)
Aug 18 13:52:27 machwx9 kernel: ------------[ cut here ]------------
Aug 18 13:52:27 machwx9 kernel: WARNING: CPU: 3 PID: 2485 at /build/linux61-nvidia/src/NVIDIA-Linux-x86_64-535.98-no-compat32/kernel/nvidia/nv.c:3993 nv_set_system_power_state+0x2f2/0x470 [nvidia]
Aug 18 13:52:27 machwx9 kernel: Modules linked in: rfcomm nvidia_drm(POE) nvidia_modeset(POE) nvidia(POE) cmac algif_hash algif_skcipher af_alg snd_sof_pci_intel_skl snd_sof_intel_hda_common soundwire_intel soundwire_generic_allocation soundwire_cadence snd_sof_intel_hda snd_sof_pci snd_sof_xtensa_dsp snd_sof snd_sof_utils soundwire_bus snd_hda_codec_hdmi snd_soc_avs intel_tcc_cooling snd_soc_hda_codec x86_pkg_temp_thermal snd_ctl_led intel_powerclamp snd_hda_codec_realtek snd_soc_skl coretemp snd_hda_codec_generic snd_soc_hdac_hda snd_hda_ext_core kvm_intel snd_soc_sst_ipc qrtr snd_soc_sst_dsp snd_soc_acpi_intel_match kvm snd_soc_acpi irqbypass crct10dif_pclmul snd_soc_core crc32_pclmul joydev polyval_clmulni snd_compress polyval_generic bnep gf128mul ac97_bus ghash_clmulni_intel sha512_ssse3 snd_pcm_dmaengine snd_hda_intel mousedev iwlmvm snd_intel_dspcfg aesni_intel snd_intel_sdw_acpi uvcvideo btusb i915 crypto_simd videobuf2_vmalloc mac80211 processor_thermal_device_pci_legacy videobuf2_memops btrtl
Aug 18 13:52:27 machwx9 kernel:  processor_thermal_device videobuf2_v4l2 cryptd hid_multitouch videobuf2_common spi_pxa2xx_platform iTCO_wdt btbcm snd_hda_codec 8250_dw libarc4 rapl dw_dmac btintel videodev processor_thermal_rfim huawei_wmi iwlwifi snd_hda_core processor_thermal_mbox intel_pmc_bxt ledtrig_audio btmtk intel_cstate snd_hwdep mei_pxp mei_hdcp intel_rapl_msr iTCO_vendor_support wmi_bmof sparse_keymap intel_wmi_thunderbolt mc intel_uncore bluetooth pcspkr drm_buddy snd_pcm cfg80211 ttm snd_timer processor_thermal_rapl ecdh_generic drm_display_helper snd intel_xhci_usb_role_switch intel_lpss_pci mei_me i2c_i801 cec acpi_als intel_rapl_common intel_lpss rfkill soundcore i2c_smbus roles mei i2c_hid_acpi idma64 intel_soc_dts_iosf intel_gtt industrialio_triggered_buffer intel_pch_thermal vfat video kfifo_buf i2c_hid fat int3403_thermal int3400_thermal industrialio int340x_thermal_zone wmi acpi_pad acpi_thermal_rel dptf_power mac_hid uinput crypto_user fuse dm_mod loop bpf_preload ip_tables x_tables
Aug 18 13:52:27 machwx9 kernel:  ext4 crc32c_generic crc16 mbcache jbd2 serio_raw atkbd libps2 vivaldi_fmap nvme nvme_core crc32c_intel xhci_pci nvme_common xhci_pci_renesas i8042 serio
Aug 18 13:52:27 machwx9 kernel: CPU: 3 PID: 2485 Comm: nvidia-sleep.sh Tainted: P           OE      6.1.44-1-MANJARO #1 a5ae25468b7d8bc523fd3843f165bfe956aae550
Aug 18 13:52:27 machwx9 kernel: Hardware name: HUAWEI MACH-WX9/MACH-WX9, BIOS 1.37 05/12/2021
Aug 18 13:52:27 machwx9 kernel: RIP: 0010:nv_set_system_power_state+0x2f2/0x470 [nvidia]
Aug 18 13:52:27 machwx9 kernel: Code: ad e8 05 00 00 48 85 ed 75 d3 e9 d9 fd ff ff 0f 0b e9 13 fe ff ff 48 8b 3d 83 a3 87 03 4c 89 f6 e8 e3 dd 37 d1 e9 12 ff ff ff <0f> 0b 4c 89 ef e8 d4 1d ca d1 4d 85 f6 74 0d e8 1a 73 a8 00 84 c0
Aug 18 13:52:27 machwx9 kernel: RSP: 0018:ffffae40c494bd70 EFLAGS: 00010206
Aug 18 13:52:27 machwx9 kernel: RAX: 000000000000000f RBX: 0000000000000001 RCX: ffffae40c494bcf0
Aug 18 13:52:27 machwx9 kernel: RDX: 0000000000000000 RSI: 0000000000000246 RDI: ffffae40c494bca0
Aug 18 13:52:27 machwx9 kernel: RBP: 000000000000000f R08: 0000000000000000 R09: ffff94af121c5f60
Aug 18 13:52:27 machwx9 kernel: R10: ffffffffc4b4f250 R11: ffffffffc4b4f290 R12: ffff94af1907e000
Aug 18 13:52:27 machwx9 kernel: R13: ffff94af1907e5c0 R14: ffff94af121c3000 R15: ffff94af121c3000
Aug 18 13:52:27 machwx9 kernel: FS:  00007f5640f16000(0000) GS:ffff94b271ec0000(0000) knlGS:0000000000000000
Aug 18 13:52:27 machwx9 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Aug 18 13:52:27 machwx9 kernel: CR2: 000055753be610a8 CR3: 0000000110b4c003 CR4: 00000000003706e0
Aug 18 13:52:27 machwx9 kernel: Call Trace:
Aug 18 13:52:27 machwx9 kernel:  <TASK>
Aug 18 13:52:27 machwx9 kernel:  ? nv_set_system_power_state+0x2f2/0x470 [nvidia c58a6b9ea5529332a979e79b5eff438b4f038a6d]
Aug 18 13:52:27 machwx9 kernel:  ? __warn+0x7d/0xd0
Aug 18 13:52:27 machwx9 kernel:  ? nv_set_system_power_state+0x2f2/0x470 [nvidia c58a6b9ea5529332a979e79b5eff438b4f038a6d]
Aug 18 13:52:27 machwx9 kernel:  ? report_bug+0xe6/0x150
Aug 18 13:52:27 machwx9 kernel:  ? handle_bug+0x3c/0x80
Aug 18 13:52:27 machwx9 kernel:  ? exc_invalid_op+0x17/0x70
Aug 18 13:52:27 machwx9 kernel:  ? asm_exc_invalid_op+0x1a/0x20
Aug 18 13:52:27 machwx9 kernel:  ? nv_set_system_power_state+0x2f2/0x470 [nvidia c58a6b9ea5529332a979e79b5eff438b4f038a6d]
Aug 18 13:52:27 machwx9 kernel:  nv_procfs_write_suspend+0xe8/0x160 [nvidia c58a6b9ea5529332a979e79b5eff438b4f038a6d]
Aug 18 13:52:27 machwx9 kernel:  proc_reg_write+0x57/0xa0
Aug 18 13:52:27 machwx9 kernel:  vfs_write+0xc8/0x3f0
Aug 18 13:52:27 machwx9 kernel:  ksys_write+0x6f/0xf0
Aug 18 13:52:27 machwx9 kernel:  do_syscall_64+0x5d/0x90
Aug 18 13:52:27 machwx9 kernel:  ? do_user_addr_fault+0x237/0x580
Aug 18 13:52:27 machwx9 kernel:  ? exc_page_fault+0x7c/0x180
Aug 18 13:52:27 machwx9 kernel:  entry_SYSCALL_64_after_hwframe+0x69/0xd3
Aug 18 13:52:27 machwx9 kernel: RIP: 0033:0x7f5640d04724
Aug 18 13:52:27 machwx9 kernel: Code: c7 00 16 00 00 00 b8 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 f3 0f 1e fa 80 3d 45 1c 14 00 00 74 13 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 54 c3 0f 1f 00 48 83 ec 28 48 89 54 24 18 48
Aug 18 13:52:27 machwx9 kernel: RSP: 002b:00007ffc8ec4ac58 EFLAGS: 00000202 ORIG_RAX: 0000000000000001
Aug 18 13:52:27 machwx9 kernel: RAX: ffffffffffffffda RBX: 0000000000000008 RCX: 00007f5640d04724
Aug 18 13:52:27 machwx9 kernel: RDX: 0000000000000008 RSI: 000055753be60ca0 RDI: 0000000000000001
Aug 18 13:52:27 machwx9 kernel: RBP: 000055753be60ca0 R08: 0000000000000410 R09: 0000000000000001
Aug 18 13:52:27 machwx9 kernel: R10: 0000000000000004 R11: 0000000000000202 R12: 0000000000000008
Aug 18 13:52:27 machwx9 kernel: R13: 00007f5640e3f5c0 R14: 00007f5640ddd0e0 R15: 0000000000000000
Aug 18 13:52:27 machwx9 kernel:  </TASK>
Aug 18 13:52:27 machwx9 kernel: ---[ end trace 0000000000000000 ]---
Aug 18 13:52:27 machwx9 kernel: ------------[ cut here ]------------
Aug 18 13:52:27 machwx9 kernel: WARNING: CPU: 3 PID: 2485 at /build/linux61-nvidia/src/NVIDIA-Linux-x86_64-535.98-no-compat32/kernel/nvidia/nv.c:4206 nv_set_system_power_state+0x30e/0x470 [nvidia]
Aug 18 13:52:27 machwx9 kernel: Modules linked in: rfcomm nvidia_drm(POE) nvidia_modeset(POE) nvidia(POE) cmac algif_hash algif_skcipher af_alg snd_sof_pci_intel_skl snd_sof_intel_hda_common soundwire_intel soundwire_generic_allocation soundwire_cadence snd_sof_intel_hda snd_sof_pci snd_sof_xtensa_dsp snd_sof snd_sof_utils soundwire_bus snd_hda_codec_hdmi snd_soc_avs intel_tcc_cooling snd_soc_hda_codec x86_pkg_temp_thermal snd_ctl_led intel_powerclamp snd_hda_codec_realtek snd_soc_skl coretemp snd_hda_codec_generic snd_soc_hdac_hda snd_hda_ext_core kvm_intel snd_soc_sst_ipc qrtr snd_soc_sst_dsp snd_soc_acpi_intel_match kvm snd_soc_acpi irqbypass crct10dif_pclmul snd_soc_core crc32_pclmul joydev polyval_clmulni snd_compress polyval_generic bnep gf128mul ac97_bus ghash_clmulni_intel sha512_ssse3 snd_pcm_dmaengine snd_hda_intel mousedev iwlmvm snd_intel_dspcfg aesni_intel snd_intel_sdw_acpi uvcvideo btusb i915 crypto_simd videobuf2_vmalloc mac80211 processor_thermal_device_pci_legacy videobuf2_memops btrtl
Aug 18 13:52:27 machwx9 kernel:  processor_thermal_device videobuf2_v4l2 cryptd hid_multitouch videobuf2_common spi_pxa2xx_platform iTCO_wdt btbcm snd_hda_codec 8250_dw libarc4 rapl dw_dmac btintel videodev processor_thermal_rfim huawei_wmi iwlwifi snd_hda_core processor_thermal_mbox intel_pmc_bxt ledtrig_audio btmtk intel_cstate snd_hwdep mei_pxp mei_hdcp intel_rapl_msr iTCO_vendor_support wmi_bmof sparse_keymap intel_wmi_thunderbolt mc intel_uncore bluetooth pcspkr drm_buddy snd_pcm cfg80211 ttm snd_timer processor_thermal_rapl ecdh_generic drm_display_helper snd intel_xhci_usb_role_switch intel_lpss_pci mei_me i2c_i801 cec acpi_als intel_rapl_common intel_lpss rfkill soundcore i2c_smbus roles mei i2c_hid_acpi idma64 intel_soc_dts_iosf intel_gtt industrialio_triggered_buffer intel_pch_thermal vfat video kfifo_buf i2c_hid fat int3403_thermal int3400_thermal industrialio int340x_thermal_zone wmi acpi_pad acpi_thermal_rel dptf_power mac_hid uinput crypto_user fuse dm_mod loop bpf_preload ip_tables x_tables
Aug 18 13:52:27 machwx9 kernel:  ext4 crc32c_generic crc16 mbcache jbd2 serio_raw atkbd libps2 vivaldi_fmap nvme nvme_core crc32c_intel xhci_pci nvme_common xhci_pci_renesas i8042 serio
Aug 18 13:52:27 machwx9 kernel: CPU: 3 PID: 2485 Comm: nvidia-sleep.sh Tainted: P        W  OE      6.1.44-1-MANJARO #1 a5ae25468b7d8bc523fd3843f165bfe956aae550
Aug 18 13:52:27 machwx9 kernel: Hardware name: HUAWEI MACH-WX9/MACH-WX9, BIOS 1.37 05/12/2021
Aug 18 13:52:27 machwx9 kernel: RIP: 0010:nv_set_system_power_state+0x30e/0x470 [nvidia]
Aug 18 13:52:27 machwx9 kernel: Code: 03 4c 89 f6 e8 e3 dd 37 d1 e9 12 ff ff ff 0f 0b 4c 89 ef e8 d4 1d ca d1 4d 85 f6 74 0d e8 1a 73 a8 00 84 c0 0f 85 3a 01 00 00 <0f> 0b 48 c7 c7 c0 c1 22 c5 45 31 ed e8 b1 1d ca d1 48 c7 c7 c0 c1
Aug 18 13:52:27 machwx9 kernel: RSP: 0018:ffffae40c494bd70 EFLAGS: 00010202
Aug 18 13:52:27 machwx9 kernel: RAX: ffff94af121c0001 RBX: 0000000000000001 RCX: 0000000000013003
Aug 18 13:52:27 machwx9 kernel: RDX: 0000000000012e03 RSI: 2332632241cf17e4 RDI: 0000398e4de234a0
Aug 18 13:52:27 machwx9 kernel: RBP: 000000000000000f R08: 0000000000000000 R09: ffff94af121c5f60
Aug 18 13:52:27 machwx9 kernel: R10: ffffffffc4b4f250 R11: ffffffffc4b4f290 R12: ffff94af1907e000
Aug 18 13:52:27 machwx9 kernel: R13: ffff94af1907e5c0 R14: ffff94af121c3000 R15: ffff94af121c3000
Aug 18 13:52:27 machwx9 kernel: FS:  00007f5640f16000(0000) GS:ffff94b271ec0000(0000) knlGS:0000000000000000
Aug 18 13:52:27 machwx9 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Aug 18 13:52:27 machwx9 kernel: CR2: 000055753be610a8 CR3: 0000000110b4c003 CR4: 00000000003706e0
Aug 18 13:52:27 machwx9 kernel: Call Trace:
Aug 18 13:52:27 machwx9 kernel:  <TASK>
Aug 18 13:52:27 machwx9 kernel:  ? nv_set_system_power_state+0x30e/0x470 [nvidia c58a6b9ea5529332a979e79b5eff438b4f038a6d]
Aug 18 13:52:27 machwx9 kernel:  ? __warn+0x7d/0xd0
Aug 18 13:52:27 machwx9 kernel:  ? nv_set_system_power_state+0x30e/0x470 [nvidia c58a6b9ea5529332a979e79b5eff438b4f038a6d]
Aug 18 13:52:27 machwx9 kernel:  ? report_bug+0xe6/0x150
Aug 18 13:52:27 machwx9 kernel:  ? handle_bug+0x3c/0x80
Aug 18 13:52:27 machwx9 kernel:  ? exc_invalid_op+0x17/0x70
Aug 18 13:52:27 machwx9 kernel:  ? asm_exc_invalid_op+0x1a/0x20
Aug 18 13:52:27 machwx9 kernel: rfkill: input handler enabled
Aug 18 13:52:27 machwx9 kernel:  ? nv_set_system_power_state+0x30e/0x470 [nvidia c58a6b9ea5529332a979e79b5eff438b4f038a6d]
Aug 18 13:52:27 machwx9 kernel:  nv_procfs_write_suspend+0xe8/0x160 [nvidia c58a6b9ea5529332a979e79b5eff438b4f038a6d]
Aug 18 13:52:27 machwx9 kernel:  proc_reg_write+0x57/0xa0
Aug 18 13:52:27 machwx9 kernel:  vfs_write+0xc8/0x3f0
Aug 18 13:52:27 machwx9 kernel:  ksys_write+0x6f/0xf0
Aug 18 13:52:27 machwx9 kernel:  do_syscall_64+0x5d/0x90
Aug 18 13:52:27 machwx9 kernel:  ? do_user_addr_fault+0x237/0x580
Aug 18 13:52:27 machwx9 kernel:  ? exc_page_fault+0x7c/0x180
Aug 18 13:52:27 machwx9 kernel:  entry_SYSCALL_64_after_hwframe+0x69/0xd3
Aug 18 13:52:27 machwx9 kernel: RIP: 0033:0x7f5640d04724
Aug 18 13:52:27 machwx9 kernel: Code: c7 00 16 00 00 00 b8 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 f3 0f 1e fa 80 3d 45 1c 14 00 00 74 13 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 54 c3 0f 1f 00 48 83 ec 28 48 89 54 24 18 48
Aug 18 13:52:27 machwx9 kernel: RSP: 002b:00007ffc8ec4ac58 EFLAGS: 00000202 ORIG_RAX: 0000000000000001
Aug 18 13:52:27 machwx9 kernel: RAX: ffffffffffffffda RBX: 0000000000000008 RCX: 00007f5640d04724
Aug 18 13:52:27 machwx9 kernel: RDX: 0000000000000008 RSI: 000055753be60ca0 RDI: 0000000000000001
Aug 18 13:52:27 machwx9 kernel: RBP: 000055753be60ca0 R08: 0000000000000410 R09: 0000000000000001
Aug 18 13:52:27 machwx9 kernel: R10: 0000000000000004 R11: 0000000000000202 R12: 0000000000000008
Aug 18 13:52:27 machwx9 kernel: R13: 00007f5640e3f5c0 R14: 00007f5640ddd0e0 R15: 0000000000000000
Aug 18 13:52:27 machwx9 kernel:  </TASK>
Aug 18 13:52:27 machwx9 kernel: ---[ end trace 0000000000000000 ]---
Aug 18 13:52:27 machwx9 kernel: ------------[ cut here ]------------
Aug 18 13:52:27 machwx9 kernel: WARNING: CPU: 3 PID: 2485 at /build/linux61-nvidia/src/NVIDIA-Linux-x86_64-535.98-no-compat32/kernel/nvidia/nv.c:3943 nv_restore_user_channels+0x4e/0x1e0 [nvidia]
Aug 18 13:52:27 machwx9 kernel: Modules linked in: rfcomm nvidia_drm(POE) nvidia_modeset(POE) nvidia(POE) cmac algif_hash algif_skcipher af_alg snd_sof_pci_intel_skl snd_sof_intel_hda_common soundwire_intel soundwire_generic_allocation soundwire_cadence snd_sof_intel_hda snd_sof_pci snd_sof_xtensa_dsp snd_sof snd_sof_utils soundwire_bus snd_hda_codec_hdmi snd_soc_avs intel_tcc_cooling snd_soc_hda_codec x86_pkg_temp_thermal snd_ctl_led intel_powerclamp snd_hda_codec_realtek snd_soc_skl coretemp snd_hda_codec_generic snd_soc_hdac_hda snd_hda_ext_core kvm_intel snd_soc_sst_ipc qrtr snd_soc_sst_dsp snd_soc_acpi_intel_match kvm snd_soc_acpi irqbypass crct10dif_pclmul snd_soc_core crc32_pclmul joydev polyval_clmulni snd_compress polyval_generic bnep gf128mul ac97_bus ghash_clmulni_intel sha512_ssse3 snd_pcm_dmaengine snd_hda_intel mousedev iwlmvm snd_intel_dspcfg aesni_intel snd_intel_sdw_acpi uvcvideo btusb i915 crypto_simd videobuf2_vmalloc mac80211 processor_thermal_device_pci_legacy videobuf2_memops btrtl
Aug 18 13:52:27 machwx9 kernel:  processor_thermal_device videobuf2_v4l2 cryptd hid_multitouch videobuf2_common spi_pxa2xx_platform iTCO_wdt btbcm snd_hda_codec 8250_dw libarc4 rapl dw_dmac btintel videodev processor_thermal_rfim huawei_wmi iwlwifi snd_hda_core processor_thermal_mbox intel_pmc_bxt ledtrig_audio btmtk intel_cstate snd_hwdep mei_pxp mei_hdcp intel_rapl_msr iTCO_vendor_support wmi_bmof sparse_keymap intel_wmi_thunderbolt mc intel_uncore bluetooth pcspkr drm_buddy snd_pcm cfg80211 ttm snd_timer processor_thermal_rapl ecdh_generic drm_display_helper snd intel_xhci_usb_role_switch intel_lpss_pci mei_me i2c_i801 cec acpi_als intel_rapl_common intel_lpss rfkill soundcore i2c_smbus roles mei i2c_hid_acpi idma64 intel_soc_dts_iosf intel_gtt industrialio_triggered_buffer intel_pch_thermal vfat video kfifo_buf i2c_hid fat int3403_thermal int3400_thermal industrialio int340x_thermal_zone wmi acpi_pad acpi_thermal_rel dptf_power mac_hid uinput crypto_user fuse dm_mod loop bpf_preload ip_tables x_tables
Aug 18 13:52:27 machwx9 kernel:  ext4 crc32c_generic crc16 mbcache jbd2 serio_raw atkbd libps2 vivaldi_fmap nvme nvme_core crc32c_intel xhci_pci nvme_common xhci_pci_renesas i8042 serio
Aug 18 13:52:27 machwx9 kernel: CPU: 3 PID: 2485 Comm: nvidia-sleep.sh Tainted: P        W  OE      6.1.44-1-MANJARO #1 a5ae25468b7d8bc523fd3843f165bfe956aae550
Aug 18 13:52:27 machwx9 kernel: Hardware name: HUAWEI MACH-WX9/MACH-WX9, BIOS 1.37 05/12/2021
Aug 18 13:52:27 machwx9 kernel: RIP: 0010:nv_restore_user_channels+0x4e/0x1e0 [nvidia]
Aug 18 13:52:27 machwx9 kernel: Code: 24 c0 05 00 00 4c 89 ef e8 ff 55 ca d1 f6 43 10 01 74 73 48 89 de 31 ff e8 6f 49 a8 00 41 89 c6 85 c0 0f 84 4b 01 00 00 31 ed <0f> 0b 49 81 c4 e8 06 00 00 4c 89 e7 e8 d1 55 ca d1 be 01 00 00 00
Aug 18 13:52:27 machwx9 kernel: RSP: 0018:ffffae40c494bd40 EFLAGS: 00010206
Aug 18 13:52:27 machwx9 kernel: RAX: 000000000000000f RBX: ffff94af1907e000 RCX: ffffae40c494bcc0
Aug 18 13:52:27 machwx9 kernel: RDX: 0000000000000000 RSI: 0000000000000246 RDI: ffffae40c494bc70
Aug 18 13:52:27 machwx9 kernel: RBP: ffff94af121c3000 R08: 0000000000000000 R09: ffff94af121c5f60
Aug 18 13:52:27 machwx9 kernel: R10: ffffffffc4b4f250 R11: ffffffffc4b4f290 R12: ffff94af1907e000
Aug 18 13:52:27 machwx9 kernel: R13: ffff94af1907e5c0 R14: 000000000000000f R15: ffff94af121c3000
Aug 18 13:52:27 machwx9 kernel: FS:  00007f5640f16000(0000) GS:ffff94b271ec0000(0000) knlGS:0000000000000000
Aug 18 13:52:27 machwx9 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Aug 18 13:52:27 machwx9 kernel: CR2: 000055753be610a8 CR3: 0000000110b4c003 CR4: 00000000003706e0
Aug 18 13:52:27 machwx9 kernel: Call Trace:
Aug 18 13:52:27 machwx9 kernel:  <TASK>
Aug 18 13:52:27 machwx9 kernel:  ? nv_restore_user_channels+0x4e/0x1e0 [nvidia c58a6b9ea5529332a979e79b5eff438b4f038a6d]
Aug 18 13:52:27 machwx9 kernel:  ? __warn+0x7d/0xd0
Aug 18 13:52:27 machwx9 kernel:  ? nv_restore_user_channels+0x4e/0x1e0 [nvidia c58a6b9ea5529332a979e79b5eff438b4f038a6d]
Aug 18 13:52:27 machwx9 kernel:  ? report_bug+0xe6/0x150
Aug 18 13:52:27 machwx9 kernel:  ? handle_bug+0x3c/0x80
Aug 18 13:52:27 machwx9 kernel:  ? exc_invalid_op+0x17/0x70
Aug 18 13:52:27 machwx9 kernel:  ? asm_exc_invalid_op+0x1a/0x20
Aug 18 13:52:27 machwx9 kernel:  ? nv_restore_user_channels+0x4e/0x1e0 [nvidia c58a6b9ea5529332a979e79b5eff438b4f038a6d]
Aug 18 13:52:27 machwx9 kernel:  nv_set_system_power_state+0x344/0x470 [nvidia c58a6b9ea5529332a979e79b5eff438b4f038a6d]
Aug 18 13:52:27 machwx9 kernel:  nv_procfs_write_suspend+0xe8/0x160 [nvidia c58a6b9ea5529332a979e79b5eff438b4f038a6d]
Aug 18 13:52:27 machwx9 kernel:  proc_reg_write+0x57/0xa0
Aug 18 13:52:27 machwx9 kernel:  vfs_write+0xc8/0x3f0
Aug 18 13:52:27 machwx9 kernel:  ksys_write+0x6f/0xf0
Aug 18 13:52:27 machwx9 kernel:  do_syscall_64+0x5d/0x90
Aug 18 13:52:27 machwx9 kernel:  ? do_user_addr_fault+0x237/0x580
Aug 18 13:52:27 machwx9 kernel:  ? exc_page_fault+0x7c/0x180
Aug 18 13:52:27 machwx9 kernel:  entry_SYSCALL_64_after_hwframe+0x69/0xd3
Aug 18 13:52:27 machwx9 kernel: RIP: 0033:0x7f5640d04724
Aug 18 13:52:27 machwx9 kernel: Code: c7 00 16 00 00 00 b8 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 f3 0f 1e fa 80 3d 45 1c 14 00 00 74 13 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 54 c3 0f 1f 00 48 83 ec 28 48 89 54 24 18 48
Aug 18 13:52:27 machwx9 kernel: RSP: 002b:00007ffc8ec4ac58 EFLAGS: 00000202 ORIG_RAX: 0000000000000001
Aug 18 13:52:27 machwx9 kernel: RAX: ffffffffffffffda RBX: 0000000000000008 RCX: 00007f5640d04724
Aug 18 13:52:27 machwx9 kernel: RDX: 0000000000000008 RSI: 000055753be60ca0 RDI: 0000000000000001
Aug 18 13:52:27 machwx9 kernel: RBP: 000055753be60ca0 R08: 0000000000000410 R09: 0000000000000001
Aug 18 13:52:27 machwx9 kernel: R10: 0000000000000004 R11: 0000000000000202 R12: 0000000000000008
Aug 18 13:52:27 machwx9 kernel: R13: 00007f5640e3f5c0 R14: 00007f5640ddd0e0 R15: 0000000000000000
Aug 18 13:52:27 machwx9 kernel:  </TASK>
Aug 18 13:52:27 machwx9 kernel: ---[ end trace 0000000000000000 ]---
Aug 18 13:52:27 machwx9 org.gnome.DejaDup[2497]: libEGL warning: DRI3: Screen seems not DRI3 capable
Aug 18 13:52:27 machwx9 org.gnome.DejaDup[2497]: libEGL warning: failed to open /dev/dri/card0: Permission denied
Aug 18 13:52:27 machwx9 org.gnome.DejaDup[2497]: libEGL warning: DRI2: could not open /dev/dri/card0 (Permission denied)
Aug 18 13:52:27 machwx9 systemd[1]: nvidia-suspend.service: Main process exited, code=exited, status=1/FAILURE
Aug 18 13:52:27 machwx9 nvidia-sleep.sh[2485]: /usr/bin/nvidia-sleep.sh: line 20: echo: write error: Input/output error
Aug 18 13:52:27 machwx9 /usr/lib/gdm-x-session[1187]: (II) systemd-logind: got pause for 13:66
Aug 18 13:52:27 machwx9 /usr/lib/gdm-x-session[1187]: (II) systemd-logind: got pause for 13:73
Aug 18 13:52:27 machwx9 /usr/lib/gdm-x-session[1187]: (II) systemd-logind: got pause for 13:69
Aug 18 13:52:27 machwx9 /usr/lib/gdm-x-session[1187]: (II) systemd-logind: got pause for 13:65
Aug 18 13:52:27 machwx9 /usr/lib/gdm-x-session[1187]: (II) systemd-logind: got pause for 13:75
Aug 18 13:52:27 machwx9 /usr/lib/gdm-x-session[1187]: (II) systemd-logind: got pause for 13:70
Aug 18 13:52:27 machwx9 /usr/lib/gdm-x-session[1187]: (II) systemd-logind: got pause for 13:74
Aug 18 13:52:27 machwx9 /usr/lib/gdm-x-session[1187]: (II) systemd-logind: got pause for 13:71
Aug 18 13:52:27 machwx9 /usr/lib/gdm-x-session[1187]: (II) systemd-logind: got pause for 13:72
Aug 18 13:52:27 machwx9 /usr/lib/gdm-x-session[1187]: (II) systemd-logind: got pause for 13:67
Aug 18 13:52:27 machwx9 systemd[1]: nvidia-suspend.service: Failed with result 'exit-code'.
Aug 18 13:52:27 machwx9 systemd[1]: Failed to start NVIDIA system suspend actions.
Aug 18 13:52:27 machwx9 systemd[1]: Starting System Suspend...
Aug 18 13:52:27 machwx9 gsd-power[1581]: Release of light sensors failed: GDBus.Error:org.freedesktop.DBus.Error.AccessDenied: Not Authorized: Sensor claim not allowed
Aug 18 13:52:27 machwx9 systemd-sleep[2505]: [13] INFO: # Pre-suspend hook
Aug 18 13:52:27 machwx9 systemd-sleep[2505]: [13] INFO: Previous state was: {'type': 'done', 'switch_id': '20230818T134946', 'current_mode': 'hybrid'}
Aug 18 13:52:27 machwx9 systemd-sleep[2505]: [15] INFO: Switching option: none
Aug 18 13:52:27 machwx9 systemd-sleep[2505]: [15] INFO: Nothing to do
Aug 18 13:52:27 machwx9 systemd-sleep[2505]: [16] INFO: Writing state {'type': 'pending_post_resume', 'switch_id': '20230818T134946', 'current_mode': 'hybrid'}
Aug 18 13:52:27 machwx9 systemd-sleep[2505]: [16] INFO: Pre-suspend hook completed successfully.
Aug 18 13:52:27 machwx9 systemd-sleep[2502]: Entering sleep state 'suspend'...
Aug 18 13:52:27 machwx9 kernel: PM: suspend entry (deep)
Aug 18 13:52:28 machwx9 kernel: Filesystems sync: 0.006 seconds
Aug 18 13:52:28 machwx9 kernel: Freezing user space processes
Aug 18 13:52:28 machwx9 kernel: Freezing user space processes completed (elapsed 0.002 seconds)
Aug 18 13:52:28 machwx9 kernel: OOM killer disabled.
Aug 18 13:52:28 machwx9 kernel: Freezing remaining freezable tasks
Aug 18 13:52:28 machwx9 kernel: Freezing remaining freezable tasks completed (elapsed 0.001 seconds)
Aug 18 13:52:28 machwx9 kernel: printk: Suspending console(s) (use no_console_suspend to debug)
Aug 18 13:52:28 machwx9 kernel: nvidia 0000:01:00.0: PM: pci_pm_suspend(): nv_pmops_suspend+0x0/0x30 [nvidia] returns -5
Aug 18 13:52:28 machwx9 kernel: nvidia 0000:01:00.0: PM: dpm_run_callback(): pci_pm_suspend+0x0/0x170 returns -5
Aug 18 13:52:28 machwx9 kernel: nvidia 0000:01:00.0: PM: failed to suspend async: error -5
Aug 18 13:52:28 machwx9 kernel: PM: Some devices failed to suspend, or early wake event detected

Okay I seem to have found a workaround for my issue. Basically there are two ways of power-management for nvidia-drivers: Kernel Driver Callback and systemd (/proc/driver/nvidia/suspend). The later is currently used and does not work for me. However, when I disable the nvidia-suspend-, nvidia-hibernate-, and the nvidia-resume-service, Kernel Driver Callback will be used for power management by the nvidia-deiver and so far that works for me.

More details can be found in this thread:

1 Like

Were you forced to use the 470 driver or does this work on the latest?

Nope this works on the latest version (535.98) as well.

Thank you so much this helped a ton

(idk if this is necro bumping but just delete this if it is sorry! :slight_smile: )

It is…but glad you found a solution.