About two weeks ago there was a huge update (500+ packages). Since then, the laptop no longer goes to sleep (S3). It seems to be an amdgpu issue. At first I thought it might be a kernel issue, but it happens with 5.10 LTS, 5.15 LTS, 5.16 and 5.17rc7. At least 5.10 and 5.15 worked before the update.
dmesg
[357430.654590] wlp5s0: deauthenticating from XXX by local choice (Reason: 3=DEAUTH_LEAVING)
[357431.028185] audit: type=1334 audit(1648550511.115:691): prog-id=0 op=UNLOAD
[357431.071353] PM: suspend entry (deep)
[357431.076403] Filesystems sync: 0.005 seconds
[357431.265044] Freezing user space processes ... (elapsed 0.002 seconds) done.
[357431.268013] OOM killer disabled.
[357431.268014] Freezing remaining freezable tasks ... (elapsed 0.001 seconds) done.
[357431.269926] printk: Suspending console(s) (use no_console_suspend to debug)
[357431.328936] [drm] free PSP TMR buffer
[357431.362865] amdgpu 0000:03:00.0: amdgpu: BACO reset
[357431.362883] amdgpu 0000:03:00.0: amdgpu: Failed to enter BACO state!
[357431.362886] PM: pci_pm_suspend(): amdgpu_pmops_suspend+0x0/0x70 [amdgpu] returns -5
[357431.363299] PM: dpm_run_callback(): pci_pm_suspend+0x0/0x160 returns -5
[357431.363311] amdgpu 0000:03:00.0: PM: failed to suspend async: error -5
[357431.368552] [drm] free PSP TMR buffer
[357431.392994] amdgpu 0000:07:00.0: amdgpu: MODE2 reset
[357431.439005] PM: Some devices failed to suspend, or early wake event detected
[357431.440015] [drm] PCIE GART of 1024M enabled.
[357431.440020] [drm] PTB located at 0x000000F400900000
[357431.440039] [drm] PSP is resuming...
[357431.440158] pci 0000:00:00.2: can't derive routing for PCI INT A
[357431.440164] pci 0000:00:00.2: PCI INT A: no GSI
[357431.445212] nvme nvme0: Shutdown timeout set to 10 seconds
[357431.446491] nvme nvme0: 8/0/0 default/read/poll queues
[357431.460094] [drm] reserve 0x400000 from 0xf41f800000 for PSP TMR
[357431.537406] amdgpu 0000:07:00.0: amdgpu: RAS: optional ras ta ucode is not available
[357431.545219] amdgpu 0000:07:00.0: amdgpu: RAP: optional rap ta ucode is not available
[357431.545221] amdgpu 0000:07:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
[357431.545225] amdgpu 0000:07:00.0: amdgpu: SMU is resuming...
[357431.545387] amdgpu 0000:07:00.0: amdgpu: dpm has been disabled
[357431.545593] amdgpu 0000:07:00.0: amdgpu: SMU is resumed successfully!
[357431.546165] [drm] DMUB hardware initialized: version=0x0101001F
[357431.750901] ata1: SATA link down (SStatus 0 SControl 300)
[357431.750907] ata2: SATA link down (SStatus 0 SControl 300)
[357432.228636] [drm] kiq ring mec 2 pipe 1 q 0
[357432.233010] [drm] VCN decode and encode initialized successfully(under DPG Mode).
[357432.233061] [drm] JPEG decode initialized successfully.
[357432.233072] amdgpu 0000:07:00.0: amdgpu: ring gfx uses VM inv eng 0 on hub 0
[357432.233077] amdgpu 0000:07:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0
[357432.233079] amdgpu 0000:07:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0
[357432.233081] amdgpu 0000:07:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 5 on hub 0
[357432.233083] amdgpu 0000:07:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 6 on hub 0
[357432.233085] amdgpu 0000:07:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 7 on hub 0
[357432.233087] amdgpu 0000:07:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 8 on hub 0
[357432.233089] amdgpu 0000:07:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 9 on hub 0
[357432.233091] amdgpu 0000:07:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 10 on hub 0
[357432.233093] amdgpu 0000:07:00.0: amdgpu: ring kiq_2.1.0 uses VM inv eng 11 on hub 0
[357432.233096] amdgpu 0000:07:00.0: amdgpu: ring sdma0 uses VM inv eng 0 on hub 1
[357432.233098] amdgpu 0000:07:00.0: amdgpu: ring vcn_dec uses VM inv eng 1 on hub 1
[357432.233100] amdgpu 0000:07:00.0: amdgpu: ring vcn_enc0 uses VM inv eng 4 on hub 1
[357432.233102] amdgpu 0000:07:00.0: amdgpu: ring vcn_enc1 uses VM inv eng 5 on hub 1
[357432.233104] amdgpu 0000:07:00.0: amdgpu: ring jpeg_dec uses VM inv eng 6 on hub 1
[357432.524549] OOM killer enabled.
[357432.524552] Restarting tasks ... done.
[357432.575211] PM: suspend exit
[357432.575286] PM: suspend entry (s2idle)
[357432.578115] Filesystems sync: 0.002 seconds
[357432.588312] Freezing user space processes ... (elapsed 0.002 seconds) done.
[357432.591071] OOM killer disabled.
[357432.591072] Freezing remaining freezable tasks ... (elapsed 0.001 seconds) done.
[357432.592749] printk: Suspending console(s) (use no_console_suspend to debug)
[357432.592873] amdgpu 0000:07:00.0: amdgpu: Power consumption will be higher as BIOS has not been configured for suspend-to-idle.
To use suspend-to-idle change the sleep mode in BIOS setup.
[357432.743735] ------------[ cut here ]------------
[357432.743737] WARNING: CPU: 15 PID: 387005 at drivers/gpu/drm/amd/amdgpu/../display/amdgpu_dm/amdgpu_dm.c:2402 dm_suspend+0x246/0x260 [amdgpu]
[357432.743917] Modules linked in: ntfs3 ums_realtek uas usb_storage ccm snd_seq_dummy snd_hrtimer snd_seq snd_seq_device rfcomm cmac algif_hash algif_skcipher af_alg nfnetlink bnep btusb btrtl btbcm btintel btmtk bluetooth ecdh_generic uvcvideo videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 videobuf2_common videodev mc qrtr joydev mousedev snd_acp3x_rn snd_soc_dmic snd_acp3x_pdm_dma intel_rapl_msr snd_sof_amd_renoir hid_multitouch snd_acp_config snd_sof_amd_acp dell_laptop intel_rapl_common dell_wmi iwlmvm snd_sof_pci snd_sof snd_ctl_led dell_smbios amdgpu snd_hda_codec_realtek snd_soc_acpi mac80211 alienware_wmi wmi_bmof dell_wmi_descriptor snd_hda_codec_generic sparse_keymap edac_mce_amd snd_hda_codec_hdmi ledtrig_audio dcdbas snd_soc_core snd_hda_intel libarc4 kvm_amd snd_compress snd_intel_dspcfg ac97_bus snd_intel_sdw_acpi vfat fat kvm snd_hda_codec iwlwifi snd_pcm_dmaengine snd_hda_core snd_hwdep irqbypass snd_pci_acp5x r8169 ucsi_acpi snd_pcm rapl gpu_sched snd_rn_pci_acp3x
[357432.743950] typec_ucsi sp5100_tco wdat_wdt cfg80211 psmouse snd_timer typec snd drm_ttm_helper ttm snd_pci_acp3x soundcore realtek i2c_piix4 k10temp roles wmi video dell_rbtn i2c_hid_acpi rfkill i2c_hid pinctrl_amd acpi_cpufreq mac_hid uinput acpi_call(OE) ipmi_devintf ipmi_msghandler sg fuse crypto_user ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 dm_crypt cbc encrypted_keys dm_mod trusted asn1_encoder tee tpm serio_raw atkbd libps2 crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel aesni_intel xhci_pci ccp i8042 crypto_simd cryptd xhci_pci_renesas rng_core serio
[357432.743976] CPU: 15 PID: 387005 Comm: kworker/u32:29 Tainted: G OE 5.17.0-1-MANJARO #1 4f77b0a89bee9a649532a573ef03ea6bcc62098c
[357432.743978] Hardware name: Dell Inc. G5 5505/0JT83K, BIOS 1.10.0 01/27/2022
[357432.743979] Workqueue: events_unbound async_run_entry_fn
[357432.743983] RIP: 0010:dm_suspend+0x246/0x260 [amdgpu]
[357432.744144] Code: 48 8b 3c c2 e8 2b 26 e5 d2 48 8b 83 d8 3d 01 00 39 a8 64 01 00 00 77 d7 e9 58 fe ff ff 4c 89 e6 4c 89 ff e8 ec 5f 16 00 eb 8c <0f> 0b e9 f5 fd ff ff e8 4e e2 9a d3 66 66 2e 0f 1f 84 00 00 00 00
[357432.744145] RSP: 0018:ffffc3dc0d8d3cf0 EFLAGS: 00010282
[357432.744146] RAX: 0000000000000000 RBX: ffffa072d9280000 RCX: 0000000000000000
[357432.744147] RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffffa072d9280000
[357432.744148] RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000000
[357432.744149] R10: 0000000000000000 R11: 0000000000000000 R12: ffffa072d9280000
[357432.744149] R13: 0000000000000000 R14: 0000000000000002 R15: ffffa072c0122705
[357432.744150] FS: 0000000000000000(0000) GS:ffffa079af9c0000(0000) knlGS:0000000000000000
[357432.744151] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[357432.744152] CR2: 00007f10f0fb8a44 CR3: 0000000281210000 CR4: 0000000000350ee0
[357432.744153] Call Trace:
[357432.744155] <TASK>
[357432.744157] ? nv_common_set_clockgating_state+0xa7/0xc0 [amdgpu 79e63ed5494c3e9e9a579b9905248b5fa17819b4]
[357432.744298] amdgpu_device_ip_suspend_phase1+0x63/0xd0 [amdgpu 79e63ed5494c3e9e9a579b9905248b5fa17819b4]
[357432.744432] amdgpu_device_suspend+0x63/0xf0 [amdgpu 79e63ed5494c3e9e9a579b9905248b5fa17819b4]
[357432.744565] amdgpu_pmops_suspend+0x2f/0x70 [amdgpu 79e63ed5494c3e9e9a579b9905248b5fa17819b4]
[357432.744699] pci_pm_suspend+0x74/0x160
[357432.744702] ? pci_pm_freeze+0xc0/0xc0
[357432.744703] dpm_run_callback+0x49/0x160
[357432.744707] __device_suspend+0x13b/0x500
[357432.744708] async_suspend+0x1b/0x90
[357432.744710] async_run_entry_fn+0x30/0x130
[357432.744712] process_one_work+0x1e5/0x3b0
[357432.744714] worker_thread+0x50/0x3a0
[357432.744715] ? rescuer_thread+0x3a0/0x3a0
[357432.744716] kthread+0xd8/0x100
[357432.744717] ? kthread_complete_and_exit+0x20/0x20
[357432.744719] ret_from_fork+0x22/0x30
[357432.744722] </TASK>
[357432.744723] ---[ end trace 0000000000000000 ]---
[357432.987561] amdgpu 0000:03:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring kiq_2.1.0 test failed (-110)
[357432.987705] [drm:gfx_v10_0_hw_fini [amdgpu]] *ERROR* KGQ disable failed
[357432.990955] [drm] Register(0) [mmUVD_POWER_STATUS] failed to reach value 0x00000001 != 0x00000002
[357433.168761] [drm] Register(0) [mmUVD_POWER_STATUS] failed to reach value 0x00000001 != 0x00000002
[357433.177835] [drm] free PSP TMR buffer
[357433.202225] amdgpu 0000:07:00.0: amdgpu: MODE2 reset
[357433.217689] amdgpu 0000:03:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring kiq_2.1.0 test failed (-110)
[357433.217843] [drm:gfx_v10_0_hw_fini [amdgpu]] *ERROR* KCQ disable failed
[357433.218045] [drm] free PSP TMR buffer
[357435.439594] [drm] psp gfx command DESTROY_TMR(0x7) failed and response status is (0x0)
[357435.439596] [drm:psp_suspend [amdgpu]] *ERROR* Failed to terminate tmr
[357435.439743] [drm:amdgpu_device_ip_suspend_phase2 [amdgpu]] *ERROR* suspend of IP block <psp> failed -22
[357435.441165] amdgpu 0000:03:00.0: amdgpu: BACO reset
[357435.441180] amdgpu 0000:03:00.0: amdgpu: Failed to enter BACO state!
[357435.441182] PM: pci_pm_suspend(): amdgpu_pmops_suspend+0x0/0x70 [amdgpu] returns -5
[357435.441313] PM: dpm_run_callback(): pci_pm_suspend+0x0/0x160 returns -5
[357435.441320] amdgpu 0000:03:00.0: PM: failed to suspend async: error -5
[357435.441671] PM: Some devices failed to suspend, or early wake event detected
[357435.441977] pci 0000:00:00.2: can't derive routing for PCI INT A
[357435.441981] pci 0000:00:00.2: PCI INT A: no GSI
[357435.442205] [drm] PCIE GART of 1024M enabled.
[357435.442206] [drm] PTB located at 0x000000F400900000
[357435.442215] [drm] PSP is resuming...
[357435.462257] [drm] reserve 0x400000 from 0xf41f800000 for PSP TMR
[357435.494811] nvme nvme0: Shutdown timeout set to 10 seconds
[357435.496463] nvme nvme0: 8/0/0 default/read/poll queues
[357435.539644] amdgpu 0000:07:00.0: amdgpu: RAS: optional ras ta ucode is not available
[357435.547491] amdgpu 0000:07:00.0: amdgpu: RAP: optional rap ta ucode is not available
[357435.547494] amdgpu 0000:07:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
[357435.547498] amdgpu 0000:07:00.0: amdgpu: SMU is resuming...
[357435.548135] amdgpu 0000:07:00.0: amdgpu: dpm has been disabled
[357435.548339] amdgpu 0000:07:00.0: amdgpu: SMU is resumed successfully!
[357435.548910] [drm] DMUB hardware initialized: version=0x0101001F
[357435.755238] ata2: SATA link down (SStatus 0 SControl 300)
[357435.755240] ata1: SATA link down (SStatus 0 SControl 300)
[357435.869220] [drm] kiq ring mec 2 pipe 1 q 0
[357435.878032] [drm] VCN decode and encode initialized successfully(under DPG Mode).
[357435.878076] [drm] JPEG decode initialized successfully.
[357435.878084] amdgpu 0000:07:00.0: amdgpu: ring gfx uses VM inv eng 0 on hub 0
[357435.878086] amdgpu 0000:07:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0
[357435.878087] amdgpu 0000:07:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0
[357435.878088] amdgpu 0000:07:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 5 on hub 0
[357435.878089] amdgpu 0000:07:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 6 on hub 0
[357435.878089] amdgpu 0000:07:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 7 on hub 0
[357435.878090] amdgpu 0000:07:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 8 on hub 0
[357435.878091] amdgpu 0000:07:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 9 on hub 0
[357435.878091] amdgpu 0000:07:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 10 on hub 0
[357435.878092] amdgpu 0000:07:00.0: amdgpu: ring kiq_2.1.0 uses VM inv eng 11 on hub 0
[357435.878093] amdgpu 0000:07:00.0: amdgpu: ring sdma0 uses VM inv eng 0 on hub 1
[357435.878094] amdgpu 0000:07:00.0: amdgpu: ring vcn_dec uses VM inv eng 1 on hub 1
[357435.878094] amdgpu 0000:07:00.0: amdgpu: ring vcn_enc0 uses VM inv eng 4 on hub 1
[357435.878095] amdgpu 0000:07:00.0: amdgpu: ring vcn_enc1 uses VM inv eng 5 on hub 1
[357435.878096] amdgpu 0000:07:00.0: amdgpu: ring jpeg_dec uses VM inv eng 6 on hub 1
[357436.522992] OOM killer enabled.
[357436.522994] Restarting tasks ... done.
[357436.572753] PM: suspend exit
[357436.886742] audit: type=1130 audit(1648550516.972:692): pid=1 uid=0 auid=4294967295 ses=4294967295 subj==unconfined msg='unit=systemd-suspend comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=failed'
[357436.924811] Generic FE-GE Realtek PHY r8169-0-400:00: attached PHY driver (mii_bus:phy_addr=r8169-0-400:00, irq=MAC)
[357437.118326] r8169 0000:04:00.0 enp4s0: Link is Down
[357438.918375] audit: type=1100 audit(1648550519.005:693): pid=387191 uid=1000 auid=1000 ses=2 subj==unconfined msg='op=PAM:unix_chkpwd acct="XXX" exe="/usr/bin/unix_chkpwd" hostname=? addr=? terminal=? res=success'
[357440.274438] wlp5s0: authenticate with XXX
[357440.274456] wlp5s0: 80 MHz not supported, disabling VHT
[357440.282702] wlp5s0: send auth to XXX (try 1/3)
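As a reading aid for the log above, here is a minimal Python sketch that pulls the `PM: failed to suspend` lines out of dmesg text and reports the device and errno name. The function name `failed_suspends` and the short `SAMPLE` excerpt are illustrative only, and the regular expression assumes nothing beyond the line format seen in this log. In both suspend attempts the failing device is 0000:03:00.0, which the lspci output in this post lists as the Navi 10 GPU, while 0000:07:00.0 (Renoir) resumes normally.

```python
import errno
import re

# Matches lines such as:
# [357431.363311] amdgpu 0000:03:00.0: PM: failed to suspend async: error -5
FAILED_SUSPEND = re.compile(
    r"^\[\s*(?P<ts>\d+\.\d+)\]\s+(?P<driver>\S+)\s+"
    r"(?P<addr>[0-9a-f]{4}:[0-9a-f]{2}:[0-9a-f]{2}\.[0-7]):\s+"
    r"PM: failed to suspend(?: async)?: error (?P<err>-?\d+)"
)


def failed_suspends(dmesg_text):
    """Return (timestamp, driver, PCI address, errno name) per failed suspend."""
    results = []
    for line in dmesg_text.splitlines():
        m = FAILED_SUSPEND.match(line.strip())
        if m:
            code = abs(int(m["err"]))
            # Fall back to the raw number if the errno is unknown here.
            name = errno.errorcode.get(code, str(code))
            results.append((m["ts"], m["driver"], m["addr"], name))
    return results


# Excerpt copied from the log above.
SAMPLE = """\
[357431.362883] amdgpu 0000:03:00.0: amdgpu: Failed to enter BACO state!
[357431.363311] amdgpu 0000:03:00.0: PM: failed to suspend async: error -5
[357431.392994] amdgpu 0000:07:00.0: amdgpu: MODE2 reset
[357435.441320] amdgpu 0000:03:00.0: PM: failed to suspend async: error -5
"""

if __name__ == "__main__":
    for ts, driver, addr, err in failed_suspends(SAMPLE):
        print(f"{ts}: {driver} {addr} failed to suspend ({err})")
```

To run it against a live system, the output of `dmesg` could be passed to `failed_suspends` instead of `SAMPLE`.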
cat /proc/acpi/wakeup
Device S-state Status Sysfs node
GPP0 S4 *enabled pci:0000:00:01.1
SWUS S4 *enabled pci:0000:01:00.0
SWDS S4 *enabled pci:0000:02:00.0
GPP1 S4 *disabled
GPP2 S4 *disabled
GPP3 S4 *enabled pci:0000:00:02.1
GPP4 S4 *enabled pci:0000:00:02.2
GPP5 S4 *disabled
GP17 S4 *enabled pci:0000:00:08.1
XHC0 S4 *enabled pci:0000:07:00.3
XHC1 S4 *enabled pci:0000:07:00.4
GP19 S4 *disabled
LID S3 *disabled platform:PNP0C0D:00
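Since the kernel message says "Some devices failed to suspend, or early wake event detected", it can help to see at a glance which entries in this table are allowed to wake the machine. The following is a small Python sketch that filters the table; the function name `enabled_wakeup_devices` and the `SAMPLE` excerpt are made up for illustration, and the parser only assumes the whitespace-separated column layout shown above.

```python
def enabled_wakeup_devices(text):
    """Return (device, S-state, sysfs node or None) for every enabled entry."""
    enabled = []
    for line in text.splitlines():
        fields = line.split()
        # Skip blank lines and the "Device S-state Status Sysfs node" header.
        if len(fields) < 3 or fields[0] == "Device":
            continue
        name, s_state, status = fields[:3]
        if status.lstrip("*") == "enabled":
            node = fields[3] if len(fields) > 3 else None
            enabled.append((name, s_state, node))
    return enabled


# Excerpt copied from the table above.
SAMPLE = """\
Device S-state Status Sysfs node
GPP0 S4 *enabled pci:0000:00:01.1
GPP1 S4 *disabled
XHC0 S4 *enabled pci:0000:07:00.3
LID S3 *disabled platform:PNP0C0D:00
"""

if __name__ == "__main__":
    for name, s_state, node in enabled_wakeup_devices(SAMPLE):
        print(name, s_state, node)
```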
cat /sys/power/mem_sleep
s2idle [deep]
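In this file the entry in square brackets is the variant currently used for suspend, so `deep` (S3) is selected here and `s2idle` is the alternative. A minimal Python sketch for reading that line, with the hypothetical helper names `active_sleep_mode` and `available_sleep_modes`:

```python
import re


def active_sleep_mode(text):
    """Return the bracketed (currently selected) mode, or None if absent."""
    m = re.search(r"\[(\w+)\]", text)
    return m.group(1) if m else None


def available_sleep_modes(text):
    """Return every mode listed, with the brackets stripped."""
    return [token.strip("[]") for token in text.split()]


if __name__ == "__main__":
    line = "s2idle [deep]\n"  # content shown above
    print(active_sleep_mode(line), available_sleep_modes(line))
```

On a live system the input would come from reading `/sys/power/mem_sleep` rather than a hard-coded string.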
lspci
00:00.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Renoir/Cezanne Root Complex
00:00.2 IOMMU: Advanced Micro Devices, Inc. [AMD] Renoir/Cezanne IOMMU
00:01.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Renoir PCIe Dummy Host Bridge
00:01.1 PCI bridge: Advanced Micro Devices, Inc. [AMD] Renoir PCIe GPP Bridge
00:02.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Renoir PCIe Dummy Host Bridge
00:02.1 PCI bridge: Advanced Micro Devices, Inc. [AMD] Renoir/Cezanne PCIe GPP Bridge
00:02.2 PCI bridge: Advanced Micro Devices, Inc. [AMD] Renoir/Cezanne PCIe GPP Bridge
00:02.4 PCI bridge: Advanced Micro Devices, Inc. [AMD] Renoir/Cezanne PCIe GPP Bridge
00:08.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Renoir PCIe Dummy Host Bridge
00:08.1 PCI bridge: Advanced Micro Devices, Inc. [AMD] Renoir Internal PCIe GPP Bridge to Bus
00:08.2 PCI bridge: Advanced Micro Devices, Inc. [AMD] Renoir Internal PCIe GPP Bridge to Bus
00:14.0 SMBus: Advanced Micro Devices, Inc. [AMD] FCH SMBus Controller (rev 51)
00:14.3 ISA bridge: Advanced Micro Devices, Inc. [AMD] FCH LPC Bridge (rev 51)
00:18.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Renoir Device 24: Function 0
00:18.1 Host bridge: Advanced Micro Devices, Inc. [AMD] Renoir Device 24: Function 1
00:18.2 Host bridge: Advanced Micro Devices, Inc. [AMD] Renoir Device 24: Function 2
00:18.3 Host bridge: Advanced Micro Devices, Inc. [AMD] Renoir Device 24: Function 3
00:18.4 Host bridge: Advanced Micro Devices, Inc. [AMD] Renoir Device 24: Function 4
00:18.5 Host bridge: Advanced Micro Devices, Inc. [AMD] Renoir Device 24: Function 5
00:18.6 Host bridge: Advanced Micro Devices, Inc. [AMD] Renoir Device 24: Function 6
00:18.7 Host bridge: Advanced Micro Devices, Inc. [AMD] Renoir Device 24: Function 7
01:00.0 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] Navi 10 XL Upstream Port of PCI Express Switch (rev c2)
02:00.0 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] Navi 10 XL Downstream Port of PCI Express Switch
03:00.0 Display controller: Advanced Micro Devices, Inc. [AMD/ATI] Navi 10 [Radeon RX 5600 OEM/5600 XT / 5700/5700 XT] (rev c2)
03:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] Navi 10 HDMI Audio
04:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168/8411 PCI Express Gigabit Ethernet Controller (rev 15)
05:00.0 Network controller: Intel Corporation Wi-Fi 6 AX200 (rev 1a)
06:00.0 Non-Volatile memory controller: Phison Electronics Corporation E16 PCIe4 NVMe Controller (rev 01)
07:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Renoir (rev c6)
07:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] Renoir Radeon High Definition Audio Controller
07:00.2 Encryption controller: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 10h-1fh) Platform Security Processor
07:00.3 USB controller: Advanced Micro Devices, Inc. [AMD] Renoir/Cezanne USB 3.1
07:00.4 USB controller: Advanced Micro Devices, Inc. [AMD] Renoir/Cezanne USB 3.1
07:00.5 Multimedia controller: Advanced Micro Devices, Inc. [AMD] ACP/ACP3X/ACP6x Audio Coprocessor (rev 01)
07:00.6 Audio device: Advanced Micro Devices, Inc. [AMD] Family 17h/19h HD Audio Controller
08:00.0 SATA controller: Advanced Micro Devices, Inc. [AMD] FCH SATA Controller [AHCI mode] (rev 81)
08:00.1 SATA controller: Advanced Micro Devices, Inc. [AMD] FCH SATA Controller [AHCI mode] (rev 81)
It also did not work when I disabled S3 and tried to use s2idle.
I’m a big fan of AMD but the amdgpu driver is just the worst thing ever.
Any idea how to fix this? Maybe it’s the xf86-video-amdgpu? Or the firmware?
Edit: Downgrading xf86-video-amdgpu does NOT help
Edit 2: Using older firmware files does not help either.
Additionally, the system won’t shut down properly after one failed attempt to go to sleep (this always happens, not just with the old firmware).