System Freezes/Overheats on Idle - AMD GPU, KDE Wayland, Kernel 6.12/6.6.94

When my PC is left idle on the desktop (no demanding applications running), after approximately 30 minutes (though sometimes this duration varies), the monitor goes completely black. The system becomes entirely unresponsive, and the PC unit itself feels very hot to the touch, with the fans (especially the GPU fans) spinning aggressively and loudly. A hard reboot (holding the power button) is the only way to regain control of the system.

System:
  Kernel: 6.12.34-1-MANJARO arch: x86_64 bits: 64 compiler: gcc v: 15.1.1
    clocksource: tsc avail: hpet,acpi_pm
    parameters: BOOT_IMAGE=/@/boot/vmlinuz-6.12-x86_64
    root=UUID=deff7978-c0f2-4b63-b350-1aa4a1bd8970 rw rootflags=subvol=@ quiet
    splash udev.log_priority=3
  Desktop: KDE Plasma v: 6.3.5 tk: Qt v: N/A info: frameworks v: 6.15.0
    wm: kwin_wayland vt: 1 dm: SDDM Distro: Manjaro base: Arch Linux
Machine:
  Type: Desktop System: ASUS product: N/A v: N/A serial: <superuser required>
  Mobo: ASUSTeK model: PRIME B450M-A II v: Rev X.0x
    serial: <superuser required> part-nu: SKU uuid: <superuser required>
    UEFI: American Megatrends v: 4631 date: 01/14/2025
CPU:
  Info: model: AMD Ryzen 7 5700X3D bits: 64 type: MT MCP arch: Zen 3+ gen: 3
    level: v3 note: check built: 2022 process: TSMC n6 (7nm) family: 0x19 (25)
    model-id: 0x21 (33) stepping: 2 microcode: 0xA201213
  Topology: cpus: 1x dies: 1 clusters: 1 cores: 8 threads: 16 tpc: 2
    smt: enabled cache: L1: 512 KiB desc: d-8x32 KiB; i-8x32 KiB L2: 4 MiB
    desc: 8x512 KiB L3: 96 MiB desc: 1x96 MiB
  Speed (MHz): avg: 3193 min/max: 550/4426 boost: enabled scaling:
    driver: amd-pstate-epp governor: powersave cores: 1: 3193 2: 3193 3: 3193
    4: 3193 5: 3193 6: 3193 7: 3193 8: 3193 9: 3193 10: 3193 11: 3193 12: 3193
    13: 3193 14: 3193 15: 3193 16: 3193 bogomips: 102249
  Flags: avx avx2 ht lm nx pae sse sse2 sse3 sse4_1 sse4_2 sse4a ssse3
  Vulnerabilities:
  Type: gather_data_sampling status: Not affected
  Type: indirect_target_selection status: Not affected
  Type: itlb_multihit status: Not affected
  Type: l1tf status: Not affected
  Type: mds status: Not affected
  Type: meltdown status: Not affected
  Type: mmio_stale_data status: Not affected
  Type: reg_file_data_sampling status: Not affected
  Type: retbleed status: Not affected
  Type: spec_rstack_overflow mitigation: Safe RET
  Type: spec_store_bypass mitigation: Speculative Store Bypass disabled via
    prctl
  Type: spectre_v1 mitigation: usercopy/swapgs barriers and __user pointer
    sanitization
  Type: spectre_v2 mitigation: Retpolines; IBPB: conditional; IBRS_FW;
    STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not
    affected
  Type: srbds status: Not affected
  Type: tsx_async_abort status: Not affected
Graphics:
  Device-1: Advanced Micro Devices [AMD/ATI] Navi 32 [Radeon RX 7700 XT / 7800
    XT] vendor: Gigabyte driver: amdgpu v: kernel arch: RDNA-3 code: Navi-3x
    process: TSMC n5 (5nm) built: 2022+ pcie: gen: 4 speed: 16 GT/s lanes: 16
    ports: active: DP-2 empty: DP-1, HDMI-A-1, HDMI-A-2, Writeback-1
    bus-ID: 0b:00.0 chip-ID: 1002:747e class-ID: 0300
  Device-2: Logitech Webcam C310 driver: snd-usb-audio,uvcvideo type: USB
    rev: 2.0 speed: 480 Mb/s lanes: 1 mode: 2.0 bus-ID: 3-3:4 chip-ID: 046d:081b
    class-ID: 0102 serial: <filter>
  Display: wayland server: X.org v: 1.21.1.18 with: Xwayland v: 24.1.8
    compositor: kwin_wayland driver: X: loaded: amdgpu
    unloaded: modesetting,radeon alternate: fbdev,vesa dri: radeonsi
    gpu: amdgpu display-ID: 0
  Monitor-1: DP-2 model: Dell S3422DWG serial: <filter> built: 2022 res:
    mode: 3440x1440 hz: 144 scale: 100% (1) dpi: 110 gamma: 1.2
    size: 797x334mm (31.38x13.15") diag: 864mm (34") modes: max: 3440x1440
    min: 720x400
  API: EGL v: 1.5 hw: drv: amd radeonsi platforms: device: 0 drv: radeonsi
    device: 1 drv: swrast gbm: drv: kms_swrast surfaceless: drv: radeonsi
    wayland: drv: radeonsi x11: drv: radeonsi
  API: OpenGL v: 4.6 compat-v: 4.5 vendor: amd mesa v: 25.1.4-arch1.1
    glx-v: 1.4 direct-render: yes renderer: AMD Radeon RX 7800 XT (radeonsi
    navi32 LLVM 20.1.6 DRM 3.61 6.12.34-1-MANJARO) device-ID: 1002:747e
    memory: 15.62 GiB unified: no display-ID: :1.0
  API: Vulkan v: 1.4.313 layers: 7 device: 0 type: discrete-gpu name: AMD
    Radeon RX 7800 XT (RADV NAVI32) driver: mesa radv v: 25.1.4-arch1.1
    device-ID: 1002:747e surfaces: N/A
  Info: Tools: api: clinfo, eglinfo, glxinfo, vulkaninfo
    de: kscreen-console,kscreen-doctor wl: wayland-info
    x11: xdpyinfo, xprop, xrandr
Audio:
  Device-1: Advanced Micro Devices [AMD/ATI] Navi 31 HDMI/DP Audio
    driver: snd_hda_intel v: kernel pcie: gen: 4 speed: 16 GT/s lanes: 16
    bus-ID: 0b:00.1 chip-ID: 1002:ab30 class-ID: 0403
  Device-2: Advanced Micro Devices [AMD] Starship/Matisse HD Audio
    vendor: ASUSTeK driver: snd_hda_intel v: kernel pcie: gen: 4 speed: 16 GT/s
    lanes: 16 bus-ID: 0d:00.4 chip-ID: 1022:1487 class-ID: 0403
  Device-3: Logitech Webcam C310 driver: snd-usb-audio,uvcvideo type: USB
    rev: 2.0 speed: 480 Mb/s lanes: 1 mode: 2.0 bus-ID: 3-3:4 chip-ID: 046d:081b
    class-ID: 0102 serial: <filter>
  Device-4: JOUNIVO JV601 driver: snd-usb-audio type: USB rev: 1.1
    speed: 12 Mb/s lanes: 1 mode: 1.1 bus-ID: 3-4:5 chip-ID: 5679:1002
    class-ID: 0300 serial: <filter>
  API: ALSA v: k6.12.34-1-MANJARO status: kernel-api with: aoss
    type: oss-emulator tools: alsactl,alsamixer,amixer
  Server-1: sndiod v: N/A status: off tools: aucat,midicat,sndioctl
  Server-2: JACK v: 1.9.22 status: off tools: N/A
  Server-3: PipeWire v: 1.4.5 status: active with: 1: pipewire-pulse
    status: active 2: wireplumber status: active 3: pipewire-alsa type: plugin
    tools: pactl,pw-cat,pw-cli,wpctl
Network:
  Device-1: Realtek RTL8111/8168/8211/8411 PCI Express Gigabit Ethernet
    vendor: ASUSTeK RTL8111H driver: r8169 v: kernel pcie: gen: 1 speed: 2.5 GT/s
    lanes: 1 port: f000 bus-ID: 08:00.0 chip-ID: 10ec:8168 class-ID: 0200
  IF: enp8s0 state: up speed: 1000 Mbps duplex: full mac: <filter>
  Info: services: NetworkManager,systemd-timesyncd
Bluetooth:
  Device-1: TP-Link UB500 Adapter driver: btusb v: 0.8 type: USB rev: 1.1
    speed: 12 Mb/s lanes: 1 mode: 1.1 bus-ID: 3-2:3 chip-ID: 2357:0604
    class-ID: e001 serial: <filter>
  Report: rfkill ID: hci0 rfk-id: 0 state: up address: see --recommends
Drives:
  Local Storage: total: 1.82 TiB used: 308.78 GiB (16.6%)
  SMART Message: Required tool smartctl not installed. Check --recommends
  ID-1: /dev/nvme0n1 maj-min: 259:0 vendor: Western Digital
    model: WD BLACK SN850X 1000GB size: 931.51 GiB block-size: physical: 512 B
    logical: 512 B speed: 63.2 Gb/s lanes: 4 tech: SSD serial: <filter>
    fw-rev: 620361WD temp: 45.9 C scheme: GPT
  ID-2: /dev/sda maj-min: 8:0 vendor: Seagate model: Portable
    size: 931.51 GiB block-size: physical: 4096 B logical: 512 B type: USB
    rev: 3.0 spd: 5 Gb/s lanes: 1 mode: 3.2 gen-1x1 tech: N/A serial: <filter>
    fw-rev: 9300 scheme: MBR
Partition:
  ID-1: / raw-size: 931.22 GiB size: 931.22 GiB (100.00%)
    used: 308.78 GiB (33.2%) fs: btrfs dev: /dev/nvme0n1p2 maj-min: 259:2
  ID-2: /boot/efi raw-size: 300 MiB size: 299.4 MiB (99.80%)
    used: 616 KiB (0.2%) fs: vfat dev: /dev/nvme0n1p1 maj-min: 259:1
  ID-3: /home raw-size: 931.22 GiB size: 931.22 GiB (100.00%)
    used: 308.78 GiB (33.2%) fs: btrfs dev: /dev/nvme0n1p2 maj-min: 259:2
  ID-4: /var/log raw-size: 931.22 GiB size: 931.22 GiB (100.00%)
    used: 308.78 GiB (33.2%) fs: btrfs dev: /dev/nvme0n1p2 maj-min: 259:2
Swap:
  Alert: No swap data was found.
Sensors:
  System Temperatures: cpu: 45.6 C mobo: N/A gpu: amdgpu temp: 59.0 C
    mem: 58.0 C
  Fan Speeds (rpm): N/A gpu: amdgpu fan: 5
Info:
  Memory: total: 32 GiB available: 31.25 GiB used: 3.5 GiB (11.2%)
  Processes: 424 Power: uptime: 4m states: freeze,mem,disk suspend: deep
    avail: s2idle wakeups: 0 hibernate: platform avail: shutdown, reboot,
    suspend, test_resume image: 12.48 GiB services: org_kde_powerdevil,
    power-profiles-daemon, upowerd Init: systemd v: 257 default: graphical
    tool: systemctl
  Packages: pm: pacman pkgs: 1478 libs: 439 tools: pamac pm: flatpak pkgs: 0
    Compilers: clang: 20.1.6 gcc: 15.1.1 Shell: Zsh v: 5.9 running-in: konsole
    inxi: 3.3.38

Have you done anything to diagnose this? How about opening it up and cleaning out the dust? Run something like top to see what’s going on. To me it seems like a hardware problem.
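
Since the machine has to be hard-reset, it may help to have temperatures and fan speeds written to disk right up to the freeze. Below is a minimal sketch, assuming Python 3 and the kernel's standard hwmon files under /sys/class/hwmon (temperatures in millidegrees Celsius, fans in RPM). The function names and the log file name hwmon.log are placeholders of mine, not part of any existing tool.

```python
import glob
import os
import time


def read_hwmon(root="/sys/class/hwmon"):
    """Return {"hwmonN:chip/sensor": value}; temperatures in C, fans in RPM."""
    readings = {}
    for chip_dir in sorted(glob.glob(os.path.join(root, "hwmon*"))):
        try:
            with open(os.path.join(chip_dir, "name")) as f:
                chip = f.read().strip()
        except OSError:
            continue
        prefix = f"{os.path.basename(chip_dir)}:{chip}"
        for path in sorted(glob.glob(os.path.join(chip_dir, "*_input"))):
            sensor = os.path.basename(path)[: -len("_input")]
            if not sensor.startswith(("temp", "fan")):
                continue  # skip voltages, power readings, etc.
            try:
                with open(path) as f:
                    raw = int(f.read().strip())
            except (OSError, ValueError):
                continue  # sensor not readable right now
            # hwmon reports temperatures in millidegrees Celsius
            readings[f"{prefix}/{sensor}"] = raw / 1000 if sensor.startswith("temp") else raw
    return readings


def format_line(timestamp, readings):
    """One log line: timestamp followed by sorted name=value pairs."""
    parts = [f"{name}={value:g}" for name, value in sorted(readings.items())]
    return f"{timestamp} " + " ".join(parts)


def log_samples(log_path="hwmon.log", interval=5.0, samples=None, root="/sys/class/hwmon"):
    """Append one line per interval; samples=None means run until interrupted."""
    taken = 0
    with open(log_path, "a") as log:
        while samples is None or taken < samples:
            stamp = time.strftime("%Y-%m-%d %H:%M:%S")
            log.write(format_line(stamp, read_hwmon(root)) + "\n")
            log.flush()
            os.fsync(log.fileno())  # push the line to disk before a possible freeze
            taken += 1
            time.sleep(interval)


if __name__ == "__main__":
    # Print a single snapshot; call log_samples() instead for continuous logging.
    print(format_line(time.strftime("%Y-%m-%d %H:%M:%S"), read_hwmon()))
```

Leaving log_samples() running in a terminal while the desktop idles should leave the last readings before the black screen in hwmon.log after the reboot.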

I didn’t encounter this issue when I had Ubuntu installed. I am now testing another kernel.

Not the solution, but this might help to avoid damaged files and to turn off the computer even if the screen is blank:

There should be some information in the logs. Can you provide the output of, e.g., “journalctl -b -1 -r -p warning”?
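
If you would rather collect that output from a script, here is a minimal sketch, assuming Python 3 and journalctl on the PATH. It asks for the previous boot, newest entries first, at priority warning or higher, and passes --no-pager so long lines are not cut off by the pager. The helper names journal_command and previous_boot_messages are made up for this example.

```python
import shutil
import subprocess


def journal_command(boot=-1, priority="warning", reverse=True):
    """Build the journalctl call; boot=-1 selects the previous boot."""
    cmd = ["journalctl", f"--boot={boot}", f"--priority={priority}", "--no-pager"]
    if reverse:
        cmd.append("--reverse")  # newest entries first, like -r
    return cmd


def previous_boot_messages(**kwargs):
    """Return the matching journal lines, or an empty list without journalctl."""
    if shutil.which("journalctl") is None:
        return []
    result = subprocess.run(
        journal_command(**kwargs), capture_output=True, text=True, check=False
    )
    return result.stdout.splitlines()


if __name__ == "__main__":
    # The newest lines are the ones closest to the freeze.
    for line in previous_boot_messages()[:40]:
        print(line)
```

With the newest entries first, the top of the output is whatever the system managed to log just before the hard reset.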

As there is no swap defined for your system, you should enable swap (by the way, this is good advice for nearly every system) and see whether that changes anything. Also, following the instructions at the next link, check whether suspend, hibernation, etc. are configured correctly.

https://wiki.archlinux.org/title/Power_management/Suspend_and_hibernate#Hibernation
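
To confirm whether swap is active after setting it up, a minimal sketch like the following may help, assuming Python 3 and the usual /proc/swaps layout (a header line, then name, type, size and used in KiB, and priority). It also reads /sys/power/image_size, the hibernation image size target in bytes, where that file exists. Comparing the two is only a rough sanity check of mine, not a rule from the wiki page.

```python
import os


def parse_swaps(text):
    """Parse the contents of /proc/swaps into a list of dicts (sizes in KiB)."""
    entries = []
    for line in text.splitlines()[1:]:  # first line is the column header
        fields = line.split()
        if len(fields) < 5:
            continue
        name, kind, size_kib, used_kib, priority = fields[:5]
        entries.append({
            "name": name,
            "type": kind,
            "size_kib": int(size_kib),
            "used_kib": int(used_kib),
            "priority": int(priority),
        })
    return entries


def total_swap_bytes(entries):
    """Sum of all active swap areas, in bytes."""
    return sum(entry["size_kib"] for entry in entries) * 1024


if __name__ == "__main__":
    if os.path.exists("/proc/swaps"):
        with open("/proc/swaps") as f:
            swaps = parse_swaps(f.read())
        print("active swap areas:", swaps or "none")
        print("total swap bytes:", total_swap_bytes(swaps))
    if os.path.exists("/sys/power/image_size"):
        with open("/sys/power/image_size") as f:
            print("hibernation image size target (bytes):", int(f.read().strip()))
```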

Regularly cleaning fluff and dust is generally good advice to keep the system as cool as possible.

In case of a general heat issue, re-pasting critical components (CPU and GPU) may also be good advice.

I don’t think this is the issue here because

Compared to Manjaro Linux, Ubuntu Linux generally uses an older kernel.

So the topics below may be worth checking:

Udev rule to lower AMD CPU 'scaling_min_freq' value for kernel 6.13 & up

AMD idle speed higher with kernel 6.13 than with 6.12

Yes, AMD has been tinkering with stuff in the kernel, so a change from Ubuntu with an older kernel to Manjaro with a newer one can, unexpectedly, make a difference. Try installing the Manjaro 6.12 LTS kernel to confirm. (Use the Manjaro Settings Manager to manage kernels.)

Wait. I see you’ve done that. Sorry.

My rig is also a Zen3+, but a 5900XT, which has twice the cores of the Zen3 predecessor instead of twice the cache like yours. My GPU is only a 6750XT, though.

Adding the udev rule referenced by @linux-aarhus above dropped my CPU idle speed to 550MHz. I see yours is 3193MHz. Close enough for government work to the advertised base of 3.2GHz.
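
To see whether the frequency floor is the culprit on your side, here is a minimal sketch, assuming Python 3 and the standard cpufreq files under /sys/devices/system/cpu (values in kHz). It lists the cores whose scaling_min_freq sits above the hardware minimum in cpuinfo_min_freq, which is my reading of what the udev rule corrects. The function names are placeholders.

```python
import glob
import os

KEYS = ("scaling_cur_freq", "scaling_min_freq", "cpuinfo_min_freq")


def read_cpufreq(root="/sys/devices/system/cpu"):
    """Return {"cpuN": {key: MHz}} for every core exposing cpufreq files."""
    result = {}
    for freq_dir in glob.glob(os.path.join(root, "cpu[0-9]*", "cpufreq")):
        cpu = os.path.basename(os.path.dirname(freq_dir))
        values = {}
        for key in KEYS:
            try:
                with open(os.path.join(freq_dir, key)) as f:
                    values[key] = int(f.read().strip()) / 1000  # kHz -> MHz
            except (OSError, ValueError):
                pass  # file missing or unreadable with this driver
        if values:
            result[cpu] = values
    return result


def raised_floor(freqs):
    """Cores whose scaling minimum is above the hardware minimum."""
    hits = [
        cpu for cpu, v in freqs.items()
        if v.get("scaling_min_freq", 0) > v.get("cpuinfo_min_freq", float("inf"))
    ]
    return sorted(hits, key=lambda cpu: int(cpu[3:]))


if __name__ == "__main__":
    freqs = read_cpufreq()
    for cpu in sorted(freqs, key=lambda c: int(c[3:])):
        print(cpu, freqs[cpu])
    print("cores with a raised minimum:", raised_floor(freqs) or "none")
```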

I also use tuned and tuned-ppd to set the power profile. Between the two, I’ve dropped my idle temps 8-10C to roughly 10C over ambient, typically around 35C. Air cooled.

I’d also monitor running processes to see if something is running that you’re not already aware of. My choices are bpytop (terminal) and resources (GUI).
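
If you prefer something scriptable over a monitor UI, here is a minimal sketch, assuming Python 3 on Linux. It takes two snapshots of /proc/<pid>/stat and ranks processes by the CPU time (utime + stime, in clock ticks) they used in between. The function names are placeholders of mine, and the two-second window is arbitrary.

```python
import glob
import time


def parse_stat_line(line):
    """Return (command name, utime + stime in clock ticks) from a stat line."""
    # The command name sits in parentheses and may itself contain spaces or ')'.
    name = line[line.index("(") + 1 : line.rindex(")")]
    rest = line[line.rindex(")") + 2 :].split()
    # rest[0] is field 3 (state), so utime (field 14) is rest[11], stime is rest[12]
    return name, int(rest[11]) + int(rest[12])


def snapshot():
    """Map pid -> (name, cpu ticks) for every process readable right now."""
    procs = {}
    for path in glob.glob("/proc/[0-9]*/stat"):
        try:
            with open(path) as f:
                procs[int(path.split("/")[2])] = parse_stat_line(f.read())
        except (OSError, ValueError, IndexError):
            continue  # process exited between listing and reading
    return procs


def busiest(before, after, top=5):
    """Return [(pid, name, ticks used)] for the processes that used CPU, busiest first."""
    deltas = []
    for pid, (name, ticks) in after.items():
        if pid in before:
            deltas.append((ticks - before[pid][1], pid, name))
    deltas.sort(reverse=True)
    return [(pid, name, delta) for delta, pid, name in deltas[:top] if delta > 0]


if __name__ == "__main__":
    first = snapshot()
    time.sleep(2)
    for pid, name, ticks in busiest(first, snapshot()):
        print(f"{pid:>7} {name:<20} {ticks} ticks")
```

On a truly idle desktop the list should be short, so anything that shows up repeatedly is worth a closer look.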

Good idea to check for dust bunnies nonetheless. Especially if you’re air-cooled.

HTH!

You could also test your hardware by booting a USB session and spending maybe an hour watching some of these (designed to diagnose the issue, but if it’s serious at least you’d be happy)

https://www.youtube.com/@funniestevercomedy/videos

Changing to an older kernel didn’t solve the issue.
I don’t even get errors:

    ~  journalctl -b -1 -p err                                                                              ✔ 
Jul 11 22:12:31 avihu-pc kernel: usbhid 3-4:1.2: couldn't find an input interrupt endpoint
Jul 11 22:12:31 avihu-pc kernel: usbhid 3-4:1.2: couldn't find an input interrupt endpoint
Jul 11 22:13:07 avihu-pc kwin_wayland[923]: kwin_xkbcommon: XKB: [XKB-542] /usr/share/X11/locale/iso8859-1/Compose:>
Jul 11 22:13:07 avihu-pc kwin_wayland[923]: kwin_xkbcommon: XKB: [XKB-542] /usr/share/X11/locale/iso8859-1/Compose:>
Jul 11 22:13:07 avihu-pc kwin_wayland[923]: kwin_xkbcommon: XKB: [XKB-542] /usr/share/X11/locale/iso8859-1/Compose:>
Jul 11 22:13:07 avihu-pc kwin_wayland[923]: kwin_xkbcommon: XKB: [XKB-542] /usr/share/X11/locale/iso8859-1/Compose:>
Jul 11 22:13:07 avihu-pc kwin_wayland[923]: kwin_xkbcommon: XKB: [XKB-542] /usr/share/X11/locale/iso8859-1/Compose:>
Jul 11 22:13:07 avihu-pc kwin_wayland[923]: kwin_xkbcommon: XKB: [XKB-542] /usr/share/X11/locale/iso8859-1/Compose:>
Jul 11 22:13:07 avihu-pc kwin_wayland[923]: kwin_xkbcommon: XKB: [XKB-542] /usr/share/X11/locale/iso8859-1/Compose:>
Jul 11 22:13:07 avihu-pc kwin_wayland[923]: kwin_xkbcommon: XKB: [XKB-542] /usr/share/X11/locale/iso8859-1/Compose:>
Jul 11 22:13:07 avihu-pc kwin_wayland[923]: kwin_xkbcommon: XKB: [XKB-542] /usr/share/X11/locale/iso8859-1/Compose:>
Jul 11 22:13:07 avihu-pc kwin_wayland[923]: kwin_xkbcommon: XKB: [XKB-542] /usr/share/X11/locale/iso8859-1/Compose:>
Jul 11 22:13:07 avihu-pc kwin_wayland[923]: kwin_xkbcommon: XKB: [XKB-542] /usr/share/X11/locale/iso8859-1/Compose:>
Jul 11 22:13:07 avihu-pc kwin_wayland[923]: kwin_xkbcommon: XKB: /usr/share/X11/locale/iso8859-1/Compose:49:29: too>
Jul 11 22:13:07 avihu-pc kwin_wayland[923]: kwin_xkbcommon: XKB: /usr/share/X11/locale/iso8859-1/Compose:49:29: fai>
Jul 11 22:13:09 avihu-pc kernel: amd_pstate: Boost mode is not supported by this processor or SBIOS
Jul 11 22:13:10 avihu-pc org_kde_powerdevil[1204]: [  1204] busno=7, All features that should not exist detected. M>
Jul 11 22:19:16 avihu-pc kernel: amdgpu 0000:0b:00.0: amdgpu: still active bo inside vm
Jul 11 22:19:17 avihu-pc dbus-broker-launch[679]: Activation request for 'org.freedesktop.nm_dispatcher' failed.
Jul 11 22:19:17 avihu-pc systemd[1]: Failed unmounting /var/cache.

Doing a “reset to defaults” in the BIOS seems to have fixed it.
I’m not completely sure yet.