Very high CPU temperature after latest update

After the latest stable update, my system is experiencing very high CPU temperature (over 100 C) and is very sluggish see photo below. Actually I was writing this post from my desktop which has a problem and it shuts down unexpectedly probably because of the high temperature. Can you please help?

System:
  Host: desktop Kernel: 6.6.7-1-MANJARO arch: x86_64 bits: 64 compiler: gcc
    v: 13.2.1 clocksource: tsc Desktop: KDE Plasma v: 5.27.10 tk: Qt v: 5.15.11
    wm: kwin_wayland vt: 1 dm: SDDM Distro: Manjaro Linux base: Arch Linux
Machine:
  Type: Desktop Mobo: Gigabyte model: B550M GAMING
    serial: <superuser required> UEFI: American Megatrends LLC. v: F15c
    date: 05/12/2022
CPU:
  Info: 12-core model: AMD Ryzen 9 5900X bits: 64 type: MT MCP smt: enabled
    arch: Zen 3+ rev: 2 cache: L1: 768 KiB L2: 6 MiB L3: 64 MiB
  Speed (MHz): avg: 614 high: 2200 min/max: 2200/4950 boost: enabled cores:
    1: 546 2: 546 3: 546 4: 546 5: 546 6: 546 7: 546 8: 546 9: 545 10: 545
    11: 543 12: 546 13: 546 14: 2200 15: 546 16: 546 17: 546 18: 545 19: 546
    20: 546 21: 546 22: 546 23: 545 24: 546 bogomips: 177349
  Flags: avx avx2 ht lm nx pae sse sse2 sse3 sse4_1 sse4_2 sse4a ssse3 svm
Graphics:
  Device-1: AMD Navi 23 [Radeon RX 6650 XT / 6700S 6800S] vendor: XFX
    driver: amdgpu v: kernel arch: RDNA-2 pcie: speed: 16 GT/s lanes: 16 ports:
    active: DP-1,DP-3,HDMI-A-1 empty: DP-2 bus-ID: 08:00.0 chip-ID: 1002:73ef
    class-ID: 0300
  Device-2: Owon USB CAMERA driver: snd-usb-audio,uvcvideo type: USB
    rev: 2.0 speed: 480 Mb/s lanes: 1 bus-ID: 1-2:2 chip-ID: 5345:336b
    class-ID: 0102 serial: USB CAMERA
  Device-3: Elgato Systems GmbH Cam Link 4K
    driver: hid-generic,snd-usb-audio,usbhid,uvcvideo type: USB rev: 3.0
    speed: 5 Gb/s lanes: 1 bus-ID: 4-4:2 chip-ID: 0fd9:0066 class-ID: 0102
    serial: 0006419AB1000
  Display: wayland server: X.org v: 1.21.1.10 with: Xwayland v: 23.2.3
    compositor: kwin_wayland driver: X: loaded: amdgpu
    unloaded: modesetting,radeon alternate: fbdev,vesa dri: radeonsi
    gpu: amdgpu d-rect: 6440x4440 display-ID: 0
  Monitor-1: DP-1 pos: middle-c res: 3440x1440 size: N/A modes: N/A
  Monitor-2: DP-3 pos: bottom-l res: 1920x1080 size: N/A modes: N/A
  Monitor-3: HDMI-A-1 pos: top-right res: 1080x1920 size: N/A modes: N/A
  API: EGL v: 1.5 hw: drv: amd radeonsi platforms: device: 0 drv: radeonsi
    device: 1 drv: swrast surfaceless: drv: radeonsi wayland: drv: radeonsi x11:
    drv: radeonsi inactive: gbm
  API: OpenGL v: 4.6 compat-v: 4.5 vendor: amd mesa v: 23.1.9-manjaro1.1
    glx-v: 1.4 direct-render: yes renderer: AMD Radeon RX 6650 XT (navi23 LLVM
    16.0.6 DRM 3.54 6.6.7-1-MANJARO) device-ID: 1002:73ef display-ID: :1.0
  API: Vulkan v: 1.3.269 layers: 4 surfaces: xcb,xlib,wayland device: 0
    type: discrete-gpu hw: amd driver: mesa radv device-ID: 1002:73ef
Audio:
  Device-1: AMD Navi 21/23 HDMI/DP Audio driver: snd_hda_intel v: kernel pcie:
    speed: 16 GT/s lanes: 16 bus-ID: 08:00.1 chip-ID: 1002:ab28 class-ID: 0403
  Device-2: AMD Starship/Matisse HD Audio vendor: Gigabyte
    driver: snd_hda_intel v: kernel pcie: speed: 16 GT/s lanes: 16
    bus-ID: 0a:00.4 chip-ID: 1022:1487 class-ID: 0403
  Device-3: Owon USB CAMERA driver: snd-usb-audio,uvcvideo type: USB
    rev: 2.0 speed: 480 Mb/s lanes: 1 bus-ID: 1-2:2 chip-ID: 5345:336b
    class-ID: 0102 serial: USB CAMERA
  Device-4: Elgato Systems GmbH Cam Link 4K
    driver: hid-generic,snd-usb-audio,usbhid,uvcvideo type: USB rev: 3.0
    speed: 5 Gb/s lanes: 1 bus-ID: 4-4:2 chip-ID: 0fd9:0066 class-ID: 0102
    serial: 0006419AB1000
  API: ALSA v: k6.6.7-1-MANJARO status: kernel-api with: aoss
    type: oss-emulator
  Server-1: PipeWire v: 1.0.0 status: active with: 1: pipewire-pulse
    status: active 2: pipewire-media-session status: active 3: pipewire-alsa
    type: plugin 4: pw-jack type: plugin
Network:
  Device-1: Intel Wi-Fi 6 AX200 driver: iwlwifi v: kernel pcie: speed: 5 GT/s
    lanes: 1 bus-ID: 04:00.0 chip-ID: 8086:2723 class-ID: 0280
  IF: wlp4s0 state: up mac: 4c:44:5b:86:fd:71
  Device-2: Realtek RTL8111/8168/8411 PCI Express Gigabit Ethernet
    vendor: Gigabyte driver: r8169 v: kernel pcie: speed: 2.5 GT/s lanes: 1
    port: f000 bus-ID: 05:00.0 chip-ID: 10ec:8168 class-ID: 0200
  IF: enp5s0 state: down mac: d8:5e:d3:32:cd:96
Bluetooth:
  Device-1: Intel AX200 Bluetooth driver: btusb v: 0.8 type: USB rev: 2.0
    speed: 12 Mb/s lanes: 1 bus-ID: 1-7:3 chip-ID: 8087:0029 class-ID: e001
  Report: btmgmt ID: hci0 rfk-id: 0 state: down bt-service: enabled,running
    rfk-block: hardware: no software: no address: 4C:44:5B:86:FD:75 bt-v: 5.2
    lmp-v: 11
Drives:
  Local Storage: total: 4.55 TiB used: 1.69 TiB (37.1%)
  ID-1: /dev/nvme0n1 vendor: Crucial model: CT1000P2SSD8 size: 931.51 GiB
    speed: 31.6 Gb/s lanes: 4 tech: SSD serial: 2225E63DF9BA fw-rev: P2CR048
    temp: 45.9 C scheme: GPT
  ID-2: /dev/sda vendor: Seagate model: ST4000DM004-2U9104 size: 3.64 TiB
    speed: 6.0 Gb/s tech: HDD rpm: 5400 serial: WW60G8YF fw-rev: 0001
    scheme: GPT
Partition:
  ID-1: / size: 915.53 GiB used: 799.87 GiB (87.4%) fs: ext4
    dev: /dev/nvme0n1p2
  ID-2: /boot/efi size: 299.4 MiB used: 312 KiB (0.1%) fs: vfat
    dev: /dev/nvme0n1p1
Swap:
  ID-1: swap-1 type: file size: 512 MiB used: 0 KiB (0.0%) priority: -2
    file: /swapfile
Sensors:
  System Temperatures: cpu: 99.5 C mobo: 29.0 C gpu: amdgpu temp: 49.0 C
    mem: 46.0 C
  Fan Speeds (rpm): N/A gpu: amdgpu fan: 0
Info:
  Processes: 465 Uptime: 0m wakeups: 0 Memory: total: 32 GiB
  available: 31.25 GiB used: 3.05 GiB (9.8%) Init: systemd v: 254
  default: graphical Compilers: gcc: 13.2.1 clang: 16.0.6 Packages: 1791
  pm: pacman pkgs: 1771 pm: flatpak pkgs: 20 Shell: Bash v: 5.2.21
  running-in: konsole inxi: 3.3.31

Maybe close all applications and check your temperature again.

Which CPU cooler are you using?

Heat problems is mainly a issue related to your cooling… besides some AVX Benches that i know.

All in all it should not be a Linux problem, better check RPM on your CPU FAN or check if your vcore is right in Bios. You can also check your Thermal Paste or try undervolting your CPU.

With no applications open after just booting in, the temperature goes to 110 C and the system shuts down.

Here is a photo of the CPU, it has a ID-COOLING ZoomFlow 240 XT 240mm Addressable RGB AIO CPU Liquid Cooler

It’s weird how 5 min after the unexpected shut down, the lights in the desktop are still on.

Uff watercooling, my experience is very limited in this case, i only know there is sometimes air inside the waterstream and the other solutions i mentioned above.

there is no information of any scaling-governor installed and used. you should install one
https://wiki.archlinux.org/title/Scaling_governor

'bout that:

I’ve tried to enable ondemand


sudo cpupower frequency-set -g ondemand
[sudo] password for geo: 
Setting cpu: 0
Setting cpu: 1
Setting cpu: 2
Setting cpu: 3
Setting cpu: 4
Setting cpu: 5
Setting cpu: 6
Setting cpu: 7
Setting cpu: 8
Setting cpu: 9
Setting cpu: 10
Setting cpu: 11
Setting cpu: 12
Setting cpu: 13
Setting cpu: 14
Setting cpu: 15
Setting cpu: 16
Setting cpu: 17
Setting cpu: 18
Setting cpu: 19
Setting cpu: 20
Setting cpu: 21
Setting cpu: 22
Setting cpu: 23
Following CPUs are offline:
24-31
cpupower set operation was not performed on them

First of all why is CPU 24 offline?

Also, inxi is still not showing any governors.

System:
  Host: desktop Kernel: 6.6.7-1-MANJARO arch: x86_64 bits: 64 compiler: gcc
    v: 13.2.1 clocksource: tsc Desktop: KDE Plasma v: 5.27.10 tk: Qt v: 5.15.11
    wm: kwin_wayland vt: 1 dm: SDDM Distro: Manjaro Linux base: Arch Linux
Machine:
  Type: Desktop Mobo: Gigabyte model: B550M GAMING
    serial: <superuser required> UEFI: American Megatrends LLC. v: F15c
    date: 05/12/2022
CPU:
  Info: 12-core model: AMD Ryzen 9 5900X bits: 64 type: MT MCP smt: enabled
    arch: Zen 3+ rev: 2 cache: L1: 768 KiB L2: 6 MiB L3: 64 MiB
  Speed (MHz): avg: 2225 high: 2800 min/max: 2200/4950 boost: enabled cores:
    1: 2200 2: 2200 3: 2200 4: 2200 5: 2200 6: 2200 7: 2200 8: 2200 9: 2200
    10: 2800 11: 2200 12: 2200 13: 2200 14: 2200 15: 2200 16: 2200 17: 2200
    18: 2200 19: 2200 20: 2200 21: 2200 22: 2200 23: 2200 24: 2200
    bogomips: 177351
  Flags: avx avx2 ht lm nx pae sse sse2 sse3 sse4_1 sse4_2 sse4a ssse3 svm
Graphics:
  Device-1: AMD Navi 23 [Radeon RX 6650 XT / 6700S 6800S] vendor: XFX
    driver: amdgpu v: kernel arch: RDNA-2 pcie: speed: 16 GT/s lanes: 16 ports:
    active: DP-1,DP-3,HDMI-A-1 empty: DP-2 bus-ID: 08:00.0 chip-ID: 1002:73ef
    class-ID: 0300
  Device-2: Owon USB CAMERA driver: snd-usb-audio,uvcvideo type: USB
    rev: 2.0 speed: 480 Mb/s lanes: 1 bus-ID: 1-2:2 chip-ID: 5345:336b
    class-ID: 0102 serial: USB CAMERA
  Device-3: Elgato Systems GmbH Cam Link 4K
    driver: hid-generic,snd-usb-audio,usbhid,uvcvideo type: USB rev: 3.0
    speed: 5 Gb/s lanes: 1 bus-ID: 4-4:2 chip-ID: 0fd9:0066 class-ID: 0102
    serial: 0006419AB1000
  Display: wayland server: X.org v: 1.21.1.10 with: Xwayland v: 23.2.3
    compositor: kwin_wayland driver: X: loaded: amdgpu
    unloaded: modesetting,radeon alternate: fbdev,vesa dri: radeonsi
    gpu: amdgpu d-rect: 6440x4440 display-ID: 0
  Monitor-1: DP-1 pos: middle-c res: 3440x1440 size: N/A modes: N/A
  Monitor-2: DP-3 pos: bottom-l res: 1920x1080 size: N/A modes: N/A
  Monitor-3: HDMI-A-1 pos: top-right res: 1080x1920 size: N/A modes: N/A
  API: EGL v: 1.5 hw: drv: amd radeonsi platforms: device: 0 drv: radeonsi
    device: 1 drv: swrast surfaceless: drv: radeonsi wayland: drv: radeonsi x11:
    drv: radeonsi inactive: gbm
  API: OpenGL v: 4.6 compat-v: 4.5 vendor: amd mesa v: 23.1.9-manjaro1.1
    glx-v: 1.4 direct-render: yes renderer: AMD Radeon RX 6650 XT (navi23 LLVM
    16.0.6 DRM 3.54 6.6.7-1-MANJARO) device-ID: 1002:73ef display-ID: :1.0
  API: Vulkan v: 1.3.269 layers: 4 surfaces: xcb,xlib,wayland device: 0
    type: discrete-gpu hw: amd driver: mesa radv device-ID: 1002:73ef
Audio:
  Device-1: AMD Navi 21/23 HDMI/DP Audio driver: snd_hda_intel v: kernel pcie:
    speed: 16 GT/s lanes: 16 bus-ID: 08:00.1 chip-ID: 1002:ab28 class-ID: 0403
  Device-2: AMD Starship/Matisse HD Audio vendor: Gigabyte
    driver: snd_hda_intel v: kernel pcie: speed: 16 GT/s lanes: 16
    bus-ID: 0a:00.4 chip-ID: 1022:1487 class-ID: 0403
  Device-3: Owon USB CAMERA driver: snd-usb-audio,uvcvideo type: USB
    rev: 2.0 speed: 480 Mb/s lanes: 1 bus-ID: 1-2:2 chip-ID: 5345:336b
    class-ID: 0102 serial: USB CAMERA
  Device-4: Elgato Systems GmbH Cam Link 4K
    driver: hid-generic,snd-usb-audio,usbhid,uvcvideo type: USB rev: 3.0
    speed: 5 Gb/s lanes: 1 bus-ID: 4-4:2 chip-ID: 0fd9:0066 class-ID: 0102
    serial: 0006419AB1000
  API: ALSA v: k6.6.7-1-MANJARO status: kernel-api with: aoss
    type: oss-emulator
  Server-1: PipeWire v: 1.0.0 status: active with: 1: pipewire-pulse
    status: active 2: pipewire-media-session status: active 3: pipewire-alsa
    type: plugin 4: pw-jack type: plugin
Network:
  Device-1: Intel Wi-Fi 6 AX200 driver: iwlwifi v: kernel pcie: speed: 5 GT/s
    lanes: 1 bus-ID: 04:00.0 chip-ID: 8086:2723 class-ID: 0280
  IF: wlp4s0 state: up mac: 4c:44:5b:86:fd:71
  Device-2: Realtek RTL8111/8168/8411 PCI Express Gigabit Ethernet
    vendor: Gigabyte driver: r8169 v: kernel pcie: speed: 2.5 GT/s lanes: 1
    port: f000 bus-ID: 05:00.0 chip-ID: 10ec:8168 class-ID: 0200
  IF: enp5s0 state: down mac: d8:5e:d3:32:cd:96
Bluetooth:
  Device-1: Intel AX200 Bluetooth driver: btusb v: 0.8 type: USB rev: 2.0
    speed: 12 Mb/s lanes: 1 bus-ID: 1-7:3 chip-ID: 8087:0029 class-ID: e001
  Report: btmgmt ID: hci0 rfk-id: 0 state: down bt-service: enabled,running
    rfk-block: hardware: no software: no address: 4C:44:5B:86:FD:75 bt-v: 5.2
    lmp-v: 11
Drives:
  Local Storage: total: 4.55 TiB used: 1.69 TiB (37.1%)
  ID-1: /dev/nvme0n1 vendor: Crucial model: CT1000P2SSD8 size: 931.51 GiB
    speed: 31.6 Gb/s lanes: 4 tech: SSD serial: 2225E63DF9BA fw-rev: P2CR048
    temp: 34.9 C scheme: GPT
  ID-2: /dev/sda vendor: Seagate model: ST4000DM004-2U9104 size: 3.64 TiB
    speed: 6.0 Gb/s tech: HDD rpm: 5400 serial: WW60G8YF fw-rev: 0001
    scheme: GPT
Partition:
  ID-1: / size: 915.53 GiB used: 799.91 GiB (87.4%) fs: ext4
    dev: /dev/nvme0n1p2
  ID-2: /boot/efi size: 299.4 MiB used: 312 KiB (0.1%) fs: vfat
    dev: /dev/nvme0n1p1
Swap:
  ID-1: swap-1 type: file size: 512 MiB used: 0 KiB (0.0%) priority: -2
    file: /swapfile
Sensors:
  System Temperatures: cpu: 72.9 C mobo: 25.0 C gpu: amdgpu temp: 50.0 C
    mem: 48.0 C
  Fan Speeds (rpm): N/A gpu: amdgpu fan: 0
Info:
  Processes: 460 Uptime: 5m wakeups: 0 Memory: total: 32 GiB
  available: 31.25 GiB used: 4.21 GiB (13.5%) Init: systemd v: 254
  default: graphical Compilers: gcc: 13.2.1 clang: 16.0.6 Packages: 1791
  pm: pacman pkgs: 1771 pm: flatpak pkgs: 20 Shell: Bash v: 5.2.21
  running-in: konsole inxi: 3.3.31

You’ll notice that the cores/CPUs are numbered cpu 0 to cpu 23, not 1-23 as humans count. So there are 24 cores/CPUs there.

Please provide the output of:

cat /sys/devices/system/cpu/cpu0/cpufreq/scaling_governor

…and:

systemctl status cpupower.service

…and:

sudo cpupower -c all frequency-info | grep -i "available cpufreq governors\|hardware limit\|current policy"

Edit:

Your output is quite significantly different than mine:

[...]
CPU:
Info: model: Intel Core i7-8700 socket: LGA1151 (U3E1) note: check bits: 64
type: MT MCP arch: Coffee Lake gen: core 8 level: v3 note: check built: 2018
process: Intel 14nm family: 6 model-id: 0x9E (158) stepping: 0xA (10)
microcode: 0xF4
Topology: cpus: 1x cores: 6 tpc: 2 threads: 12 smt: enabled cache:
L1: 384 KiB desc: d-6x32 KiB; i-6x32 KiB L2: 1.5 MiB desc: 6x256 KiB
L3: 12 MiB desc: 1x12 MiB
Speed (MHz): avg: 800 min/max: 800/4600 base/boost: 4300/8300 scaling:
driver: intel_pstate governor: powersave volts: 1.1 V ext-clock: 100 MHz
cores: 1: 800 2: 800 3: 800 4: 800 5: 800 6: 800 7: 800 8: 800 9: 800
10: 800 11: 800 12: 800 bogomips: 76831
Flags: 3dnowprefetch abm acpi adx aes aperfmperf apic arat
arch_capabilities arch_perfmon art avx avx2 bmi1 bmi2 bts clflush
clflushopt cmov constant_tsc cpuid cpuid_fault cx16 cx8 de ds_cpl dtes64
dtherm dts ept ept_ad erms est f16c flexpriority flush_l1d fma fpu
fsgsbase fxsr ht hwp hwp_act_window hwp_epp hwp_notify ibpb ibrs ida
intel_pt invpcid invpcid_single lahf_lm lm mca mce md_clear mmx monitor
movbe mpx msr mtrr nonstop_tsc nopl nx pae pat pbe pcid pclmulqdq pdcm
pdpe1gb pebs pge pln pni popcnt pse pse36 pti pts rdrand rdseed rdtscp
rep_good sdbg sep smap smep smx ss ssbd sse sse2 sse4_1 sse4_2 ssse3 stibp
syscall tm tm2 tpr_shadow tsc tsc_adjust tsc_deadline_timer vme vmx vnmi
vpid x2apic xgetbv1 xsave xsavec xsaveopt xsaves xtopology xtpr
Vulnerabilities:
Type: gather_data_sampling mitigation: Microcode
Type: itlb_multihit status: KVM: VMX disabled
Type: l1tf mitigation: PTE Inversion; VMX: conditional cache flushes, SMT
vulnerable
Type: mds mitigation: Clear CPU buffers; SMT vulnerable
Type: meltdown mitigation: PTI
Type: mmio_stale_data mitigation: Clear CPU buffers; SMT vulnerable
Type: retbleed mitigation: IBRS
Type: spec_rstack_overflow status: Not affected
Type: spec_store_bypass mitigation: Speculative Store Bypass disabled via
prctl
Type: spectre_v1 mitigation: usercopy/swapgs barriers and __user pointer
sanitization
Type: spectre_v2 mitigation: IBRS, IBPB: conditional, STIBP: conditional,
RSB filling, PBRSB-eIBRS: Not affected
Type: srbds mitigation: Microcode
Type: tsx_async_abort mitigation: TSX disabled
[...]

So do me a flavour, and please also provide the output of:

sudo inxi --admin --verbosity=7 --filter --no-host --width

My guess is that the water-cooling pump has failed. There’s no way that system should be hitting shutdown temps regardless of cpu governor.

To confirm or rule out this guess, boot into bios and observe temperature there.

3 Likes

try to boot with these option
“amd-pstate=passive”

and see return for
sudo cpupower frequency-info ( if possible )

i guess trouble on your watercooler or failed contact between cpu and cooling