I have the following problem with my graphics card (Radeon 5500XT):
Before configuring anything manually, the fan speed was handled by the card, but did not exceed 1000/min, which lead to overheat problems (sure took me a while to figure out why games kept knocking out the machine, sometimes, but sometimes not…).
While trying to find the issue, I ran sensors-detect
to find the temperature readings for the GPU, and it found the following:
# sensors
k10temp-pci-00c3
Adapter: PCI adapter
Tctl: +44.0°C
Tdie: +44.0°C
Tccd1: +38.2°C
Tccd2: +41.8°C
amdgpu-pci-0a00
Adapter: PCI adapter
vddgfx: 6.00 mV
fan1: 0 RPM (min = 0 RPM, max = 3400 RPM)
edge: +37.0°C (crit = +110.0°C, hyst = -273.1°C)
(emerg = +115.0°C)
junction: +37.0°C (crit = +105.0°C, hyst = -273.1°C)
(emerg = +110.0°C)
mem: +0.0°C (crit = +105.0°C, hyst = -273.1°C)
(emerg = +110.0°C)
power1: 3.00 W (cap = 125.00 W)
So far so good. I have the case opened, and the fans are indeed not spinning.
When I then run pwmconfig
, it starts by spinning the fans up to 100%, (very audibly),
but the fan speed sensor stays at 0, and pwmconfig
aborts. The fans then keep spinning at 100% until I reboot the machine.
I thought that this might be an issue with the fan speed sensor (either directly, or due to it not being supported in some way), but when I boot from a live USB image (also Manjaro), the fan speed is detected and indicated without any issues, and the setup in pwmconfig
continues. Unfortunately, it then fails to auto-detect the pwm setting at which the fans stop spinning. That is: it manages to turn them on or off, but not to ramp in between the extremes, so it stays at 100% during the auto-detect phase, until I use ctrl+c
to abort (and then the fans stay at 100% until the next reboot…). On the live image, I can see how the fan speed ranges from 0 to 1000/min before running pwmcontrol
, depending on temperature, and goes up to 3400 afterwards.
In the course of pinpointing the issue (there was also something wrong with the RAM configuration, also causing crashes, which obfuscated a lot), I have swapped graphics cards a few times, and also recently replaced the motherboard on the same system. This means that the sensor configuration has changed a few times, and there may be some leftover data from the removed hardware causing issues.
…and just to make matters more interesting, I also tried setting this up on a live openSUSE USB, and it also found the fan speed but claimed there was no PWM control device available – while still being able to switch fan speed up to 100%.
The question(s):
- Where do I need to poke to get the installed system to find the correct fan sped sensor readings (must be possible since live Manjaro can do it)
- If
pwmcontrol
can change the fan speed from 0 to 100%, does that not imply that it should also be able to do anything in between? If yes, how can I figure out what’s holding it back?
system info:
Not sure which inxi
outputs may be relevant here, but for a starter, here’s some of them:
> inxi -F
System:
Host: monster Kernel: 5.13.19-2-MANJARO arch: x86_64 bits: 64
Desktop: KDE Plasma v: 5.26.2 Distro: Manjaro Linux
Machine:
Type: Desktop Mobo: ASUSTeK model: PRIME X570-P v: Rev X.0x
serial: 210484331300543 UEFI: American Megatrends v: 3603 date: 03/20/2021
CPU:
Info: 12-core model: AMD Ryzen 9 5900X bits: 64 type: MT MCP cache:
L2: 6 MiB
Speed (MHz): avg: 2262 min/max: 2200/4950 cores: 1: 3700 2: 2200 3: 2200
4: 2200 5: 2200 6: 2200 7: 2200 8: 2200 9: 2200 10: 2200 11: 2200 12: 2200
13: 2200 14: 2200 15: 2200 16: 2200 17: 2200 18: 2200 19: 2200 20: 2200
21: 2200 22: 2200 23: 2200 24: 2200
Graphics:
Device-1: AMD Navi 14 [Radeon RX 5500/5500M / Pro 5500M] driver: amdgpu
v: kernel
Display: x11 server: X.Org v: 21.1.4 driver: X: loaded: amdgpu
unloaded: modesetting,radeon dri: radeonsi gpu: amdgpu
resolution: 1920x1200~60Hz
API: OpenGL v: 4.6 Mesa 22.2.1 renderer: AMD Radeon RX 5500 XT (navi14
LLVM 14.0.6 DRM 3.41 5.13.19-2-MANJARO)
Audio: [...]
Network: [...]
Drives: [...]
Partition: [...]
Swap: [...]
Sensors:
System Temperatures: cpu: 54.8 C mobo: N/A gpu: amdgpu temp: 35.0 C
Fan Speeds (RPM): N/A gpu: amdgpu fan: 0
Info:
Processes: 506 Uptime: 39m Memory: 62.71 GiB used: 4.71 GiB (7.5%)
Shell: Bash inxi: 3.3.23