Repeated system hangs/crashes - I suppose it is related to graphics

I’m facing repeated system hangs/crashes since some weeks. Therefore I think the problem came with an update … but which one?

journalctl --catalog --priority=3 --boot=-1

Jan 27 10:13:02 catweazle kernel: ACPI BIOS Error (bug): Failure creating named object [\_SB.SMIC], AE_ALREADY_EXISTS (20230628/dswload2-326)
Jan 27 10:13:02 catweazle kernel: ACPI Error: AE_ALREADY_EXISTS, During name lookup/catalog (20230628/psobject-220)
Jan 27 10:13:02 catweazle kernel: ACPI BIOS Error (bug): Failure creating named object [\_SB.SMIB], AE_ALREADY_EXISTS (20230628/dsfield-637)
Jan 27 10:14:50 catweazle lightdm[1383]: gkr-pam: unable to locate daemon control file
Jan 27 10:14:52 catweazle pulseaudio[1514]: GetManagedObjects() failed: org.freedesktop.DBus.Error.NameHasNoOwner: Could not activate remote peer 'org.bluez': unit failed
Jan 27 10:48:41 catweazle kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring comp_1.2.0 timeout, signaled seq=18133, emitted seq=18135
Jan 27 10:48:41 catweazle kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process Xorg pid 1265 thread Xorg:cs0 pid 1302
Jan 27 10:49:47 catweazle kernel: INFO: task kworker/4:0H:34 blocked for more than 122 seconds.
Jan 27 10:49:47 catweazle kernel:       Tainted: G           OE      6.6.74-1-MANJARO #1
Jan 27 10:49:47 catweazle kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Jan 27 10:49:47 catweazle kernel: INFO: task kworker/3:0H:52 blocked for more than 122 seconds.
Jan 27 10:49:47 catweazle kernel:       Tainted: G           OE      6.6.74-1-MANJARO #1
Jan 27 10:49:47 catweazle kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Jan 27 10:49:47 catweazle kernel: INFO: task kworker/3:1H:144 blocked for more than 122 seconds.
Jan 27 10:49:47 catweazle kernel:       Tainted: G           OE      6.6.74-1-MANJARO #1
Jan 27 10:49:47 catweazle kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Jan 27 10:49:47 catweazle kernel: INFO: task kworker/4:2H:1398 blocked for more than 122 seconds.
Jan 27 10:49:47 catweazle kernel:       Tainted: G           OE      6.6.74-1-MANJARO #1
Jan 27 10:49:47 catweazle kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Jan 27 10:49:47 catweazle kernel: INFO: task kworker/0:3H:2452 blocked for more than 122 seconds.
Jan 27 10:49:47 catweazle kernel:       Tainted: G           OE      6.6.74-1-MANJARO #1
Jan 27 10:49:47 catweazle kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Jan 27 10:49:47 catweazle kernel: INFO: task kworker/3:2H:3842 blocked for more than 122 seconds.
Jan 27 10:49:47 catweazle kernel:       Tainted: G           OE      6.6.74-1-MANJARO #1
Jan 27 10:49:47 catweazle kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Jan 27 10:49:47 catweazle kernel: INFO: task kworker/3:3H:3843 blocked for more than 122 seconds.
Jan 27 10:49:47 catweazle kernel:       Tainted: G           OE      6.6.74-1-MANJARO #1
Jan 27 10:49:47 catweazle kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Jan 27 10:51:50 catweazle kernel: INFO: task kworker/4:0H:34 blocked for more than 245 seconds.
Jan 27 10:51:50 catweazle kernel:       Tainted: G           OE      6.6.74-1-MANJARO #1
Jan 27 10:51:50 catweazle kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Jan 27 10:51:50 catweazle kernel: INFO: task kworker/3:0H:52 blocked for more than 245 seconds.
Jan 27 10:51:50 catweazle kernel:       Tainted: G           OE      6.6.74-1-MANJARO #1
Jan 27 10:51:50 catweazle kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Jan 27 10:51:50 catweazle kernel: INFO: task kworker/3:1H:144 blocked for more than 245 seconds.
Jan 27 10:51:50 catweazle kernel:       Tainted: G           OE      6.6.74-1-MANJARO #1
Jan 27 10:51:50 catweazle kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.

Hello and welcome,

A good starting point is usually system information.

Please enter into terminal the following command;

inxi -Farz

and share the output here.

Also attached is a guide including tips on how to format code and so on;

1 Like
System:
  Kernel: 6.6.74-1-MANJARO arch: x86_64 bits: 64 compiler: gcc v: 14.2.1
    clocksource: tsc avail: hpet,acpi_pm
    parameters: BOOT_IMAGE=/vmlinuz-6.6-x86_64
    root=UUID=98cd14c3-9281-4af3-a2f2-a7cfb522df39 rw quiet
    resume=UUID=5332b1d7-41ee-48b3-8c91-02d1f5d0a012 udev.log_priority=3
  Desktop: LXDE v: 0.10.1 tk: Gtk v: N/A wm: Openbox v: 3.6.1 with: lxpanel
    tools: light-locker avail: lxlock vt: 7 dm: LightDM v: 1.32.0
    Distro: Manjaro base: Arch Linux
Machine:
  Type: Desktop System: Gigabyte product: B450 AORUS PRO v: N/A
    serial: <superuser required>
  Mobo: Gigabyte model: B450 AORUS PRO-CF v: x.x
    serial: <superuser required> uuid: <superuser required>
    UEFI: American Megatrends v: F2 date: 08/08/2018
CPU:
  Info: model: AMD Ryzen 5 2400G with Radeon Vega Graphics bits: 64
    type: MT MCP arch: Zen level: v3 note: check built: 2017-19 process: GF 14nm
    family: 0x17 (23) model-id: 0x11 (17) stepping: 0 microcode: 0x810100B
  Topology: cpus: 1x dies: 1 clusters: 1 cores: 4 threads: 8 tpc: 2
    smt: enabled cache: L1: 384 KiB desc: d-4x32 KiB; i-4x64 KiB L2: 2 MiB
    desc: 4x512 KiB L3: 4 MiB desc: 1x4 MiB
  Speed (MHz): avg: 1600 min/max: 1600/3600 boost: enabled scaling:
    driver: acpi-cpufreq governor: schedutil cores: 1: 1600 2: 1600 3: 1600
    4: 1600 5: 1600 6: 1600 7: 1600 8: 1600 bogomips: 57512
  Flags: avx avx2 ht lm nx pae sse sse2 sse3 sse4_1 sse4_2 sse4a ssse3 svm
  Vulnerabilities:
  Type: gather_data_sampling status: Not affected
  Type: itlb_multihit status: Not affected
  Type: l1tf status: Not affected
  Type: mds status: Not affected
  Type: meltdown status: Not affected
  Type: mmio_stale_data status: Not affected
  Type: reg_file_data_sampling status: Not affected
  Type: retbleed mitigation: untrained return thunk; SMT vulnerable
  Type: spec_rstack_overflow mitigation: Safe RET
  Type: spec_store_bypass mitigation: Speculative Store Bypass disabled via
    prctl
  Type: spectre_v1 mitigation: usercopy/swapgs barriers and __user pointer
    sanitization
  Type: spectre_v2 mitigation: Retpolines; IBPB: conditional; STIBP:
    disabled; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected
  Type: srbds status: Not affected
  Type: tsx_async_abort status: Not affected
Graphics:
  Device-1: Advanced Micro Devices [AMD/ATI] Raven Ridge [Radeon Vega Series
    / Radeon Mobile Series] vendor: Gigabyte driver: amdgpu v: kernel
    arch: GCN-5 code: Vega process: GF 14nm built: 2017-20 pcie: gen: 3
    speed: 8 GT/s lanes: 16 ports: active: DVI-D-1 empty: HDMI-A-1
    bus-ID: 07:00.0 chip-ID: 1002:15dd class-ID: 0300 temp: 37.0 C
  Display: x11 server: X.Org v: 21.1.15 driver: X: loaded: amdgpu
    unloaded: modesetting alternate: fbdev,vesa dri: radeonsi gpu: amdgpu
    display-ID: :0 screens: 1
  Screen-1: 0 s-res: 1680x1050 s-dpi: 96 s-size: 444x277mm (17.48x10.91")
    s-diag: 523mm (20.6")
  Monitor-1: DVI-D-1 mapped: DVI-D-0 model: Fujitsu Siemens B22W-7 LED
    serial: <filter> built: 2015 res: mode: 1680x1050 hz: 60 scale: 100% (1)
    dpi: 90 gamma: 1.2 size: 474x296mm (18.66x11.65") diag: 559mm (22")
    ratio: 16:10 modes: max: 1680x1050 min: 720x400
  API: EGL v: 1.5 hw: drv: amd radeonsi platforms: device: 0 drv: radeonsi
    device: 1 drv: swrast gbm: drv: kms_swrast surfaceless: drv: radeonsi x11:
    drv: radeonsi inactive: wayland
  API: OpenGL v: 4.6 compat-v: 4.5 vendor: amd mesa v: 24.3.4-arch1.1
    glx-v: 1.4 direct-render: yes renderer: AMD Radeon Vega 11 Graphics
    (radeonsi raven LLVM 19.1.7 DRM 3.54 6.6.74-1-MANJARO)
    device-ID: 1002:15dd memory: 1000 MiB unified: no
  Info: Tools: api: eglinfo,glxinfo de: xfce4-display-settings
    x11: xdpyinfo, xprop, xrandr
Audio:
  Device-1: Advanced Micro Devices [AMD/ATI] Raven/Raven2/Fenghuang HDMI/DP
    Audio driver: snd_hda_intel v: kernel pcie: gen: 3 speed: 8 GT/s lanes: 16
    bus-ID: 07:00.1 chip-ID: 1002:15de class-ID: 0403
  Device-2: Advanced Micro Devices [AMD] Family 17h/19h/1ah HD Audio
    vendor: Gigabyte driver: snd_hda_intel v: kernel pcie: gen: 3 speed: 8 GT/s
    lanes: 16 bus-ID: 07:00.6 chip-ID: 1022:15e3 class-ID: 0403
  API: ALSA v: k6.6.74-1-MANJARO status: kernel-api with: aoss
    type: oss-emulator tools: alsactl,alsamixer,amixer
  Server-1: JACK v: 1.9.22 status: off tools: N/A
  Server-2: PipeWire v: 1.2.7 status: off tools: pw-cli
  Server-3: PulseAudio v: 17.0-43-g3e2bb status: active with:
    1: pulseaudio-alsa type: plugin 2: pulseaudio-jack type: module
    tools: pacat,pactl,pavucontrol
Network:
  Device-1: Intel I211 Gigabit Network vendor: Gigabyte driver: igb v: kernel
    pcie: gen: 1 speed: 2.5 GT/s lanes: 1 port: f000 bus-ID: 03:00.0
    chip-ID: 8086:1539 class-ID: 0200
  IF: enp3s0 state: up speed: 1000 Mbps duplex: full mac: <filter>
  Device-2: ASIX AX88179 Gigabit Ethernet driver: ax88179_178a type: USB
    rev: 3.0 speed: 5 Gb/s lanes: 1 mode: 3.2 gen-1x1 bus-ID: 4-3.4:3
    chip-ID: 0b95:1790 class-ID: ff00 serial: <filter>
  IF: usb-lan state: up speed: 1000 Mbps duplex: full mac: <filter>
  Info: services: sshd, systemd-networkd, systemd-timesyncd
Drives:
  Local Storage: total: 2.65 TiB used: 1.22 TiB (46.1%)
  SMART Message: Required tool smartctl not installed. Check --recommends
  ID-1: /dev/nvme0n1 maj-min: 259:0 vendor: Crucial model: CT1000P2SSD8
    size: 931.51 GiB block-size: physical: 512 B logical: 512 B speed: 31.6 Gb/s
    lanes: 4 tech: SSD serial: <filter> fw-rev: P2CR031 temp: 32.9 C
  ID-2: /dev/sda maj-min: 8:0 vendor: Western Digital
    model: WD15EURS-63S48Y0 size: 1.36 TiB block-size: physical: 4096 B
    logical: 512 B speed: 3.0 Gb/s tech: N/A serial: <filter> fw-rev: AB51
    scheme: MBR
  ID-3: /dev/sdb maj-min: 8:16 vendor: Samsung model: SSD 850 EVO 250GB
    size: 232.89 GiB block-size: physical: 512 B logical: 512 B speed: 6.0 Gb/s
    tech: SSD serial: <filter> fw-rev: 1B6Q scheme: MBR
  ID-4: /dev/sdc maj-min: 8:32 vendor: Western Digital
    model: WD1600AAJS-00PSA0 size: 149.05 GiB block-size: physical: 512 B
    logical: 512 B speed: 3.0 Gb/s tech: N/A serial: <filter> fw-rev: 6H05
  ID-5: /dev/sdd maj-min: 8:48 vendor: TrekStor model: USB-Stick CS
    size: 3.73 GiB block-size: physical: 512 B logical: 512 B type: USB rev: 2.0
    spd: 480 Mb/s lanes: 1 mode: 2.0 tech: N/A serial: <filter> fw-rev: 1100
    scheme: MBR
Partition:
  ID-1: / raw-size: 50 GiB size: 48.91 GiB (97.83%) used: 40.58 GiB (83.0%)
    fs: ext4 dev: /dev/sdb6 maj-min: 8:22
  ID-2: /boot raw-size: 10 GiB size: 9.75 GiB (97.45%)
    used: 157.2 MiB (1.6%) fs: ext4 dev: /dev/sdb5 maj-min: 8:21
  ID-3: /boot/efi raw-size: 189 MiB size: 188.8 MiB (99.89%)
    used: 268 KiB (0.1%) fs: vfat dev: /dev/sdb1 maj-min: 8:17
Swap:
  Kernel: swappiness: 60 (default) cache-pressure: 100 (default) zswap: yes
    compressor: zstd max-pool: 20%
  ID-1: swap-1 type: partition size: 15.65 GiB used: 0 KiB (0.0%)
    priority: -2 dev: /dev/sdb10 maj-min: 8:26
Sensors:
  System Temperatures: cpu: 37.1 C mobo: N/A gpu: amdgpu temp: 37.0 C
  Fan Speeds (rpm): N/A
Repos:
  Packages: pm: pacman pkgs: 1727 libs: 478 tools: pamac,yay
  Active pacman repo servers in: /etc/pacman.d/mirrorlist
    1: https://ipng.mm.fcix.net/manjaro/testing/$repo/$arch
    2: https://ftp.gwdg.de/pub/linux/manjaro/testing/$repo/$arch
    3: https://mirror.alwyzon.net/manjaro/testing/$repo/$arch
    4: https://manjaro.kurdy.org/testing/$repo/$arch
    5: https://ftpmirror1.infania.net/mirror/manjaro/testing/$repo/$arch
    6: https://ftp.halifax.rwth-aachen.de/manjaro/testing/$repo/$arch
    7: https://mirror.ibakerserver.pt/manjaro/testing/$repo/$arch
    8: https://ftp.lysator.liu.se/pub/manjaro/testing/$repo/$arch
Info:
  Memory: total: 16 GiB note: est. available: 14.58 GiB used: 2.55 GiB (17.5%)
  Processes: 301 Power: uptime: 27m states: freeze,mem,disk suspend: deep
    avail: s2idle wakeups: 0 hibernate: platform avail: shutdown, reboot,
    suspend, test_resume image: 5.81 GiB services: upowerd Init: systemd
    v: 257 default: graphical tool: systemctl
  Compilers: clang: 19.1.7 gcc: 14.2.1 Shell: Bash v: 5.2.37
    running-in: lxterminal inxi: 3.3.37

If your Manjaro is located on this sdb6 partition then you’re running out of space (the system reserves 10% so you’re left with around 2-3GB free). Clean up the partition or increase the size of the partition.

1 Like

While this is a valid concern, and likely to contribute greatly to these crashes you mention, I notice an inconsistency in your inxi output:

If /dev/sdb is your Manjaro installation, you have / and /boot partitions as expected with an MBR partition scheme – however, you also have a /boot/efi mountpoint to /dev/sdb1 – this infers a UEFI booting system; which obviously it can’t be, when using an MBR partition scheme.

I can’t explain how Calamares would have created this scenario.
Perhaps this is also contributing.


I note that the BIOS has not been updated – ever, by the look of it. If the following link reflects your mainboard, you might consider attending to this whenever convenient.

According to your inxi output BIOS F2 (2018-08-08) is currently used - there have been fifteen (15) BIOS update releases since then. The latest BIOS release in F66d and was made available on 2024-09-02.

Looking at the descriptions of the BIOS updates you have missed, it would be advisable to update as soon as may be practical. While doing so will not necessarily solve your immediate issue, it will likely solve problems you were not even aware of.


That’s all I have to add at this point.

Regards.

1 Like

yes, this is the partition, where the OS resides. But I don’t think, 83% are a problem. I never have seen any “no space left on device” messages or similar. The system is running for years with this configuration. But you are right, enlarging the partition is certainly not a luxury.

I cannot remember how this waas set up, but the system is running since at least 4…5 years with this onfiguration

There are extremely conflicting opinions on this topic. I recently bought a new motherboard (because I want to replace this system) - the manual explicitly states “never install a BIOS update if the system is running smoothly”.
Anyway - I’ll give it a try!

I can.

It happens when you have some OS (usually Windows) installed in BIOS mode and you install Manjaro in Uefi mode. (I think it happened to me once, and that forced me to learn the difference)

Edit:

While I can’t say I agree with the advice:

…do you not think repeated hangs/crashes is the system not running smoothly?

2 Likes

Yes, that used to be a common stance; I suppose too many users bricked their machines due to general inattention to instructions given by the respective board manufacturers.

Then again, that was a time when seeing more than a proverbial handful of BIOS updates in the lifetime of a product was rather uncommon.

of course, this might be a problem.
But - as mentioned - this is the first time in 6 … 7 years.
Therefore it was not my first suspicion and I have lately seen more systems crashing - even new systems with very much free space on disk and the newest BIOS update installed. The symptoms are not exactly the same, but very similar.

But of course that’s just my opinion - let’s just keep looking for the problem here!
I have enlarged the root partition in the meantime - I’ll take care of the BIOS tomorrow.

df -h /dev/sdb6
Filesystem      Size  Used Avail Use% Mounted on
/dev/sdb6        77G   41G   33G  56% /

2 Likes

Good morning!
Just finished the first BIOS upgrade F2 ->F4 … and - as expected - faced some “problems”. The system won’t boot any longer using UEFI, while BIOS boot works.
Maybe this comes from the odd partitioning table?
I tried to repair this using a Manjaro live-Stick. Re-installed grub (efi and BIOS) … but the efi variant still keeps on complaining ‘grub_is_lockdown’ not found …

Anyhow - BIOS boot is not optimal, but it works.

I’ll do further updates, but this will take a while.
I’m afraid there will rise more problems and … I’m not sure how to deal with BIOS versions F40 and above. The Gigabyte website states:

Note:
1. If you are using Q-Flash Utility to update BIOS, make sure you have updated BIOS to F32 before F40
2. Before update BIOS to F40, you have to install EC FW Update Tool (B19.0517.1 or later version) to avoid 4DIMM DDR incompatibility on 3rd Gen AMD Ryzen™ CPU 

… but I don’t have any Windows installations to run this Tool.

I opened a call at the Gigabyte support to clarify this

Re. the “Q-Flash” program: You might be able to run it via Hirens Boot CD.

Just quickly, on the topic of running out of space;

I would be remiss if I didn’t address this comment.

The space can likely become filled as caches and perhaps a swap file increase in size during use.

What can often happen is that there is no message to warn you of low space; the system simply hangs or crashes instead.

That the crashes may be infrequent, may depend upon how much load there is at any given time; how much available resources are used.

Another consideration is the possibility of overheating.

That is all.

1 Like

I pretty much have that same morherboard and as @BG405 posted you need that CD to upgrade EC FW Update Tool.It is not that hard to do and after updating it using that CD I could update the bios to the latest stable verision of the bios which is now at F66.Yours maybe different but after every update of the bios my settings went back to default.

2 Likes

This is to be expected, as it is a new BIOS. Although those default settings are adequate for many people, if there are any specific customisations one is used to, then they would of course need to be tweaked again.

If multibooting Manjaro and Windows, for example, Secure Boot would again need to be disabled, and likely the Fast Boot BIOS option also, if it exists.

@straycat mentioned the latest stable BIOS – this is also an important distinction – while I mentioned earlier that F66d was the latest available, it is also a beta BIOS. The latest stable BIOS is indeed what you should aim for; currently F65. Typically, one should only use a beta BIOS if it addresses a specific problem experienced.

Edit:- Noted Stable BIOS version currently available.

2 Likes

I really hate to contradict the experts, but I can’t find an F66 version. There is F65, F66b and F66d - but no F66 (just double-checked this)
But that’s absolutely no problem!

I have now reached F32. Before proceeding with F40 and the thing with the firmware updater, I need to read through this at a quiet moment.

I downloaded the boot disk from Hiren - thanks for the tip @BG405 ! Otherwise I would have tried to build myself some kind of DOS … FreeDOS or similar.

The system is currently stable - I’ll keep an eye on it.
But the thing with the UEFI boot is really strange. Sometimes it works - but most of the time I get this weird error message from grub ‘grub_is_lockdown’ not found … whatever this means …

Thank you so far for all your patience and advices!

You’re correct; F66 will be the next stable BIOS, I imagine. The point of course was to avoid a beta BIOS in favour of the stable – in your case, that appears to be F65.

1 Like

That may be my mistake I have the B450 AORUS ELITE and the latest stable bios is is F66. :slightly_smiling_face:

Maybe. I don’t recall whether I was influenced by your post, or simply made a typo; the focus wasn’t on the version in any case, it was more about avoiding the beta’s, so no harm done. :smile_cat:

1 Like