Hard system crash/reboot with Nvidia GPU

Hi all,

Today I installed a Nvidia 3090 that just came in, into a fairly new system. I installed the nonfree drivers and added nvidia_drm.modeset=1 nvidia_drm.fbdev=1 to the boot parameters and nvidia nvidia_modeset nvidia_uvm nvidia_drm to the kernel modules.

Everything seemed to be going well, but then I was downloading a game through Steam and the system just rebooted out of the blue. No panic message or nothing, and after rebooting and checking journalctl I didn’t see anything particularly concerning. It happened again an hour or so later, also while downloading a game. Now, it happens consistently when launching a particular game (The Long Dark). I am able to launch and play other games without crashing, so I am not sure what it could be. The sudden reboot without warning almost seems like a power supply issue, but the system is never loaded when it happens. I suspect it’s something to do with the Nvidia drivers but I have no way to verify that.

I’m running KDE Plasma on Wayland, and I have tried falling back to X11, with the same results. I’m also running the 6.13.0-rc2-1-MANJARO kernel, as the 6.12 kernel doesn’t support my mobo’s ethernet chip.

I’ve uploaded my journalctl logs here, and I will post anything else that may help people help me.

Any pointers would be much appreciated.

Some system information will be beneficial for those who wish to help (see below).

You say the crashing only happens with one game – are you certain it’s compatible with your hardware? Perhaps it needs more memory available; I might guess there is also no swap configured.

These are only vague guesses, as we don’t as yet know much about your system, except that you have a new Nvidia card.

Kernel 6.13.0-1 is currently in both Unstable and Testing branches, that is, if switching branches is a consideration. Whether that will go some way toward solving your issue, I can’t say.

Regards.


Asking for help with running Games

Before asking for help about running Games on Manjaro, please see the pinned topic in the Gaming category:

As the article suggests:

Welcome to the Manjaro community

As a new or infrequent forum user, please take some time to familiarise yourself with Forum requirements; in particular, the many ways to use the forum to your benefit:


Required Reading:

Resources:


Update Announcements:

The Update Announcements contain update related information and a Known Issues and Solutions section that should generally be checked before posting a request for support.


System Information:

Output of this command (formatted according to forum requirements) may be useful for those wishing to help:

inxi --admin --verbosity=8 --filter --no-host --width

Be prepared to provide more information and outputs from other commands when asked.


Regards.

1 Like

Hey, thanks for the quick reply.

My system specs are as follows:
128GB DDR5 (passed full memtest86 when I installed/configured it)
Ryzen 9 7950X
Gigabyte X870 AORUS WIFI7
850W PSU
Nvidia driver version: 550.144.03

I don’t have any swap configured, however I’m pretty sure that’s not an issue with the amount of memory I have. Also, since I originally posted this, I tried the 6.12 kernel, and got the same results. I’d also like to point out that I was got these crashes/reboots a couple of times before trying to launch this particular game, they just became consistent/reproducible when launching the game.

System Information:

Thanks for making an attempt – that might be useful for someone wishing to buy a computer, but for Support purposes, not so much.

The best way to provide system information that might be useful is to ask your system directly, using the inxi command (as already given above):

inxi --admin --verbosity=8 --filter --no-host --width

Regards.

1 Like

Sorry about that. Here we go:

System:
  Kernel: 6.13.0-rc2-1-MANJARO arch: x86_64 bits: 64 compiler: gcc v: 14.2.1
    clocksource: tsc avail: hpet,acpi_pm
    parameters: BOOT_IMAGE=/boot/vmlinuz-6.13-x86_64
    root=UUID=630ed032-805c-4676-ac45-b21cb10626b1 rw nvidia_drm.modeset=1
    nvidia_drm.fbdev=1 udev.log_priority=3
  Desktop: KDE Plasma v: 6.2.4 tk: Qt v: N/A wm: kwin_wayland dm: SDDM
    Distro: Manjaro base: Arch Linux
Machine:
  Type: Desktop Mobo: Gigabyte model: X870 AORUS ELITE WIFI7 v: x.x serial: N/A
    uuid: 03ff0210-04e0-053b-9706-ca0700080009 UEFI: American Megatrends LLC.
    v: F3j date: 12/19/2024
Battery:
  Message: No system battery data found. Is one present?
Memory:
  System RAM: total: 128 GiB available: 124.91 GiB used: 4.33 GiB (3.5%)
  Array-1: capacity: 128 GiB slots: 4 modules: 4 EC: None
    max-module-size: 32 GiB note: est.
  Device-1: Channel-A DIMM 0 type: DDR5 detail: synchronous unbuffered
    (unregistered) size: 32 GiB speed: 5200 MT/s volts: curr: 1.1 min: 1.1
    max: 1.1 width (bits): data: 64 total: 64 manufacturer: Kingston
    part-no: KF552C40-32 serial: <filter>
  Device-2: Channel-A DIMM 1 type: DDR5 detail: synchronous unbuffered
    (unregistered) size: 32 GiB speed: 5200 MT/s volts: curr: 1.1 min: 1.1
    max: 1.1 width (bits): data: 64 total: 64 manufacturer: Kingston
    part-no: KF552C40-32 serial: <filter>
  Device-3: Channel-B DIMM 0 type: DDR5 detail: synchronous unbuffered
    (unregistered) size: 32 GiB speed: 5200 MT/s volts: curr: 1.1 min: 1.1
    max: 1.1 width (bits): data: 64 total: 64 manufacturer: Kingston
    part-no: KF552C40-32 serial: <filter>
  Device-4: Channel-B DIMM 1 type: DDR5 detail: synchronous unbuffered
    (unregistered) size: 32 GiB speed: 5200 MT/s volts: curr: 1.1 min: 1.1
    max: 1.1 width (bits): data: 64 total: 64 manufacturer: Kingston
    part-no: KF552C40-32 serial: <filter>
PCI Slots:
  Slot: 1 type: PCIe gen: 1 status: in use length: short volts: 3.3
    bus-ID: 00:01.1 children: 1: 01:00.0 class-ID: 0300 type: display 2: 01:00.1
    class-ID: 0403 type: audio
  Slot: N/A type: N/A status: available info: M.2, J3502 length: short
    volts: 3.3 bus-ID: 00:1f.7
  Slot: 3 type: PCIe gen: 3 status: in use length: short volts: 3.3
    bus-ID: 00:02.2 children: 1: 0e:00.0 class-ID: 0604 type: bridge children:
    1: 0f:00.0 class-ID: 0604 type: bridge 2: 0f:01.0 class-ID: 0604
    type: bridge 3: 0f:02.0 class-ID: 0604 type: bridge children: 1: 70:00.0
    class-ID: 0c03 type: serialbus 4: 0f:03.0 class-ID: 0604 type: bridge
    children: 1: 71:00.0 class-ID: 0c03 type: serialbus
CPU:
  Info: model: AMD Ryzen 9 7950X socket: AM5 bits: 64 type: MT MCP arch: Zen 4
    gen: 4 level: v4 note: check built: 2022+ process: TSMC n5 (5nm)
    family: 0x19 (25) model-id: 0x61 (97) stepping: 2 microcode: 0xA601209
  Topology: cpus: 1x dies: 2 clusters: 2x1 cores: 16 threads: 32 tpc: 2
    smt: enabled cache: L1: 1024 KiB desc: d-16x32 KiB; i-16x32 KiB L2: 16 MiB
    desc: 16x1024 KiB L3: 64 MiB desc: 2x32 MiB
  Speed (MHz): avg: 3010 min/max: 545/5881 boost: enabled
    base/boost: 4500/5850 scaling: driver: amd-pstate-epp governor: performance
    volts: 1.3 V ext-clock: 100 MHz cores: 1: 3010 2: 3010 3: 3010 4: 3010
    5: 3010 6: 3010 7: 3010 8: 3010 9: 3010 10: 3010 11: 3010 12: 3010 13: 3010
    14: 3010 15: 3010 16: 3010 17: 3010 18: 3010 19: 3010 20: 3010 21: 3010
    22: 3010 23: 3010 24: 3010 25: 3010 26: 3010 27: 3010 28: 3010 29: 3010
    30: 3010 31: 3010 32: 3010 bogomips: 287577
  Flags: 3dnowprefetch abm adx aes amd_lbr_pmc_freeze amd_lbr_v2 aperfmperf
    apic arat avic avx avx2 avx512_bf16 avx512_bitalg avx512_vbmi2 avx512_vnni
    avx512_vpopcntdq avx512bw avx512cd avx512dq avx512f avx512ifma avx512vbmi
    avx512vl bmi1 bmi2 bpext cat_l3 cdp_l3 clflush clflushopt clwb clzero cmov
    cmp_legacy constant_tsc cpb cppc cpuid cqm cqm_llc cqm_mbm_local
    cqm_mbm_total cqm_occup_llc cr8_legacy cx16 cx8 de decodeassists erms
    extapic extd_apicid f16c flush_l1d flushbyasid fma fpu fsgsbase fsrm fxsr
    fxsr_opt gfni ht hw_pstate ibpb ibrs ibrs_enhanced ibs invpcid irperf
    lahf_lm lbrv lm mba mca mce misalignsse mmx mmxext monitor movbe msr mtrr
    mwaitx nonstop_tsc nopl npt nrip_save nx ospke osvw overflow_recov pae pat
    pausefilter pclmulqdq pdpe1gb perfctr_core perfctr_llc perfctr_nb
    perfmon_v2 pfthreshold pge pku pni popcnt pse pse36 rapl rdpid rdpru
    rdrand rdseed rdt_a rdtscp rep_good sep sha_ni skinit smap smca smep ssbd
    sse sse2 sse4_1 sse4_2 sse4a ssse3 stibp succor svm svm_lock syscall tce
    topoext tsc tsc_scale umip user_shstk v_spec_ctrl vaes vgif vmcb_clean vme
    vmmcall vnmi vpclmulqdq wbnoinvd wdt x2avic xgetbv1 xsave xsavec
    xsaveerptr xsaveopt xsaves xtopology
  Vulnerabilities:
  Type: gather_data_sampling status: Not affected
  Type: itlb_multihit status: Not affected
  Type: l1tf status: Not affected
  Type: mds status: Not affected
  Type: meltdown status: Not affected
  Type: mmio_stale_data status: Not affected
  Type: reg_file_data_sampling status: Not affected
  Type: retbleed status: Not affected
  Type: spec_rstack_overflow mitigation: Safe RET
  Type: spec_store_bypass mitigation: Speculative Store Bypass disabled via
    prctl
  Type: spectre_v1 mitigation: usercopy/swapgs barriers and __user pointer
    sanitization
  Type: spectre_v2 mitigation: Enhanced / Automatic IBRS; IBPB: conditional;
    STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not
    affected
  Type: srbds status: Not affected
  Type: tsx_async_abort status: Not affected
Graphics:
  Device-1: NVIDIA GA102 [GeForce RTX 3090] vendor: eVga.com. driver: nvidia
    v: 550.144.03 alternate: nouveau,nvidia_drm non-free: 550/565.xx+
    status: current (as of 2025-01; EOL~2026-12-xx) arch: Ampere code: GAxxx
    process: TSMC n7 (7nm) built: 2020-2023 pcie: gen: 2 speed: 5 GT/s
    lanes: 16 link-max: gen: 4 speed: 16 GT/s ports: active: none off: DP-3
    empty: DP-1,DP-2,HDMI-A-1 bus-ID: 01:00.0 chip-ID: 10de:2204 class-ID: 0300
  Device-2: Advanced Micro Devices [AMD/ATI] Raphael vendor: Gigabyte
    driver: amdgpu v: kernel arch: RDNA-2 code: Navi-2x process: TSMC n7 (7nm)
    built: 2020-22 pcie: gen: 4 speed: 16 GT/s lanes: 16 ports: active: none
    empty: DP-4, DP-5, HDMI-A-2, HDMI-A-3, Writeback-1 bus-ID: 72:00.0
    chip-ID: 1002:164e class-ID: 0300 temp: 40.0 C
  Device-3: Logitech HD Webcam C615 driver: snd-usb-audio,uvcvideo type: USB
    rev: 2.0 speed: 480 Mb/s lanes: 1 mode: 2.0 bus-ID: 9-1.1:3
    chip-ID: 046d:082c class-ID: 0e02 serial: <filter>
  Display: unspecified server: X.Org v: 24.1.4 with: Xwayland v: 24.1.4
    compositor: kwin_wayland driver: X: loaded: amdgpu,nvidia
    unloaded: modesetting,nouveau alternate: fbdev,nv,vesa dri: radeonsi
    gpu: nvidia,nvidia-nvswitch display-ID: :1 screens: 1
  Screen-1: 0 s-res: 3072x1728 s-dpi: 96 s-size: 813x457mm (32.01x17.99")
    s-diag: 933mm (36.72")
  Monitor-1: DP-3 note: disabled model: Acer ET322QK serial: <filter>
    built: 2018 res: mode: 3072x1728 hz: 60 scale: 100% (1) dpi: 112 gamma: 1.2
    chroma: red: x: 0.686 y: 0.310 green: x: 0.259 y: 0.686 blue: x: 0.149
    y: 0.059 white: x: 0.314 y: 0.329 size: 698x393mm (27.48x15.47")
    diag: 801mm (31.5") ratio: 16:9 modes: 3840x2160, 2560x1440, 1920x1080,
    1680x1050, 1280x1024, 1440x900, 1280x960, 1280x800, 1280x720, 1024x768,
    800x600, 720x576, 720x480, 640x480
  EDID-Warnings: 1: parse_edid: unhandled CEA mode 93 2: parse_edid:
    unhandled CEA mode 94 3: parse_edid: unhandled CEA mode 95 4: parse_edid:
    unhandled CEA mode 96 5: parse_edid: unhandled CEA mode 97
  API: EGL v: 1.5 hw: drv: nvidia drv: amd radeonsi platforms: device: 0
    drv: nvidia device: 2 drv: radeonsi device: 3 drv: swrast gbm: drv: nvidia
    surfaceless: drv: nvidia x11: drv: zink inactive: wayland,device-1
  API: OpenGL v: 4.6.0 compat-v: 4.5 vendor: nvidia mesa v: 550.144.03
    glx-v: 1.4 direct-render: yes renderer: NVIDIA GeForce RTX 3090/PCIe/SSE2
    memory: 23.44 GiB
  API: Vulkan v: 1.4.303 layers: 1 device: 0 type: discrete-gpu
    name: NVIDIA GeForce RTX 3090 driver: N/A device-ID: 10de:2204
    surfaces: xcb,xlib device: 1 type: integrated-gpu name: AMD Radeon
    Graphics (RADV RAPHAEL_MENDOCINO) driver: N/A device-ID: 1002:164e
    surfaces: xcb,xlib
  Info: Tools: api: clinfo, eglinfo, glxinfo, vulkaninfo
    de: kscreen-console,kscreen-doctor gpu: nvidia-settings,nvidia-smi
    wl: wayland-info x11: xdpyinfo, xprop, xrandr
Audio:
  Device-1: NVIDIA GA102 High Definition Audio vendor: eVga.com.
    driver: snd_hda_intel v: kernel pcie: gen: 4 speed: 16 GT/s lanes: 16
    bus-ID: 01:00.1 chip-ID: 10de:1aef class-ID: 0403
  Device-2: Advanced Micro Devices [AMD/ATI] Rembrandt Radeon High
    Definition Audio driver: snd_hda_intel v: kernel pcie: gen: 4
    speed: 16 GT/s lanes: 16 bus-ID: 72:00.1 chip-ID: 1002:1640 class-ID: 0403
  Device-3: Advanced Micro Devices [AMD] Family 17h/19h/1ah HD Audio
    vendor: Gigabyte driver: snd_hda_intel v: kernel pcie: gen: 4 speed: 16 GT/s
    lanes: 16 bus-ID: 72:00.6 chip-ID: 1022:15e3 class-ID: 0403
  Device-4: Logitech HD Webcam C615 driver: snd-usb-audio,uvcvideo type: USB
    rev: 2.0 speed: 480 Mb/s lanes: 1 mode: 2.0 bus-ID: 9-1.1:3
    chip-ID: 046d:082c class-ID: 0e02 serial: <filter>
  API: ALSA v: k6.13.0-rc2-1-MANJARO status: kernel-api with: aoss
    type: oss-emulator tools: alsactl,alsamixer,amixer
  Server-1: JACK v: 1.9.22 status: off tools: N/A
  Server-2: PipeWire v: 1.2.7 status: n/a (root, process) with:
    1: pipewire-pulse status: active 2: wireplumber status: active
    3: pipewire-alsa type: plugin tools: pactl,pw-cat,pw-cli,wpctl
Network:
  Device-1: Realtek RTL8125 2.5GbE vendor: Gigabyte driver: r8169 v: kernel
    pcie: gen: 2 speed: 5 GT/s lanes: 1 port: e000 bus-ID: 05:00.0
    chip-ID: 10ec:8125 class-ID: 0200
  IF: enp5s0 state: up speed: 1000 Mbps duplex: full mac: <filter>
  IP v4: <filter> type: dynamic noprefixroute scope: global
    broadcast: <filter>
  IP v6: <filter> type: noprefixroute scope: link
  Device-2: MEDIATEK vendor: Foxconn driver: mt7925e v: kernel pcie: gen: 2
    speed: 5 GT/s lanes: 1 port: N/A bus-ID: 06:00.0 chip-ID: 14c3:7925
    class-ID: 0280
  IF: wlp6s0 state: down mac: <filter>
  IF-ID-1: docker0 state: down mac: <filter>
  IP v4: <filter> scope: global broadcast: <filter>
  IF-ID-2: virbr0 state: down mac: <filter>
  IP v4: <filter> scope: global broadcast: <filter>
  Info: services: NetworkManager, sshd, systemd-timesyncd, wpa_supplicant
  WAN IP: <filter>
Bluetooth:
  Device-1: Foxconn / Hon Hai Wireless_Device driver: btusb v: 0.8 type: USB
    rev: 2.1 speed: 480 Mb/s lanes: 1 mode: 2.0 bus-ID: 1-9:4 chip-ID: 0489:e124
    class-ID: e001 serial: <filter>
  Report: rfkill ID: hci0 rfk-id: 0 state: down bt-service: disabled
    rfk-block: hardware: no software: yes address: see --recommends
Logical:
  Message: No logical block device data found.
RAID:
  Message: No RAID data found.
Drives:
  Local Storage: total: 4.55 TiB used: 682.55 GiB (14.7%)
  ID-1: /dev/nvme0n1 maj-min: 259:0 vendor: Samsung model: SSD 980 1TB
    size: 931.51 GiB block-size: physical: 512 B logical: 512 B speed: 31.6 Gb/s
    lanes: 4 tech: SSD serial: <filter> fw-rev: 2B4QFXO7 temp: 43.9 C
    scheme: GPT
  SMART: yes health: PASSED on: 65d 14h cycles: 2,029
    read-units: 32,510,722 [16.6 TB] written-units: 28,179,425 [14.4 TB]
  ID-2: /dev/sda maj-min: 8:0 vendor: Seagate model: ST4000NM0033-9ZM170
    family: Constellation ES.3 size: 3.64 TiB block-size: physical: 512 B
    logical: 512 B sata: 3.0 speed: 6.0 Gb/s tech: HDD rpm: 7200
    serial: <filter> fw-rev: SN07 temp: 37 C scheme: GPT
  SMART: yes state: enabled health: PASSED on: 373d 1h cycles: 212 Pre-Fail:
    attribute: Spin_Retry_Count value: 100 worst: 100 threshold: 97
  Message: No optical or floppy data found.
Partition:
  ID-1: / raw-size: 931.22 GiB size: 915.53 GiB (98.32%)
    used: 376.38 GiB (41.1%) fs: ext4 block-size: 4096 B dev: /dev/nvme0n1p2
    maj-min: 259:2 label: N/A uuid: 630ed032-805c-4676-ac45-b21cb10626b1
  ID-2: /boot/efi raw-size: 300 MiB size: 299.4 MiB (99.80%)
    used: 288 KiB (0.1%) fs: vfat block-size: 512 B dev: /dev/nvme0n1p1
    maj-min: 259:1 label: N/A uuid: 9771-6300
  ID-3: /mnt/4tb_sata raw-size: 3.64 TiB size: 3.58 TiB (98.40%)
    used: 306.17 GiB (8.4%) fs: ext4 block-size: 4096 B dev: /dev/sda1
    maj-min: 8:1 label: N/A uuid: a274a729-d213-40b2-a251-cf7114afce86
  ID-4: /mnt/nas raw-size: N/A size: 21.65 TiB used: 8.75 TiB (40.4%)
    fs: nfs4 remote: 192.168.1.2:/mnt/raid
Swap:
  Alert: No swap data was found.
Unmounted:
  Message: No unmounted partitions found.
USB:
  Hub-1: 1-0:1 info: hi-speed hub with single TT ports: 12 rev: 2.0
    speed: 480 Mb/s (57.2 MiB/s) lanes: 1 mode: 2.0 chip-ID: 1d6b:0002
    class-ID: 0900
  Device-1: 1-2:2 info: Logitech G413 Gaming Keyboard type: keyboard,HID
    driver: hid-generic,usbhid interfaces: 2 rev: 2.0 speed: 12 Mb/s (1.4 MiB/s)
    lanes: 1 mode: 1.1 power: 500mA chip-ID: 046d:c33a class-ID: 0300
    serial: <filter>
  Device-2: 1-6:3 info: Integrated Express GIGABYTE Device type: HID
    driver: hid-generic,usbhid interfaces: 2 rev: 2.0 speed: 12 Mb/s (1.4 MiB/s)
    lanes: 1 mode: 1.1 power: 100mA chip-ID: 048d:5711 class-ID: 0300
  Device-3: 1-9:4 info: Foxconn / Hon Hai Wireless_Device type: bluetooth
    driver: btusb interfaces: 3 rev: 2.1 speed: 480 Mb/s (57.2 MiB/s) lanes: 1
    mode: 2.0 power: 100mA chip-ID: 0489:e124 class-ID: e001 serial: <filter>
  Hub-2: 2-0:1 info: super-speed hub ports: 5 rev: 3.1
    speed: 20 Gb/s (2.33 GiB/s) lanes: 2 mode: 3.2 gen-2x2 chip-ID: 1d6b:0003
    class-ID: 0900
  Hub-3: 3-0:1 info: hi-speed hub with single TT ports: 2 rev: 2.0
    speed: 480 Mb/s (57.2 MiB/s) lanes: 1 mode: 2.0 chip-ID: 1d6b:0002
    class-ID: 0900
  Hub-4: 4-0:1 info: super-speed hub ports: 2 rev: 3.1
    speed: 20 Gb/s (2.33 GiB/s) lanes: 2 mode: 3.2 gen-2x2 chip-ID: 1d6b:0003
    class-ID: 0900
  Hub-5: 5-0:1 info: hi-speed hub with single TT ports: 2 rev: 2.0
    speed: 480 Mb/s (57.2 MiB/s) lanes: 1 mode: 2.0 chip-ID: 1d6b:0002
    class-ID: 0900
  Hub-6: 6-0:1 info: super-speed hub ports: 2 rev: 3.1
    speed: 10 Gb/s (1.16 GiB/s) lanes: 1 mode: 3.2 gen-2x1 chip-ID: 1d6b:0003
    class-ID: 0900
  Hub-7: 7-0:1 info: hi-speed hub with single TT ports: 2 rev: 2.0
    speed: 480 Mb/s (57.2 MiB/s) lanes: 1 mode: 2.0 chip-ID: 1d6b:0002
    class-ID: 0900
  Hub-8: 7-2:2 info: Realtek RTS5411 Hub ports: 4 rev: 2.1
    speed: 480 Mb/s (57.2 MiB/s) lanes: 1 mode: 2.0 chip-ID: 0bda:5411
    class-ID: 0900
  Hub-9: 8-0:1 info: super-speed hub ports: 2 rev: 3.1
    speed: 10 Gb/s (1.16 GiB/s) lanes: 1 mode: 3.2 gen-2x1 chip-ID: 1d6b:0003
    class-ID: 0900
  Hub-10: 8-2:2 info: Realtek Hub ports: 4 rev: 3.2
    speed: 5 Gb/s (596.0 MiB/s) lanes: 1 mode: 3.2 gen-1x1 chip-ID: 0bda:0411
    class-ID: 0900
  Hub-11: 9-0:1 info: hi-speed hub with single TT ports: 1 rev: 2.0
    speed: 480 Mb/s (57.2 MiB/s) lanes: 1 mode: 2.0 chip-ID: 1d6b:0002
    class-ID: 0900
  Hub-12: 9-1:2 info: Genesys Logic Hub ports: 4 rev: 2.0
    speed: 480 Mb/s (57.2 MiB/s) lanes: 1 mode: 2.0 power: 100mA
    chip-ID: 05e3:0608 class-ID: 0900
  Device-1: 9-1.1:3 info: Logitech HD Webcam C615 type: audio,video
    driver: snd-usb-audio,uvcvideo interfaces: 4 rev: 2.0
    speed: 480 Mb/s (57.2 MiB/s) lanes: 1 mode: 2.0 power: 500mA
    chip-ID: 046d:082c class-ID: 0e02 serial: <filter>
  Device-2: 9-1.3:4 info: Logitech G502 SE HERO Gaming Mouse type: mouse,HID
    driver: hid-generic,usbhid interfaces: 2 rev: 2.0 speed: 12 Mb/s (1.4 MiB/s)
    lanes: 1 mode: 1.1 power: 300mA chip-ID: 046d:c08b class-ID: 0300
    serial: <filter>
  Device-3: 9-1.4:6 info: Microsoft Xbox One Controller
    type: <vendor specific> driver: xpad interfaces: 3 rev: 2.0
    speed: 12 Mb/s (1.4 MiB/s) lanes: 1 mode: 1.1 power: 500mA
    chip-ID: 045e:02ea class-ID: ff00 serial: <filter>
  Hub-13: 10-0:1 info: Linux Foundation 3.0 root hub ports: N/A rev: 3.0
    speed: 5 Gb/s (596.0 MiB/s) lanes: 1 mode: 3.2 gen-1x1 chip-ID: 1d6b:0003
    class-ID: 0900
Sensors:
  System Temperatures: cpu: 47.1 C mobo: N/A gpu: amdgpu temp: 41.0 C
  Fan Speeds (rpm): N/A
Repos:
  Packages: pm: pacman pkgs: 1553 libs: 418 tools: pamac pm: flatpak pkgs: 0
  Active pacman repo servers in: /etc/pacman.d/mirrorlist
    1: https://mirrors.cicku.me/manjaro/stable/$repo/$arch
    2: https://uvermont.mm.fcix.net/manjaro/stable/$repo/$arch
    3: https://manjaro.ipacct.com/manjaro/stable/$repo/$arch
    4: https://mirror.koddos.net/manjaro/stable/$repo/$arch
    5: https://mirror.leitecastro.com/manjaro/stable/$repo/$arch
    6: https://ftp.lysator.liu.se/pub/manjaro/stable/$repo/$arch
    7: https://mirror.ufro.cl/manjaro/stable/$repo/$arch
    8: http://ftp.rz.tu-bs.de/pub/mirror/manjaro.org/repos/stable/$repo/$arch
Processes:
  CPU top: 5 of 561
  1: cpu: 24.1% command: baloo_file_extractor pid: 2046 mem: 585.9 MiB (0.4%)
  2: cpu: 22.6% command: [kworker/18:3-events] pid: 2906 mem: 0.00 MiB (0.0%)
  3: cpu: 22.1% command: firefox pid: 2094 mem: 563.9 MiB (0.4%)
  4: cpu: 17.9% command: firefox pid: 2270 mem: 329.8 MiB (0.2%)
  5: cpu: 6.6% command: plasmashell pid: 1666 mem: 553.0 MiB (0.4%)
  Memory top: 5 of 561
  1: mem: 585.9 MiB (0.4%) command: baloo_file_extractor pid: 2046 cpu: 24.1%
  2: mem: 563.9 MiB (0.4%) command: firefox pid: 2094 cpu: 22.1%
  3: mem: 553.0 MiB (0.4%) command: plasmashell pid: 1666 cpu: 6.6%
  4: mem: 329.8 MiB (0.2%) command: firefox pid: 2270 cpu: 17.9%
  5: mem: 316.1 MiB (0.2%) command: kwin_wayland pid: 1496 cpu: 6.5%
Info:
  Processes: 561 Power: uptime: 0m states: freeze,mem,disk suspend: deep
    avail: s2idle wakeups: 0 hibernate: platform avail: shutdown, reboot,
    suspend, test_resume image: 49.89 GiB services: org_kde_powerdevil,
    power-profiles-daemon, upowerd Init: systemd v: 256 default: graphical
    tool: systemctl
  Compilers: clang: 18.1.8 gcc: 14.2.1 Shell: Sudo (sudo) v: 1.9.16p2
    default: Bash v: 5.2.37 running-in: konsole inxi: 3.3.37

I am not all that familiar with Nvidia - I have done some testing a couple a years back using Nvidia Quadro - but I sinced moved my systems to use AMD GPU.

Especially your CPU draws attention and your kenel is the early stage rc.

My suggestion is to switch branch to testing or unstable - as it provides 6.13.0 release kernel.

 $ mbn info linux613 -q | grep -e 'Branch' -e 'Version'
Branch         : unstable
Version        : 6.13.0-1
Branch         : testing
Version        : 6.13.0-1
Branch         : stable
Version        : 6.13.0rc2-1

package: manjaro-check-repos (remember mbn update)

2 Likes

Testing branch also has kernel 6.13.0, if the OP does not want to switch to Unstable branch:

mbn info linux613 -q | grep -v 'Packager'
Branch         : unstable
Name           : linux613
Version        : 6.13.0-1
Repository     : core
Build Date     : Mon 20 Jan 2025 15:45:43 
Branch         : testing
Name           : linux613
Version        : 6.13.0-1
Repository     : core
Build Date     : Mon 20 Jan 2025 15:45:43 
Branch         : stable
Name           : linux613
Version        : 6.13.0rc2-1
Repository     : core
Build Date     : Tue 10 Dec 2024 13:26:31
1 Like

that reminds me - I have to remember to update before I compare :facepalm:

1 Like

I changed to Unstable as @linux-aarhus suggested and the game no longer crashes the system at least. It just fails to launch now. I can look more after work but I need to figure out where Steam logs failed game launches. If anyone knows in the meantime and can save me the time of looking for those logs, it’d be appreciated

Update, I am still getting the crashes. As stated earlier, I switched to the Unstable branch, updated my system, and got the stable/release 6.13 kernel in the process.

At that point, The Long Dark was no longer crashing the system, but I started Bioshock Infinite, which I was able to play before on the 6.13-rc2 kernel and stable branch earlier, and it got into the 2K title screen and the system crashed and rebooted.

I then thought that maybe the unstable branch was causing a different issue, so I moved back to testing (keeping the 6.13 release kernel). I also disabled my integrated GPU via the BIOS and reinstalled the nvidia drivers via mhwd.

Now, simply opening Steam to the Store/Library causes the crash, before ever launching a game at all. A moment ago, it also crashed a few seconds after logging in and getting to the desktop.

I am wondering at this point if it is a hardware issue with the new card and if so, how to nail that down. I should add that I did not purchase this card brand new, it was purchased on Ebay in “like new” condition, but it is possible that the seller offloaded a faulty card onto me. According to their website, EVGA does offer warranty service for second-hand cards, but I would like to somehow confirm it’s a hardware problem or at least firm that up before pursuing that route. I have my old 1080ti I can swap back in, but I don’t know what that would rule out, considering it is a completely different, much older card.

I wouldn’t be too surprised if the card has been “cooked” i.e. using it for stuff like crypto-mining. Especially if the price was suspiciously low for a card of that type. :man_shrugging:

1 Like

I won the bid at $820, which was a tad lower than comparable listings, but I didn’t think it was suspiciously low.

I’m thinking what I may do is install Windows (is that a dirty word here?) onto a second drive and see if I get crashes there. If so, I would be a lot more confident in a hardware issue and I would move forward with a warranty process. Or at that point I may even go through eBay’s buyer protections. It would give me more options and more peace of mind to know it’s not a kernel bug or something.

1 Like

Good idea trying it with Windows, IMHO. I don’t touch it with a barge-pole normally, but I guess it does have its uses. :wink:

1 Like

Your profile picture didn’t even register with me until after I posted that :joy:

1 Like

One thing I forgot to mention: disconnect the Linux drive before installing Windows on the other one, to avoid messing up your bootloader, etc. .:wink:

1 Like

Latest update:

I installed Windows, and it did indeed crash, while doing nothing in particular (had just stopped a stress test program and was downloading another).

I went back to Linux, and I was able to trigger the crash pretty consistently by just scrolling through Steam’s store page. I swapped in my old 1080ti, and I actually got a crash on that too, so I considered the 3090 ruled out. I started planning on bringing my PSU back to Best Buy and exchanging it, just to rule that out too.

I was reading through forums and someone suggested disabling any XMP profiles/memory overclocking. I was running an EXPO profile, but I had run memtest86 after setting it so I hadn’t thought anything of it, but this was before installing a GPU. I was getting desperate, so I tried disabling it anyway, and I haven’t been able to trigger a crash since. It’s only been about 10 minutes, but it was happening pretty consistently in Steam earlier so I think this might have been it. Somehow, installing a GPU must have thrown off the memory stability. I’m also wondering if anyone has seen something like this before.

I’m planning on re-enabling it tonight and running memtest86 again, but between now and then I’m going to use Manjaro normally and see if it crashes again.

I note that you’re using F3j which is a beta BIOS.

As a rule, unless a beta BIOS has a fix for a specific issue, it is generally recommended to choose the latest non-beta BIOS.

Whether this (in combination with other factors) is contributing to these crashes, I can’t say; but, it’s worth consideration.

Regards.

Where on the Gigabyte download page does it say it’s a beta? I’m not arguing with you, just trying to understand how I would know that.

Also, still getting the crashes, the memory settings didn’t completely fix it.

The letter following the number usually indicates a beta BIOS. Most manufacturers have long adopted a similar naming convention; Gigabyte is no exception.

Regards.


I’ll also add that being a beta BIOS doesn’t necessarily mean it will be problematic, either; but, history has shown they can be.


It should probably go without saying, there is usually no warranty on second-hand goods; unless perhaps to a limited degree on a refurbished product (I suppose that would depend on whosoever refurbished it). eBay would likely be the better path to a resolution, if it came to that.

2 Likes

I have been cheated on ebay - with a defect cpu - and the seller told me I was an idiot.

That said - it could be the GPU has been used for crypto mining and found to be bad at it …

1 Like