Crashing to Black-Screen-Of-Indifference - Ryzen 3 System - Logs no help so far

Hello, I resurrected/put back together a Ryzen 3 system on a B350M Gaming Pro motherboard and installed Manjaro with XFCE on it for use as a server.

It has a somewhat older NVIDIA card in it as well. It crashes a lot, to a black screen. Sometimes it reboots, but most of the time it hard locks to the point where Caps Lock doesn’t toggle its LED on the keyboard.

I’ve tried the open source and proprietary drivers for the video card, looked through the logs as best I know how, turned off sleep and hibernation, and even pulled the wifi card, but I’ve yet to find the issue. I also rolled back the kernel, and it’s still happening.

Is there a method of getting crash dumps from a kernel panic? I suspect it’s the motherboard BIOS, but MSI doesn’t allow a BIOS rollback and I’ve not discovered a workaround, if one exists.

It is of course frustrating, because at the moment I see no rhyme or reason to it…and I’ve never seen it happen in real time sadly.

I’m at the point of perhaps moving all my docker containers to another PC, which is frustrating. Even if the PC would just reboot it would be better than a hard lock.

Thanks in advance for any assistance.
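On the wish that the machine would at least reboot instead of staying locked: a minimal sketch, assuming the kernel was built with the soft and hard lockup detectors, is to turn a detected lockup into a panic and have the kernel reboot shortly after any panic. The file name `99-lockup-reboot.conf` and the function name are hypothetical, and a lock that freezes the hardware completely may never reach the detectors, so this is something to try rather than a guaranteed fix.

```shell
#!/usr/bin/env bash
# Sketch: write sysctl settings that turn detected lockups into a panic
# and reboot a few seconds after any panic.
# The default file name is a hypothetical choice; any name in /etc/sysctl.d works.
write_lockup_sysctl() {
    local target="${1:-/etc/sysctl.d/99-lockup-reboot.conf}"
    cat > "$target" <<'EOF'
# Reboot 10 seconds after a kernel panic instead of hanging forever
kernel.panic = 10
# Treat a kernel oops as a panic
kernel.panic_on_oops = 1
# Panic when the soft or hard lockup detector fires
kernel.softlockup_panic = 1
kernel.hardlockup_panic = 1
EOF
}
```

Run the function as root, then apply the settings with `sudo sysctl --system` (they are also read at every boot).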

Here’s the system info from inxi --admin --verbosity=7 --filter --no-host --width

System:
  Kernel: 5.10.136-1-MANJARO arch: x86_64 bits: 64 compiler: gcc v: 12.1.1
    parameters: BOOT_IMAGE=/boot/vmlinuz-5.10-x86_64
    root=UUID=e33c020c-78d4-4754-81bc-fae92e7b4a41 rw quiet apparmor=1
    security=apparmor resume=UUID=e73fe0e8-3fa2-4b67-9e9a-e90b9a8b4a7f
    udev.log_priority=3
  Desktop: Xfce v: 4.16.0 tk: Gtk v: 3.24.29 info: xfce4-panel wm: xfwm
    v: 4.16.1 vt: 7 dm: LightDM v: 1.32.0 Distro: Manjaro Linux base: Arch Linux
Machine:
  Type: Desktop System: Micro-Star product: MS-7A39 v: 1.0
    serial: <superuser required>
  Mobo: MSI model: B350M GAMING PRO (MS-7A39) v: 1.0
    serial: <superuser required> UEFI: American Megatrends LLC. v: 2.P3
    date: 07/16/2022
Battery:
  Message: No system battery data found. Is one present?
Memory:
  RAM: total: 15.61 GiB used: 6.92 GiB (44.3%)
  RAM Report: permissions: Unable to run dmidecode. Root privileges
    required.
CPU:
  Info: model: AMD Ryzen 3 1300X bits: 64 type: MCP arch: Zen level: v3
    built: 2017-19 process: GF 14nm family: 0x17 (23) model-id: 1 stepping: 1
    microcode: 0x8001138
  Topology: cpus: 1x cores: 4 smt: <unsupported> cache: L1: 384 KiB
    desc: d-4x32 KiB; i-4x64 KiB L2: 2 MiB desc: 4x512 KiB L3: 8 MiB
    desc: 2x4 MiB
  Speed (MHz): avg: 1593 high: 2058 min/max: 1550/3500 boost: enabled
    scaling: driver: acpi-cpufreq governor: schedutil cores: 1: 1496 2: 1388
    3: 1433 4: 2058 bogomips: 28010
  Flags: 3dnowprefetch abm adx aes aperfmperf apic arat avic avx avx2 bmi1
    bmi2 bpext clflush clflushopt clzero cmov cmp_legacy constant_tsc cpb
    cpuid cr8_legacy cx16 cx8 de decodeassists extapic extd_apicid f16c
    flushbyasid fma fpu fsgsbase fxsr fxsr_opt ht hw_pstate ibpb irperf
    lahf_lm lbrv lm mca mce misalignsse mmx mmxext monitor movbe msr mtrr
    mwaitx nonstop_tsc nopl npt nrip_save nx osvw overflow_recov pae pat
    pausefilter pclmulqdq pdpe1gb perfctr_core perfctr_llc perfctr_nb
    pfthreshold pge pni popcnt pse pse36 rdrand rdseed rdtscp rep_good sep sev
    sha_ni skinit smap smca sme smep ssbd sse sse2 sse4_1 sse4_2 sse4a ssse3
    succor svm svm_lock syscall tce topoext tsc tsc_scale v_vmsave_vmload vgif
    vmcb_clean vme vmmcall wdt xgetbv1 xsave xsavec xsaveerptr xsaveopt xsaves
  Vulnerabilities:
  Type: itlb_multihit status: Not affected
  Type: l1tf status: Not affected
  Type: mds status: Not affected
  Type: meltdown status: Not affected
  Type: mmio_stale_data status: Not affected
  Type: retbleed mitigation: untrained return thunk; SMT disabled
  Type: spec_store_bypass mitigation: Speculative Store Bypass disabled via
    prctl and seccomp
  Type: spectre_v1 mitigation: usercopy/swapgs barriers and __user pointer
    sanitization
  Type: spectre_v2 mitigation: Retpolines, IBPB: conditional, STIBP:
    disabled, RSB filling, PBRSB-eIBRS: Not affected
  Type: srbds status: Not affected
  Type: tsx_async_abort status: Not affected
Graphics:
  Device-1: NVIDIA GP106 [GeForce GTX 1060 6GB] vendor: Micro-Star MSI
    driver: nvidia v: 515.65.01 alternate: nouveau,nvidia_drm non-free: 515.xx+
    status: current (as of 2022-08) arch: Pascal code: GP10x
    process: TSMC 16nm built: 2016-21 pcie: gen: 2 speed: 5 GT/s lanes: 16
    link-max: gen: 3 speed: 8 GT/s bus-ID: 29:00.0 chip-ID: 10de:1c03
    class-ID: 0300
  Display: x11 server: X.Org v: 21.1.4 compositor: xfwm v: 4.16.1 driver: X:
    loaded: nvidia gpu: nvidia display-ID: :0.0 screens: 1
  Screen-1: 0 s-res: 1920x1080 s-dpi: 96 s-size: 508x286mm (20.00x11.26")
    s-diag: 583mm (22.95")
  Monitor-1: HDMI-1 res: 1920x1080 hz: 60 dpi: 93
    size: 527x296mm (20.75x11.65") diag: 604mm (23.8") modes: N/A
  OpenGL: renderer: NVIDIA GeForce GTX 1060 6GB/PCIe/SSE2 v: 4.6.0 NVIDIA
    515.65.01 direct render: Yes
Audio:
  Device-1: NVIDIA GP106 High Definition Audio vendor: Micro-Star MSI
    driver: snd_hda_intel v: kernel pcie: gen: 3 speed: 8 GT/s lanes: 16
    bus-ID: 29:00.1 chip-ID: 10de:10f1 class-ID: 0403
  Device-2: AMD Family 17h HD Audio vendor: Micro-Star MSI
    driver: snd_hda_intel v: kernel pcie: gen: 3 speed: 8 GT/s lanes: 16
    bus-ID: 2b:00.3 chip-ID: 1022:1457 class-ID: 0403
  Sound Server-1: ALSA v: k5.10.136-1-MANJARO running: yes
  Sound Server-2: JACK v: 1.9.21 running: no
  Sound Server-3: PulseAudio v: 16.1 running: yes
  Sound Server-4: PipeWire v: 0.3.56 running: yes
Network:
  Device-1: Realtek RTL8111/8168/8411 PCI Express Gigabit Ethernet
    vendor: Micro-Star MSI driver: r8169 v: kernel pcie: gen: 1 speed: 2.5 GT/s
    lanes: 1 port: f000 bus-ID: 25:00.0 chip-ID: 10ec:8168 class-ID: 0200
  IF: enp37s0 state: up speed: 1000 Mbps duplex: full mac: <filter>
  IP v4: <filter> type: dynamic noprefixroute scope: global
    broadcast: <filter>
  IP v6: <filter> type: noprefixroute scope: link
  IF-ID-1: br-1610b3b2c6a7 state: up speed: 10000 Mbps duplex: unknown
    mac: <filter>
  IP v4: <filter> scope: global broadcast: <filter>
  IP v6: <filter> scope: link
  IF-ID-2: br-1cd8729ecbda state: up speed: 10000 Mbps duplex: unknown
    mac: <filter>
  IP v4: <filter> scope: global broadcast: <filter>
  IP v6: <filter> scope: link
  IF-ID-3: br-276bd3965a26 state: down mac: <filter>
  Message: Output throttled. IPs: 1; Limit: 10; Override: --limit [1-x;-1
    all]
  IF-ID-4: br-4c467f11bf01 state: up speed: 10000 Mbps duplex: unknown
    mac: <filter>
  Message: Output throttled. IPs: 2; Limit: 10; Override: --limit [1-x;-1
    all]
  IF-ID-5: br-928dcc537b74 state: up speed: 10000 Mbps duplex: unknown
    mac: <filter>
  Message: Output throttled. IPs: 2; Limit: 10; Override: --limit [1-x;-1
    all]
  IF-ID-6: br-f0fdad466cbb state: up speed: 10000 Mbps duplex: unknown
    mac: <filter>
  Message: Output throttled. IPs: 2; Limit: 10; Override: --limit [1-x;-1
    all]
  IF-ID-7: br-ff2cb78ee69f state: up speed: 10000 Mbps duplex: unknown
    mac: <filter>
  Message: Output throttled. IPs: 2; Limit: 10; Override: --limit [1-x;-1
    all]
  IF-ID-8: docker0 state: down mac: <filter>
  Message: Output throttled. IPs: 1; Limit: 10; Override: --limit [1-x;-1
    all]
  IF-ID-9: veth000d0a5 state: up speed: 10000 Mbps duplex: full
    mac: <filter>
  IF-ID-10: veth0e45d4b state: up speed: 10000 Mbps duplex: full
    mac: <filter>
  IF-ID-11: veth1c3c1f7 state: up speed: 10000 Mbps duplex: full
    mac: <filter>
  IF-ID-12: veth232c019 state: up speed: 10000 Mbps duplex: full
    mac: <filter>
  IF-ID-13: veth30a49b9 state: up speed: 10000 Mbps duplex: full
    mac: <filter>
  IF-ID-14: veth365092f state: up speed: 10000 Mbps duplex: full
    mac: <filter>
  IF-ID-15: veth3d56ccb state: up speed: 10000 Mbps duplex: full
    mac: <filter>
  IF-ID-16: veth46071c7 state: up speed: 10000 Mbps duplex: full
    mac: <filter>
  IF-ID-17: veth4bacc94 state: up speed: 10000 Mbps duplex: full
    mac: <filter>
  IF-ID-18: veth550e9e0 state: up speed: 10000 Mbps duplex: full
    mac: <filter>
  IF-ID-19: veth5cf8b44 state: up speed: 10000 Mbps duplex: full
    mac: <filter>
  IF-ID-20: veth7311911 state: up speed: 10000 Mbps duplex: full
    mac: <filter>
  IF-ID-21: veth7492f95 state: up speed: 10000 Mbps duplex: full
    mac: <filter>
  IF-ID-22: veth8f46d98 state: up speed: 10000 Mbps duplex: full
    mac: <filter>
  IF-ID-23: vetha59060a state: up speed: 10000 Mbps duplex: full
    mac: <filter>
  IF-ID-24: vethc9c4bd7 state: up speed: 10000 Mbps duplex: full
    mac: <filter>
  IF-ID-25: vethd66170e state: up speed: 10000 Mbps duplex: full
    mac: <filter>
  IF-ID-26: vethda21635 state: up speed: 10000 Mbps duplex: full
    mac: <filter>
  IF-ID-27: vethdce5480 state: up speed: 10000 Mbps duplex: full
    mac: <filter>
  IF-ID-28: vethf244741 state: up speed: 10000 Mbps duplex: full
    mac: <filter>
  IF-ID-29: vethf685a98 state: up speed: 10000 Mbps duplex: full
    mac: <filter>
  WAN IP: <filter>
Bluetooth:
  Message: No bluetooth data found.
Logical:
  Message: No logical block device data found.
RAID:
  Message: No RAID data found.
Drives:
  Local Storage: total: 931.51 GiB used: 23.79 GiB (2.6%)
  SMART Message: Required tool smartctl not installed. Check --recommends
  ID-1: /dev/sda maj-min: 8:0 vendor: Western Digital model: WD Blue SA510
    2.5 1TB size: 931.51 GiB block-size: physical: 512 B logical: 512 B
    speed: 6.0 Gb/s type: SSD serial: <filter> rev: 8100 scheme: GPT
  Message: No optical or floppy data found.
Partition:
  ID-1: / raw-size: 914.12 GiB size: 898.7 GiB (98.31%) used: 23.79 GiB (2.6%)
    fs: ext4 dev: /dev/sda2 maj-min: 8:2 label: N/A
    uuid: e33c020c-78d4-4754-81bc-fae92e7b4a41
  ID-2: /boot/efi raw-size: 300 MiB size: 299.4 MiB (99.80%) used: 312 KiB
    (0.1%) fs: vfat dev: /dev/sda1 maj-min: 8:1 label: NO_LABEL
    uuid: B552-FD79
Swap:
  Kernel: swappiness: 60 (default) cache-pressure: 100 (default)
  ID-1: swap-1 type: partition size: 17.1 GiB used: 0 KiB (0.0%)
    priority: -2 dev: /dev/sda3 maj-min: 8:3 label: swap
    uuid: e73fe0e8-3fa2-4b67-9e9a-e90b9a8b4a7f
Unmounted:
  Message: No unmounted partitions found.
USB:
  Hub-1: 1-0:1 info: Hi-speed hub with single TT ports: 10 rev: 2.0
    speed: 480 Mb/s chip-ID: 1d6b:0002 class-ID: 0900
  Hub-2: 2-0:1 info: Super-speed hub ports: 4 rev: 3.1 speed: 10 Gb/s
    chip-ID: 1d6b:0003 class-ID: 0900
  Hub-3: 3-0:1 info: Hi-speed hub with single TT ports: 4 rev: 2.0
    speed: 480 Mb/s chip-ID: 1d6b:0002 class-ID: 0900
  Device-1: 3-1:2 info: HP HP Business Slim Keyboard type: Keyboard,HID
    driver: hid-generic,usbhid interfaces: 2 rev: 2.0 speed: 1.5 Mb/s
    power: 100mA chip-ID: 03f0:2f4a class-ID: 0300
  Device-2: 3-2:3 info: Microsoft Ergonomic Mouse type: Mouse,HID
    driver: hid-generic,usbhid interfaces: 2 rev: 2.0 speed: 12 Mb/s
    power: 100mA chip-ID: 045e:082e class-ID: 0300 serial: <filter>
  Hub-4: 4-0:1 info: Super-speed hub ports: 4 rev: 3.0 speed: 5 Gb/s
    chip-ID: 1d6b:0003 class-ID: 0900
Sensors:
  System Temperatures: cpu: 38.5 C mobo: N/A gpu: nvidia temp: 43 C
  Fan Speeds (RPM): N/A gpu: nvidia fan: 0%
Info:
  Processes: 335 Uptime: 59m wakeups: 0 Init: systemd v: 251
  default: graphical tool: systemctl Compilers: gcc: 12.1.1 clang: 14.0.6
  Packages: 1132 pm: pacman pkgs: 1129 libs: 319 tools: pamac,yay pm: flatpak
  pkgs: 0 pm: snap pkgs: 3 Shell: Bash v: 5.1.16 running-in: xfce4-terminal
  inxi: 3.3.21

Though you may need to access it from a chroot so that it is not overwritten by booting.


But if you have no log of the crashes, it is likely a hardware issue.

Yeah, I think it’s hardware. I’m really annoyed that MSI doesn’t allow rollbacks of their motherboard BIOS. I think I’ll have to move the Docker containers to a more stable PC until they update it again.

So on the next crash, if I boot from install media and look at the dmesg logs in a chroot, I might get some more detail?

I was hoping for a way to have it continuously log everything to something like a ~1000-line file, so I don’t have to be so creative. :)

Thanks for the reply. I’ll give it a shot.
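For the rolling-log idea, journald may already cover most of it: with the default `Storage=auto` setting it keeps logs across reboots as long as `/var/log/journal` exists. A minimal sketch, assuming that directory is present on this install, saves the last 1000 lines of the previous boot to a file after the machine comes back up. The function name and default output file name are hypothetical.

```shell
#!/usr/bin/env bash
# Sketch: save the last N lines of the previous boot's journal to a file.
# Assumes the journal is persistent (/var/log/journal exists); otherwise
# journald keeps logs only in RAM and they are lost on reboot.
save_prev_boot_tail() {
    local lines="${1:-1000}"
    local out="${2:-prev-boot-tail.log}"   # hypothetical output file name
    # -b -1 selects the previous boot, -n limits output to the last N lines
    journalctl -b -1 -n "$lines" --no-pager > "$out"
}
```

`journalctl --list-boots` shows which boots are stored. Whatever the kernel could not flush to disk before the lock will be missing, so the last lines may stop short of the actual crash.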

Hello,

A couple of personal pointers:

You would be better off with the 5.15 kernel or newer.

This is something I would address too. See this:

I do not recommend a swap partition on an SSD like that. You are much better off with a swapfile or zram-generator.
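As a sketch of the zram-generator route, assuming the `zram-generator` package is installed: it reads `/etc/systemd/zram-generator.conf`, and a small config is enough to get a compressed swap device in RAM. The function name is hypothetical, and half of RAM is only an example size.

```shell
#!/usr/bin/env bash
# Sketch: write a minimal zram-generator config.
# "ram / 2" is an example value, not a recommendation for this machine.
write_zram_config() {
    local target="${1:-/etc/systemd/zram-generator.conf}"
    cat > "$target" <<'EOF'
[zram0]
# Size of the compressed swap device
zram-size = ram / 2
compression-algorithm = zstd
EOF
}
```

After running the function as root, `sudo systemctl daemon-reload` followed by `sudo systemctl start systemd-zram-setup@zram0.service` should bring the device up, and `swapon --show` lists it. The existing swap partition stays active until it is removed from `/etc/fstab`.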

For logs, use:

sudo journalctl -b 0    # current boot; -b -1 is the previous boot, -b -2 the one before, etc.
sudo journalctl -p 3 -xb

You should add these kernel boot parameters:
amd_iommu=on iommu=pt processor.max_cstate=5
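One way to add those parameters, sketched under the assumption that this install boots through GRUB (the `BOOT_IMAGE=` entry in the inxi output suggests so), is to append them to `GRUB_CMDLINE_LINUX_DEFAULT` in `/etc/default/grub`. The helper name `add_kernel_params` is hypothetical. It keeps a `.bak` copy and does not check for duplicates, so run it once.

```shell
#!/usr/bin/env bash
# Sketch: append kernel parameters to GRUB_CMDLINE_LINUX_DEFAULT.
# Assumes GNU sed and a line of the form GRUB_CMDLINE_LINUX_DEFAULT="..."
add_kernel_params() {
    local params="$1"
    local file="${2:-/etc/default/grub}"
    # Keep a backup of the original file
    cp "$file" "$file.bak"
    # Insert the parameters just before the closing quote of the value
    sed -i "s|^\(GRUB_CMDLINE_LINUX_DEFAULT=\"[^\"]*\)\"|\1 ${params}\"|" "$file"
}
```

Called as root with `add_kernel_params "amd_iommu=on iommu=pt processor.max_cstate=5"`, then followed by `sudo update-grub` and a reboot, the new parameters should show up in `cat /proc/cmdline`.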

The network device, Realtek RTL8111/8168/8411 PCI Express Gigabit Ethernet, is likely to work better if the r8169 driver is replaced with r8168:

sudo mhwd -i pci network-r8168