Hey guys! Hope yβall are doing well.
After installing the update from 04/09 my system started crashing randomly. It really does not have a pattern - I can be playing, scrolling through Facebook, coding, etc; It just freezes. No commands will work, have to hard reset/power down the hole computer.
At first, I thought the problem was related to some Iommu errors (like this one)
pci 0000:00:00.2: AMD-Vi: Unable to read/write to IOMMU perf counter.
But then I realized those are related boot errors - so its probably not related to the actual problem.
Iβve tried a lot of things, like
- undo the update
- reinstall the hole system
- changed GRUBβs IOMMU related values (GRUB_CMDLINE_LINUX=βiommu=ptβ) - before realizing it wasnβt related
- use the Linux 5.4 kernel instead of 5.10
And still, it will randomly freeze. Using 5.4 kernel helps, idk why. I really appreciate if someone can take sometime to help me out. I usually donβt open threads like these Iβm losing my mind with this problem.
Tks a lot!
Here are the logs for my last freeze:
abr 15 13:40:40 the-machine dbus-daemon[591]: [system] Failed to activate service 'org.bluez': timed out (service_start_timeout=25000ms)
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:2 pasid:32774, for process firefox pid 94303 thread firefox:cs0 pid 94347)
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: in page starting at address 0x000080010fc01000 from client 27
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00241051
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: MORE_FAULTS: 0x1
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: WALKER_ERROR: 0x0
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: PERMISSION_FAULTS: 0x5
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: MAPPING_ERROR: 0x0
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: RW: 0x1
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:2 pasid:32774, for process firefox pid 94303 thread firefox:cs0 pid 94347)
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: in page starting at address 0x000080010fc00000 from client 27
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00241051
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: MORE_FAULTS: 0x1
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: WALKER_ERROR: 0x0
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: PERMISSION_FAULTS: 0x5
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: MAPPING_ERROR: 0x0
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: RW: 0x1
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:2 pasid:32774, for process firefox pid 94303 thread firefox:cs0 pid 94347)
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: in page starting at address 0x000080010fc07000 from client 27
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00241051
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: MORE_FAULTS: 0x1
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: WALKER_ERROR: 0x0
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: PERMISSION_FAULTS: 0x5
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: MAPPING_ERROR: 0x0
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: RW: 0x1
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:2 pasid:32774, for process firefox pid 94303 thread firefox:cs0 pid 94347)
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: in page starting at address 0x000080010fc03000 from client 27
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00241051
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: MORE_FAULTS: 0x1
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: WALKER_ERROR: 0x0
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: PERMISSION_FAULTS: 0x5
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: MAPPING_ERROR: 0x0
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: RW: 0x1
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:2 pasid:32774, for process firefox pid 94303 thread firefox:cs0 pid 94347)
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: in page starting at address 0x000080010fc02000 from client 27
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00241051
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: MORE_FAULTS: 0x1
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: WALKER_ERROR: 0x0
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: PERMISSION_FAULTS: 0x5
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: MAPPING_ERROR: 0x0
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: RW: 0x1
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:2 pasid:32774, for process firefox pid 94303 thread firefox:cs0 pid 94347)
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: in page starting at address 0x000080010fc05000 from client 27
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00241051
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: MORE_FAULTS: 0x1
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: WALKER_ERROR: 0x0
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: PERMISSION_FAULTS: 0x5
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: MAPPING_ERROR: 0x0
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: RW: 0x1
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:2 pasid:32774, for process firefox pid 94303 thread firefox:cs0 pid 94347)
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: in page starting at address 0x000080010fc04000 from client 27
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00241051
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: MORE_FAULTS: 0x1
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: WALKER_ERROR: 0x0
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: PERMISSION_FAULTS: 0x5
abr 15 13:41:03 the-machine kernel: amdgpu 0000:06:00.0: MAPPING_ERROR: 0x0
inxi -Fza
System: Kernel: 5.4.108-1-MANJARO x86_64 bits: 64 compiler: gcc v: 10.2.0
parameters: BOOT_IMAGE=/boot/vmlinuz-5.4-x86_64 root=UUID=1991e2ed-da17-4c3e-823e-37c85340ed96 rw iommu=pt quiet
splash apparmor=1 security=apparmor resume=UUID=97e5678e-9e32-48a7-8118-af3a599297a6 udev.log_priority=3
Desktop: GNOME 3.38.4 tk: GTK 3.24.28 wm: gnome-shell dm: GDM 3.38.2.1 Distro: Manjaro Linux base: Arch Linux
Machine: Type: Desktop Mobo: ASUSTeK model: EX-A320M-GAMING v: Rev X.0x serial: <filter> UEFI: American Megatrends v: 5220
date: 09/12/2019
CPU: Info: Quad Core model: AMD Ryzen 3 3200G with Radeon Vega Graphics bits: 64 type: MCP arch: Zen/Zen+ note: check
family: 17 (23) model-id: 18 (24) stepping: 1 microcode: 8108109 cache: L2: 2 MiB
flags: avx avx2 lm nx pae sse sse2 sse3 sse4_1 sse4_2 sse4a ssse3 svm bogomips: 28756
Speed: 1233 MHz min/max: 1400/3600 MHz boost: enabled Core speeds (MHz): 1: 1233 2: 2850 3: 2820 4: 2956
Vulnerabilities: Type: itlb_multihit status: Not affected
Type: l1tf status: Not affected
Type: mds status: Not affected
Type: meltdown status: Not affected
Type: spec_store_bypass mitigation: Speculative Store Bypass disabled via prctl and seccomp
Type: spectre_v1 mitigation: usercopy/swapgs barriers and __user pointer sanitization
Type: spectre_v2 mitigation: Full AMD retpoline, IBPB: conditional, STIBP: disabled, RSB filling
Type: srbds status: Not affected
Type: tsx_async_abort status: Not affected
Graphics: Device-1: Advanced Micro Devices [AMD/ATI] Picasso vendor: ASUSTeK driver: amdgpu v: kernel bus-ID: 06:00.0
chip-ID: 1002:15d8 class-ID: 0300
Display: wayland server: X.org 1.20.10 compositor: gnome-shell driver: loaded: amdgpu
note: n/a (using device driver) - try sudo/root display-ID: 0 resolution: <missing: xdpyinfo>
OpenGL: renderer: AMD Radeon Vega 8 Graphics (RAVEN DRM 3.35.0 5.4.108-1-MANJARO LLVM 11.1.0) v: 4.6 Mesa 21.0.1
direct render: Yes
Audio: Device-1: Advanced Micro Devices [AMD/ATI] Raven/Raven2/Fenghuang HDMI/DP Audio vendor: ASUSTeK
driver: snd_hda_intel v: kernel bus-ID: 06:00.1 chip-ID: 1002:15de class-ID: 0403
Device-2: Advanced Micro Devices [AMD] Family 17h HD Audio vendor: ASUSTeK driver: snd_hda_intel v: kernel
bus-ID: 06:00.6 chip-ID: 1022:15e3 class-ID: 0403
Sound Server-1: ALSA v: k5.4.108-1-MANJARO running: yes
Sound Server-2: JACK v: 0.125.0 running: no
Sound Server-3: PulseAudio v: 14.2 running: yes
Sound Server-4: PipeWire v: 0.3.24 running: yes
Network: Device-1: Realtek RTL8111/8168/8411 PCI Express Gigabit Ethernet vendor: ASUSTeK driver: r8169 v: kernel port: f000
bus-ID: 04:00.0 chip-ID: 10ec:8168 class-ID: 0200
IF: enp4s0 state: up speed: 1000 Mbps duplex: full mac: <filter>
Drives: Local Storage: total: 689.34 GiB used: 22.07 GiB (3.2%)
SMART Message: Required tool smartctl not installed. Check --recommends
ID-1: /dev/sda maj-min: 8:0 vendor: Kingston model: SA400S37120G size: 111.79 GiB block-size: physical: 512 B
logical: 512 B speed: 6.0 Gb/s rotation: SSD serial: <filter> rev: 0004 scheme: MBR
ID-2: /dev/sdb maj-min: 8:16 vendor: Kingston model: SV300S37A120G size: 111.79 GiB block-size: physical: 512 B
logical: 512 B speed: 6.0 Gb/s rotation: SSD serial: <filter> rev: BBF0 scheme: GPT
ID-3: /dev/sdc maj-min: 8:32 vendor: Seagate model: ST500LM012 HN-M500MBB size: 465.76 GiB block-size:
physical: 4096 B logical: 512 B speed: 3.0 Gb/s rotation: 5400 rpm serial: <filter> rev: 0002 scheme: MBR
Partition: ID-1: / raw-size: 51.89 GiB size: 50.78 GiB (97.85%) used: 12.78 GiB (25.2%) fs: ext4 dev: /dev/sda2 maj-min: 8:2
ID-2: /boot/efi raw-size: 512 MiB size: 511 MiB (99.80%) used: 308 KiB (0.1%) fs: vfat dev: /dev/sda1 maj-min: 8:1
ID-3: /home raw-size: 51.89 GiB size: 50.78 GiB (97.85%) used: 9.29 GiB (18.3%) fs: ext4 dev: /dev/sda3
maj-min: 8:3
Swap: Kernel: swappiness: 60 (default) cache-pressure: 100 (default)
ID-1: swap-1 type: partition size: 7.5 GiB used: 0 KiB (0.0%) priority: -2 dev: /dev/sda4 maj-min: 8:4
Sensors: System Temperatures: cpu: 34.4 C mobo: N/A gpu: amdgpu temp: 34.0 C
Fan Speeds (RPM): N/A
Info: Processes: 255 Uptime: 42m wakeups: 0 Memory: 13.6 GiB used: 3.12 GiB (22.9%) Init: systemd v: 247 tool: systemctl
Compilers: gcc: 10.2.0 Packages: 1231 pacman: 1228 lib: 301 flatpak: 0 snap: 3 Shell: Zsh v: 5.8
running-in: gnome-terminal inxi: 3.3.03
GPU Data:
description: VGA compatible controller
product: Picasso
vendor: Advanced Micro Devices, Inc. [AMD/ATI]
physical id: 0
bus info: pci@0000:06:00.0
version: c9
width: 64 bits
clock: 33MHz
capabilities: pm pciexpress msi msix vga_controller bus_master cap_list rom
configuration: driver=amdgpu latency=0
resources: irq:61 memory:e0000000-efffffff memory:f0000000-f01fffff ioport:e000(size=256) memory:fcc00000-fcc7ffff memory:c0000-dffff
drivers
06:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Picasso (rev c9)
Subsystem: ASUSTeK Computer Inc. Device 876b
Kernel driver in use: amdgpu
Kernel modules: amdgpu