Frequent system crashes

My first post here. My machine includes:

Operating System: Manjaro Linux
KDE Plasma Version: 5.20.4
KDE Frameworks Version: 5.76.0
Qt Version: 5.15.2
Kernel Version: 5.8.18-1-MANJARO
OS Type: 64-bit
Processors: 8 × Intel® Core™ i7-3820 CPU @ 3.60GHz
Memory: 31.3 GiB of RAM
Graphics Processor: NV168
GTX 1660 Super

Over the last few months, this system has been crashing with increasing frequency. For instance, browsers and other programs stop to respond, the panel disappears from the bottom of the screen, the system update icon is permanently gone. The computer freezes and throwing the power switch is the only solution that works. I am not an expert Linux user, but a refugee from Windows 7. In the past, this computer was used as a budget graphics station, but now I have it mostly for browsing, email and light office work. I don’t even know where to begin to troubleshoot it. If I can’t have a reliable Linux PC, I will have to go back to Windows. I really want to avoid Windows 10. I will be grateful for any pointers to get this machine to work properly. Kind Regards.

Welcome at the forum, @ect1

This kernel is EOL, please change to 5.4 or 5.10 any or another still supported and maintained kernel. When you have changed the kernel, remove kernel 5.8. Then update using

sudo pacman-mirrors -f && sudo pacman -Syyu

Reboot.

To help you better, please post

inxi -Fxxxza --no-host
mhwd -l

Please provide more details, such as what browser, what program and what panel you mean here.

Have you checked already any of your log files?

1 Like

Wollie,

Thank you for your fast reply. I have installed the 5.4.85-1 kernel, but after the download and re-installation of packages, the 5.10.2-2 is listed as presently running.

As to programs:

Firefox slows down to a crawl and behaves as if my WIFI here was very slow, but I seem to be the only one affected by it. Vivaldi and Brave browsers just crash, but the desktop continues to work. FreeCad crashes frequently and often takes Manjaro with it. When it happens, the Task Manager panel disappears. I can move my cursor on the screen but nothing responds to my clicks. Sometimes, my desktop just goes haywire and freezes after I close my browser for instance. I have not looked at the logs yet. I will do it next.

The mhwd returns:

video-nvidia 2020.11.30 false PCI
video-linux 2018.05.04 true PCI
video-modesetting 2020.01.13 true PCI
video-vesa 2017.03.12 true PCI

I’m having trouble including the inxi dump, the post rejects it thinking that it includes links…

Split the inxi output in several blocks, could be that all together is too much for one post.
Use 3 backticks ` in a single line in front and at the end to format it, nicely.

Check journalctl logs and KDE (/var/log/Xorg.0.log). Also dmesg command could be useful.

Thank you. I’ll split it into smaller sizes tomorrow and get the dmesg data Just crashed again, this time in File Manager.

System:    Kernel: 5.10.2-2-MANJARO x86_64 bits: 64 compiler: gcc v: 10.2.0 
                   parameters: BOOT_IMAGE=/boot/vmlinuz-5.10-x86_64 root=UUID=98e30f1a-98f1-49af-ae7e-9df0838d785c rw quiet apparmor=1 
           security=apparmor resume=UUID=ec29530a-ee3c-4ae3-93f4-b17aabb23c94 			   udev.log_priority=3 
           Desktop: KDE Plasma 5.20.4 tk: Qt 5.15.2 wm: kwin_x11 dm: SDDM Distro: Manjaro Linux 

Machine:   Type: Desktop System: Gigabyte product: N/A v: N/A serial: <filter> Chassis: type: 3 serial: <filter> Mobo: Gigabyte model: X79-UP4 v: x.x serial: <filter> UEFI: American Megatrends v: F4 date: 09/05/2013 

CPU:       Info: Quad Core model: Intel Core i7-3820 bits: 64 type: MT MCP arch: Sandy Bridge family: 6 model-id: 2D (45) 
           stepping: 7 microcode: 71A L2 cache: 10 MiB 
           flags: avx lm nx pae sse sse2 sse3 sse4_1 sse4_2 ssse3 vmx bogomips: 57616 
           Speed: 1200 MHz min/max: 1200/3800 MHz Core speeds (MHz): 1: 1200 2: 1200 3: 1200 4: 1200 5: 1200 6: 1200 7: 1200 8: 1200 
           Vulnerabilities: Type: itlb_multihit status: KVM: VMX disabled 
           Type: l1tf mitigation: PTE Inversion; VMX: conditional cache flushes, SMT vulnerable 
           Type: mds mitigation: Clear CPU buffers; SMT vulnerable 
           Type: meltdown mitigation: PTI 
           Type: spec_store_bypass mitigation: Speculative Store Bypass disabled via prctl and seccomp 
           Type: spectre_v1 mitigation: usercopy/swapgs barriers and __user pointer sanitization 
           Type: spectre_v2 mitigation: Full generic retpoline, IBPB: conditional, IBRS_FW,
           Type: srbds status: Not affected 
           Type: tsx_async_abort status: Not affected 

Graphics:  Device-1: NVIDIA TU116 [GeForce GTX 1660 SUPER] driver: nouveau v: kernel 
ID: 02:00.0 chip ID: 10de:21c4 
           Display: x11 server: X.Org 1.20.10 compositor: kwin_x11 driver: modesetting,nouveau alternate: fbdev,nv,vesa 
           display ID: :0 screens: 1 
           Screen-1: 0 s-res: 1680x1050 s-dpi: 96 s-size: 444x277mm (17.5x10.9") s-diag: 523mm (20.6") 
           Monitor-1: DP-1 res: 1680x1050 hz: 60 dpi: 90 size: 473x296mm (18.6x11.7") diag: 558mm (22") 
           OpenGL: renderer: NV168 v: 4.3 Mesa 20.3.1 direct render: Yes```
           bus ID: 00:1b.0 chip ID: 8086:1d20 
           Device-2: NVIDIA TU116 High Definition Audio driver: snd_hda_intel v: kernel bus ID: 02:00.1 chip ID: 10de:1aeb 
           Sound Server: ALSA v: k5.10.2-2-MANJARO 
Network:   Device-1: Intel 82579LM Gigabit Network vendor: Gigabyte driver: e1000e v: kernel port: f020 bus ID: 00:19.0 
           chip ID: 8086:1502 
           IF: eno1 state: up speed: 1000 Mbps duplex: full mac: <filter> 
Drives:    Local Storage: total: 1.13 TiB used: 194.27 GiB (16.8%) 
           SMART Message: Unable to run smartctl. Root privileges required. 
           ID-1: /dev/sda maj-min: 8:0 vendor: Crucial model: CT240BX500SSD1 size: 223.57 GiB block size: physical: 512 B 
           logical: 512 B speed: 6.0 Gb/s serial: <filter> rev: R013 
           ID-2: /dev/sdb maj-min: 8:16 vendor: Western Digital model: WD10EZEX-00UD2A0 size: 931.51 GiB block size: ```
Partition: ID-1: / raw size: 214.47 GiB size: 210.11 GiB (97.96%) used: 121.11 GiB (57.6%) fs: ext4 dev: /dev/sda2 
           maj-min: 8:2 
           ID-2: /boot/efi raw size: 300 MiB size: 299.4 MiB (99.80%) used: 280 KiB (0.1%) fs: vfat dev: /dev/sda1 
           maj-min: 8:1 
Swap:      Kernel: swappiness: 60 (default) cache pressure: 100 (default) 
           ID-1: swap-1 type: partition size: 8.8 GiB used: 0 KiB (0.0%) priority: -2 dev: /dev/sda3 maj-min: 8:3 
Sensors:   System Temperatures: cpu: 39.0 C mobo: N/A gpu: nouveau temp: 58.0 C 
           Fan Speeds (RPM): N/A gpu: nouveau fan: 0 
Info:      Processes: 236 Uptime: 12m wakeups: 0 Memory: 31.28 GiB used: 1.96 GiB (6.3%) Init: systemd v: 247 Compilers: 
           gcc: 10.2.0 Packages: 1424 pacman: 1403 lib: 418 flatpak: 14 snap: 7 Shell: Bash v: 5.1.0 running in: konsole 
           inxi: 3.2.01 ```

Please, install Nvidia drivers by

sudo mhwd -a pci nonfree 0300

Once rebooted, check status by

mhwd -li

[cp@cp-pc ~]$ mhwd -li

Installed PCI configs:


              NAME               VERSION          FREEDRIVER           TYPE

      video-nvidia            2020.11.30               false            PCI
       video-linux            2018.05.04                true            PCI

Warning: No installed USB configs!
[cp@cp-pc ~]$

This is confusing, my kernel change from 5.8 to 5.10 yesterday did replace my Nvidia drivers to this TU116 business.

“Warning: No installed USB configs!” - My USB ports work OK. Does this warning relate to something other than the USB ports? Thank you for your help and patience.

Ignore it.

Crashed again in Firefox. The dmesg dump shows several error lines like:

[ 10.176634] EDAC sbridge: CPU SrcID #0, Ha #0, Channel #0 has DIMMs, but ECC is disabled [ 10.176639] EDAC sbridge: Couldn't find mci handler [ 10.176641] EDAC sbridge: Failed to register device with error -19.

and the last one:

[ 10.267144] EDAC sbridge: CPU SrcID #0, Ha #0, Channel #0 has DIMMs, but ECC is disabled [ 10.267151] EDAC sbridge: Couldn't find mci handler [ 10.267153] EDAC sbridge: Failed to register device with error -19. [ 10.471439] nvidia-gpu 0000:02:00.3: i2c timeout error e0000000 [ 10.471450] ucsi_ccg 0-0008: i2c_transfer failed -110 [ 10.471455] ucsi_ccg 0-0008: ucsi_ccg_init failed - -110