Complete system freeze after login

Hello everyone,

In the past (about 2 years ago) I installed Manjaro and followed a tutorial (which I can’t find anymore) about how to setup nVidia for my Dual GPU Laptop (Intel CPU + nVidia Dedicated). This used to work perfectly for my Asus ROG GL753VD.

The issue that I have now (like last time) is that after I input my password and press enter on the Login Screen, the system completely freezes. Can’t move cursor, can’t go to tty, can’t do anything and I’m forced to turn off and turn on the laptop. This also happens, intermittently, during the boot screen (when the small dots and Manjaro logo are shown, just before the login screen).

Right now I’m using a fresh Manjaro Plasma install, with the Open-Source drivers, as they don’t freeze my screen. No other edits to grub and/or /x11/ conf files.

My aim is to be able to use the nVidia graphics card, support and guidance is very appreciated.
Below is the information that hopefully will speed up this support request.

mhwd-kernel -li

Currently running: 6.5.5-1-MANJARO (linux65)
The following kernels are installed in your system:
   * linux65

mhwd -l --pci

> 0000:03:00.0 (0200:10ec:8168) Network controller Realtek Semiconductor Co., Ltd.:
--------------------------------------------------------------------------------
                  NAME               VERSION          FREEDRIVER           TYPE
--------------------------------------------------------------------------------
         network-r8168            2023.09.12                true            PCI


> 0000:01:00.0 (0302:10de:1c8d) Display controller nVidia Corporation:
--------------------------------------------------------------------------------
                  NAME               VERSION          FREEDRIVER           TYPE
--------------------------------------------------------------------------------
video-hybrid-intel-nvidia-prime            2023.03.23               false            PCI
video-hybrid-intel-nvidia-470xx-prime            2023.03.23               false            PCI
video-hybrid-intel-nvidia-390xx-bumblebee            2023.03.23               false            PCI
          video-nvidia            2023.03.23               false            PCI
    video-nvidia-470xx            2023.03.23               false            PCI
    video-nvidia-390xx            2023.03.23               false            PCI
           video-linux            2018.05.04                true            PCI


> 0000:00:02.0 (0300:8086:591b) Display controller Intel Corporation:
--------------------------------------------------------------------------------
                  NAME               VERSION          FREEDRIVER           TYPE
--------------------------------------------------------------------------------
video-hybrid-intel-nvidia-prime            2023.03.23               false            PCI
video-hybrid-intel-nvidia-470xx-prime            2023.03.23               false            PCI
video-hybrid-intel-nvidia-390xx-bumblebee            2023.03.23               false            PCI
           video-linux            2018.05.04                true            PCI
     video-modesetting            2020.01.13                true            PCI
            video-vesa            2017.03.12                true            PCI

glxinfo | grep OpenGL

OpenGL vendor string: Intel
OpenGL renderer string: Mesa Intel(R) HD Graphics 630 (KBL GT2)
OpenGL core profile version string: 4.6 (Core Profile) Mesa 23.1.9-manjaro1.1
OpenGL core profile shading language version string: 4.60
OpenGL core profile context flags: (none)
OpenGL core profile profile mask: core profile
OpenGL core profile extensions:
OpenGL version string: 4.6 (Compatibility Profile) Mesa 23.1.9-manjaro1.1
OpenGL shading language version string: 4.60
OpenGL context flags: (none)
OpenGL profile mask: compatibility profile
OpenGL extensions:
OpenGL ES profile version string: OpenGL ES 3.2 Mesa 23.1.9-manjaro1.1
OpenGL ES profile shading language version string: OpenGL ES GLSL ES 3.20
OpenGL ES profile extensions:

inxi -Fazy

System:
  Kernel: 6.5.5-1-MANJARO arch: x86_64 bits: 64 compiler: gcc v: 13.2.1
    clocksource: tsc available: acpi_pm
    parameters: BOOT_IMAGE=/boot/vmlinuz-6.5-x86_64
    root=UUID=bc319599-7a94-432e-ac67-3eefbea7303a rw quiet splash
    resume=UUID=65b17bde-db4a-4ee4-8b98-ee939969afab udev.log_priority=3
  Desktop: KDE Plasma v: 5.27.8 tk: Qt v: 5.15.10 wm: kwin_x11 vt: 2 dm: SDDM
    Distro: Manjaro Linux base: Arch Linux
Machine:
  Type: Laptop System: ASUSTeK product: GL753VD v: 1.0
    serial: <superuser required>
  Mobo: ASUSTeK model: GL753VD v: 1.0 serial: <superuser required>
    UEFI: American Megatrends v: GL753VD.308 date: 04/29/2019
Battery:
  ID-1: BAT0 charge: 44.9 Wh (98.0%) condition: 45.8/47.5 Wh (96.4%)
    volts: 16.4 min: 14.4 model: Simplo SDI ICR18650 type: Li-ion
    serial: <filter> status: not charging cycles: 33
CPU:
  Info: model: Intel Core i7-7700HQ bits: 64 type: MT MCP arch: Kaby Lake
    gen: core 7 level: v3 note: check built: 2018 process: Intel 14nm family: 6
    model-id: 0x9E (158) stepping: 9 microcode: 0xF4
  Topology: cpus: 1x cores: 4 tpc: 2 threads: 8 smt: enabled cache:
    L1: 256 KiB desc: d-4x32 KiB; i-4x32 KiB L2: 1024 KiB desc: 4x256 KiB
    L3: 6 MiB desc: 1x6 MiB
  Speed (MHz): avg: 800 min/max: 800/3800 scaling: driver: intel_pstate
    governor: powersave cores: 1: 800 2: 800 3: 800 4: 800 5: 800 6: 800 7: 800
    8: 800 bogomips: 44817
  Flags: avx avx2 ht lm nx pae sse sse2 sse3 sse4_1 sse4_2 ssse3 vmx
  Vulnerabilities:
  Type: gather_data_sampling mitigation: Microcode
  Type: itlb_multihit status: KVM: VMX disabled
  Type: l1tf mitigation: PTE Inversion; VMX: conditional cache flushes, SMT
    vulnerable
  Type: mds mitigation: Clear CPU buffers; SMT vulnerable
  Type: meltdown mitigation: PTI
  Type: mmio_stale_data mitigation: Clear CPU buffers; SMT vulnerable
  Type: retbleed mitigation: IBRS
  Type: spec_rstack_overflow status: Not affected
  Type: spec_store_bypass mitigation: Speculative Store Bypass disabled via
    prctl
  Type: spectre_v1 mitigation: usercopy/swapgs barriers and __user pointer
    sanitization
  Type: spectre_v2 mitigation: IBRS, IBPB: conditional, STIBP: conditional,
    RSB filling, PBRSB-eIBRS: Not affected
  Type: srbds mitigation: Microcode
  Type: tsx_async_abort status: Not affected
Graphics:
  Device-1: Intel HD Graphics 630 vendor: ASUSTeK driver: i915 v: kernel
    arch: Gen-9.5 process: Intel 14nm built: 2016-20 ports: active: eDP-1
    empty: DP-1,HDMI-A-1,HDMI-A-2 bus-ID: 00:02.0 chip-ID: 8086:591b
    class-ID: 0300
  Device-2: NVIDIA GP107M [GeForce GTX 1050 Mobile] vendor: ASUSTeK
    driver: nouveau v: kernel non-free: 535.xx+ status: current (as of 2023-09)
    arch: Pascal code: GP10x process: TSMC 16nm built: 2016-21 pcie: gen: 1
    speed: 2.5 GT/s lanes: 16 link-max: gen: 3 speed: 8 GT/s bus-ID: 01:00.0
    chip-ID: 10de:1c8d class-ID: 0302 temp: 37.0 C
  Device-3: IMC Networks USB2.0 HD UVC WebCam driver: uvcvideo type: USB
    rev: 2.0 speed: 480 Mb/s lanes: 1 mode: 2.0 bus-ID: 1-11:4 chip-ID: 13d3:5666
    class-ID: 0e02 serial: <filter>
  Display: x11 server: X.Org v: 21.1.8 compositor: kwin_x11 driver: X:
    loaded: modesetting alternate: fbdev,vesa dri: iris,nouveau gpu: i915
    display-ID: :0 screens: 1
  Screen-1: 0 s-res: 1920x1080 s-dpi: 96 s-size: 508x285mm (20.00x11.22")
    s-diag: 582mm (22.93") monitors: <missing: xrandr>
  Monitor-1: eDP-1 model: LG Display 0x04e8 built: 2015 res: 1920x1080
    dpi: 128 gamma: 1.2 size: 382x215mm (15.04x8.46") diag: 438mm (17.3")
    ratio: 16:9 modes: 1920x1080
  API: EGL v: 1.5 hw: drv: intel iris drv: nvidia nouveau platforms:
    device: 0 drv: iris device: 1 drv: nouveau device: 2 drv: swrast surfaceless:
    drv: iris x11: drv: iris inactive: gbm,wayland
  API: OpenGL v: 4.6 compat-v: 4.3 vendor: intel mesa v: 23.1.9-manjaro1.1
    glx-v: 1.4 direct-render: yes renderer: Mesa Intel HD Graphics 630 (KBL GT2)
    device-ID: 8086:591b memory: 30.5 GiB unified: yes
  API: Vulkan v: 1.3.264 layers: N/A device: 0 type: integrated-gpu
    name: Intel HD Graphics 630 (KBL GT2) driver: mesa intel v: 23.1.9-manjaro1.1
    device-ID: 8086:591b surfaces: xcb,xlib
Audio:
  Device-1: Intel CM238 HD Audio vendor: ASUSTeK driver: snd_hda_intel
    v: kernel alternate: snd_soc_avs bus-ID: 00:1f.3 chip-ID: 8086:a171
    class-ID: 0403
  API: ALSA v: k6.5.5-1-MANJARO status: kernel-api with: aoss
    type: oss-emulator tools: alsactl,alsamixer,amixer
  Server-1: JACK v: 1.9.22 status: off tools: N/A
  Server-2: PipeWire v: 0.3.80 status: off with: pipewire-media-session
    status: active tools: pw-cli
  Server-3: PulseAudio v: 16.1 status: active with: pulseaudio-alsa
    type: plugin tools: pacat,pactl
Network:
  Device-1: Intel Wireless 7265 driver: iwlwifi v: kernel pcie: gen: 1
    speed: 2.5 GT/s lanes: 1 bus-ID: 02:00.0 chip-ID: 8086:095a class-ID: 0280
  IF: wlp2s0 state: up mac: <filter>
  Device-2: Realtek RTL8111/8168/8411 PCI Express Gigabit Ethernet
    vendor: ASUSTeK driver: r8169 v: kernel pcie: gen: 1 speed: 2.5 GT/s lanes: 1
    port: d000 bus-ID: 03:00.0 chip-ID: 10ec:8168 class-ID: 0200
  IF: enp3s0 state: down mac: <filter>
Bluetooth:
  Device-1: Intel Bluetooth wireless interface driver: btusb v: 0.8 type: USB
    rev: 2.0 speed: 12 Mb/s lanes: 1 mode: 1.1 bus-ID: 1-9:3 chip-ID: 8087:0a2a
    class-ID: e001
  Report: rfkill ID: hci0 rfk-id: 3 state: up address: see --recommends
Drives:
  Local Storage: total: 465.76 GiB used: 12.63 GiB (2.7%)
  SMART Message: Unable to run smartctl. Root privileges required.
  ID-1: /dev/nvme0n1 maj-min: 259:0 vendor: Samsung
    model: SSD 970 EVO Plus 500GB size: 465.76 GiB block-size: physical: 512 B
    logical: 512 B speed: 31.6 Gb/s lanes: 4 tech: SSD serial: <filter>
    fw-rev: 2B2QEXM7 temp: 38.9 C scheme: GPT
Partition:
  ID-1: / raw-size: 431.1 GiB size: 423.27 GiB (98.18%) used: 12.63 GiB (3.0%)
    fs: ext4 dev: /dev/nvme0n1p2 maj-min: 259:2
  ID-2: /boot/efi raw-size: 300 MiB size: 299.4 MiB (99.80%)
    used: 288 KiB (0.1%) fs: vfat dev: /dev/nvme0n1p1 maj-min: 259:1
Swap:
  Kernel: swappiness: 60 (default) cache-pressure: 100 (default) zswap: yes
    compressor: zstd max-pool: 20%
  ID-1: swap-1 type: partition size: 34.36 GiB used: 0 KiB (0.0%)
    priority: -2 dev: /dev/nvme0n1p3 maj-min: 259:3
Sensors:
  System Temperatures: cpu: 46.0 C pch: 52.0 C mobo: N/A gpu: nouveau
    temp: 38.0 C
  Fan Speeds (rpm): cpu: 2000
Info:
  Processes: 228 Uptime: 4h 57m wakeups: 55982 Memory: total: 32 GiB
  available: 31.24 GiB used: 4 GiB (12.8%) Init: systemd v: 254
  default: graphical tool: systemctl Compilers: gcc: 13.2.1 clang: 16.0.6
  Packages: pm: pacman pkgs: 1159 libs: 317 tools: pamac pm: flatpak pkgs: 0
  Shell: Zsh v: 5.9 default: Bash v: 5.1.16 running-in: konsole inxi: 3.3.30

:point_up: This alone can be a recipe for trouble & a lot of tinkering to get it working.

Maybe someone familiar with Optimus/Prime + bumblebee & dual-GPU setups in general can help? I gave up on trying to get this to work the one time I encountered it.

You need to install the NVIDIA proprietary drivers for Optimus to work. Open Manjaro Settings Manager > Hardware configuration, click the Auto Install Proprietary Driver and reboot.

1 Like

This shows whats available, not what you have installed.

mhwd -li

What driver profile were you using before?

If I do that, I will not be able to login into the Desktop Environment (After password on login screen) as it will instantly freeze after pressing Enter.

Before I installed with proprietary drivers. Which ended up with freezes.
Then I tried Open Source + Install the appropriate drivers with sudo mhwd -a pci nonfree 0300 which then pushed me back into the freeze states.

Here is the list for mhwd -li, not too much offered, as it is a fresh install with Open Source drivers.

> Installed PCI configs:
--------------------------------------------------------------------------------
                  NAME               VERSION          FREEDRIVER           TYPE
--------------------------------------------------------------------------------
           video-linux            2018.05.04                true            PCI


Warning: No installed USB configs!

Interesting…
I was in a similar boat (Laptop with dual GPU: Intel Iris i915 and Nvidia 4060), but my problem was the opposite of yours;
I kept having frequent hard freezes when using the open source drivers!
I fixed it by switching to the non-free ones and using envycontrol to permanently enable the discrete GPU (see here for context: Random freezes (and horrible graphics performances) on new laptop)

So after following @damdam’s topic, I managed to play with envycontrol to reach a more controlled state of freezes.
I know now that they happen exactly when the nVidia card activates.

If I run in integrated or hybrid, I get no issues, except when in hybrid it switches to nVidia card.
Goes without saying, if I choose nVidia card, system instantly freezes up.

Only thing frustrating is that I don’t have any indications of what when wrong when I run:

journalctl -b-1 -p4 --no-pager

oct 13 00:07:15 Akhlis-Linux kernel: x86/cpu: SGX disabled by BIOS.
oct 13 00:07:15 Akhlis-Linux kernel: MDS CPU bug present and SMT on, data leak possible. See https://www.kernel.org/doc/html/latest/admin-guide/hw-vuln/mds.html for more details.
oct 13 00:07:15 Akhlis-Linux kernel: MMIO Stale Data CPU bug present and SMT on, data leak possible. See https://www.kernel.org/doc/html/latest/admin-guide/hw-vuln/processor_mmio_stale_data.html for more details.
oct 13 00:07:15 Akhlis-Linux kernel: ENERGY_PERF_BIAS: Set to 'normal', was 'performance'
oct 13 00:07:15 Akhlis-Linux kernel: ACPI BIOS Error (bug): Failure creating named object [\_SB.PCI0.SAT0.PRT0], AE_ALREADY_EXISTS (20230331/dswload2-326)
oct 13 00:07:15 Akhlis-Linux kernel: ACPI Error: AE_ALREADY_EXISTS, During name lookup/catalog (20230331/psobject-220)
oct 13 00:07:15 Akhlis-Linux kernel: ACPI BIOS Error (bug): Failure creating named object [\_SB.PCI0.SAT0.PRT1], AE_ALREADY_EXISTS (20230331/dswload2-326)
oct 13 00:07:15 Akhlis-Linux kernel: ACPI Error: AE_ALREADY_EXISTS, During name lookup/catalog (20230331/psobject-220)
oct 13 00:07:15 Akhlis-Linux kernel: ACPI BIOS Error (bug): Failure creating named object [\_SB.PCI0.SAT0.PRT3], AE_ALREADY_EXISTS (20230331/dswload2-326)
oct 13 00:07:15 Akhlis-Linux kernel: ACPI Error: AE_ALREADY_EXISTS, During name lookup/catalog (20230331/psobject-220)
oct 13 00:07:15 Akhlis-Linux kernel: ACPI BIOS Error (bug): Failure creating named object [\_SB.PCI0.SAT0.PRT4], AE_ALREADY_EXISTS (20230331/dswload2-326)
oct 13 00:07:15 Akhlis-Linux kernel: ACPI Error: AE_ALREADY_EXISTS, During name lookup/catalog (20230331/psobject-220)
oct 13 00:07:15 Akhlis-Linux kernel: ACPI BIOS Error (bug): Failure creating named object [\_SB.PCI0.SAT0.PRT5], AE_ALREADY_EXISTS (20230331/dswload2-326)
oct 13 00:07:15 Akhlis-Linux kernel: ACPI Error: AE_ALREADY_EXISTS, During name lookup/catalog (20230331/psobject-220)
oct 13 00:07:15 Akhlis-Linux kernel: hpet_acpi_add: no address or irqs in _CRS
oct 13 00:07:15 Akhlis-Linux kernel: tpm_crb MSFT0101:00: [Firmware Bug]: ACPI region does not cover the entire command/response buffer. [mem 0xfed40000-0xfed4087f flags 0x200] vs fed40080 f80
oct 13 00:07:15 Akhlis-Linux kernel: tpm_crb MSFT0101:00: [Firmware Bug]: ACPI region does not cover the entire command/response buffer. [mem 0xfed40000-0xfed4087f flags 0x200] vs fed40080 f80
oct 13 00:07:15 Akhlis-Linux kernel: ahci 0000:00:17.0: can't derive routing for PCI INT A
oct 13 00:07:15 Akhlis-Linux kernel: ahci 0000:00:17.0: PCI INT A: not connected
oct 13 00:07:15 Akhlis-Linux kernel: resource: resource sanity check: requesting [mem 0x00000000fdffe800-0x00000000fe0007ff], which spans more than pnp 00:05 [mem 0xfdb00000-0xfdffffff]
oct 13 00:07:15 Akhlis-Linux kernel: caller get_primary_reg_base+0x4f/0xb0 mapping multiple BARs
oct 13 00:07:15 Akhlis-Linux kernel: usb: port power management may be unreliable
oct 13 00:07:15 Akhlis-Linux kernel: nvme nvme0: missing or invalid SUBNQN field.
oct 13 00:07:14 Akhlis-Linux systemd-vconsole-setup[255]: KD_FONT_OP_GET failed while trying to get the font metadata: Invalid argument
oct 13 00:07:14 Akhlis-Linux systemd-vconsole-setup[255]: Fonts will not be copied to remaining consoles

Currently my drivers are the following:
mhwd -li

> Installed PCI configs:
--------------------------------------------------------------------------------
                  NAME               VERSION          FREEDRIVER           TYPE
--------------------------------------------------------------------------------
video-hybrid-intel-nvidia-prime            2023.03.23               false            PCI
           video-linux            2018.05.04                true            PCI
     video-modesetting            2020.01.13                true            PCI