First troubleshooting post here! I’ve been on Manjaro since January 20th and have been browsing the forum for a while now. I’ll try to get more involved in the community if I can find some time! Sorry in advance for my bad English!
So, I’ve been trying to get VFIO working for GPU passthrough to a Windows VM. While I was tweaking my BIOS, something triggered Manjaro to list new drivers for both my Intel HD 630 IGP and my Nvidia GTX 1070 GPU: video-hybrid-intel-nvidia-bumblebee (plus the prime and nouveau-bumblebee variants). I decided to give it a go and uninstalled video-nvidia in favor of video-hybrid-intel-nvidia-bumblebee. After a lot of time and effort I had something barely working, but when I tried to go back to the video-nvidia driver alone, I couldn’t boot past this error:
[FAILED] Failed to start lightdm.service
Fortunately I was able to access a TTY, so I started logging everything I found interesting. I tried a lot of fixes, but nothing worked. Let’s start with my logs:
From systemctl status lightdm:
● lightdm.service - Light Display Manager
   Loaded: loaded (/usr/lib/systemd/system/lightdm.service; enabled; vendor preset: disabled)
   Active: failed (Result: core-dump) since Thu 2018-02-15 20:39:36 EST; 14min ago
     Docs: man:lightdm(1)
  Process: 977 ExecStart=/usr/bin/lightdm (code=dumped, signal=SEGV)
 Main PID: 977 (code=dumped, signal=SEGV)

Feb 15 20:39:36 Main-PC systemd: lightdm.service: Service hold-off time over, scheduling restart.
Feb 15 20:39:36 Main-PC systemd: lightdm.service: Scheduled restart job, restart counter is at 5.
Feb 15 20:39:36 Main-PC systemd: Stopped Light Display Manager.
Feb 15 20:39:36 Main-PC systemd: lightdm.service: Start request repeated too quickly.
Feb 15 20:39:36 Main-PC systemd: lightdm.service: Failed with result 'core-dump'.
Feb 15 20:39:36 Main-PC systemd: Failed to start Light Display Manager.

Next, I ran lightdm in debug mode:
lightdm --test-mode --debug:
[+0.00s] DEBUG: Logging to /var/log/lightdm/lightdm.log
[+0.00s] DEBUG: Starting Light Display Manager 1.24.0, UID=0 PID=1361
[+0.00s] DEBUG: Loading configuration dirs from /usr/share/lightdm/lightdm.conf.d
[+0.00s] DEBUG: Loading configuration dirs from /usr/local/share/lightdm/lightdm.conf.d
[+0.00s] DEBUG: Loading configuration dirs from /etc/xdg/lightdm/lightdm.conf.d
[+0.00s] DEBUG: Loading configuration from /etc/lightdm/lightdm.conf
[+0.00s] DEBUG: Registered seat module local
[+0.00s] DEBUG: Registered seat module xremote
[+0.00s] DEBUG: Registered seat module unity
[+0.00s] DEBUG: Using D-Bus name org.freedesktop.DisplayManager
[+0.00s] DEBUG: Monitoring logind for seats
[+0.00s] DEBUG: New seat added from logind: seat0
[+0.00s] DEBUG: Seat seat0: Loading properties from config section Seat:*
[+0.00s] DEBUG: Seat seat0: Starting
[+0.00s] DEBUG: Seat seat0: Creating greeter session
[+0.00s] DEBUG: Seat seat0: Creating display server of type x
[+0.00s] DEBUG: Could not run plymouth --ping: Failed to execute child process “plymouth” (No such file or directory)
[+0.00s] DEBUG: Using VT 7
[+0.00s] DEBUG: Seat seat0: Starting local X display on VT 7
[+0.00s] DEBUG: XServer 0: Logging to /var/log/lightdm/x-0.log
[+0.00s] DEBUG: XServer 0: Writing X server authority to /run/lightdm/root/:0
[+0.00s] DEBUG: XServer 0: Launching X Server
[+0.00s] DEBUG: Launching process 1366: /usr/bin/X :0 -seat seat0 -auth /run/lightdm/root/:0 -nolisten tcp vt7 -novtswitch
[+0.00s] DEBUG: XServer 0: Waiting for ready signal from X server :0
[+0.00s] DEBUG: Acquired bus name org.freedesktop.DisplayManager
[+0.00s] DEBUG: Registering seat with bus path /org/freedesktop/DisplayManager/Seat0
[+0.00s] DEBUG: Loading users from org.freedesktop.Accounts
[+0.00s] DEBUG: User /org/freedesktop/Accounts/User1000 added
[+0.05s] DEBUG: Seat seat0 changes active session to
[+0.12s] DEBUG: Process 1366 exited with return value 1
[+0.12s] DEBUG: XServer 0: X server stopped
[+0.12s] DEBUG: Releasing VT 7
[+0.12s] DEBUG: XServer 0: Removing X server authority /run/lightdm/root/:0
[+0.12s] DEBUG: Seat seat0: Display server stopped
[+0.12s] DEBUG: Seat seat0: Stopping session
[+0.12s] DEBUG: Seat seat0: Session stopped
[+0.12s] DEBUG: Seat seat0: Stopping display server, no sessions require it
[+0.12s] DEBUG: Seat seat0: Stopping; greeter display server failed to start
[+0.12s] DEBUG: Seat seat0: Stopping
[+0.12s] DEBUG: Seat seat0: Stopped
[+0.12s] DEBUG: Required seat has stopped
[+0.12s] DEBUG: Stopping display manager
[+0.12s] DEBUG: Display manager stopped
[+0.12s] DEBUG: Stopping daemon
Segmentation fault: core dumped
And this is what dmesg shows about lightdm (I’ve pasted only one line since they are all the same; the full dmesg is available on pastebin):
[ 5.514851] traps: lightdm general protection ip:563b92bd420b sp:7ffd4849f210 error:0 in lightdm[563b92bc1000+40000]
From there, I also ran a few greps over the logs and found something interesting in dmesg:
[ 1.998565] NVRM: The NVIDIA probe routine was not called for 1 device(s).
[ 1.998566] NVRM: This can occur when a driver such as:
            NVRM: nouveau, rivafb, nvidiafb or rivatv
            NVRM: was loaded and obtained ownership of the NVIDIA device(s).
[ 1.998566] NVRM: Try unloading the conflicting kernel module (and/or
            NVRM: reconfigure your kernel without the conflicting
            NVRM: driver(s)), then try loading the NVIDIA kernel module
            NVRM: again.
[ 1.998566] NVRM: No NVIDIA graphics adapter probed!
This block repeats 5 times; again, see the full dmesg on pastebin.
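Since the NVRM message says another driver may have taken ownership of the card, one way to see which kernel driver is actually bound to it is to read the driver symlink in sysfs. This is only a minimal sketch, not something taken from my logs: the addresses 0000:01:00.0 and 0000:01:00.1 are assumed from the bus IDs in my inxi output (with the usual 0000 domain prefix), and bound_driver is a hypothetical helper name.

```shell
#!/bin/sh
# Print the kernel driver bound to a PCI device, or "none" if it is unbound.
# $1: full PCI address (assumed here: 0000:01:00.0 for the GPU)
# $2: sysfs root, defaults to /sys (overridable so the helper can be tested)
bound_driver() {
    addr="$1"
    root="${2:-/sys}"
    link="$root/bus/pci/devices/$addr/driver"
    if [ -L "$link" ]; then
        # The symlink points at .../bus/pci/drivers/<name>; keep only <name>
        basename "$(readlink "$link")"
    else
        echo "none"
    fi
}

# GPU (function 0) and its HDMI audio controller (function 1)
for dev in 0000:01:00.0 0000:01:00.1; do
    printf '%s -> %s\n' "$dev" "$(bound_driver "$dev")"
done
```

If this prints vfio-pci or nouveau for 0000:01:00.0, that would match the "obtained ownership" wording in the NVRM message; running lspci -k should show the same information in its "Kernel driver in use" line.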
Fixes I tried so far:
systemctl enable lightdm.service
mkinitcpio -P
update-grub
I can’t see anything of interest being blacklisted in modprobe.d, my xorg.conf files from mhwd all look OK, and mhwd-gpu --check and --status both report OK. It does feel like some older drivers were not correctly uninstalled (so a conflicting kernel module?). I think the most important error message is the NVRM one in dmesg, but I can’t find much on the interwebs and I’m starting to run out of resources…!
Is it just me, or is every post I find online while looking for solutions from 2010-ish…? I rarely find more recent threads… I think I also have a problem with my web searches. It’s been like this since I switched to DuckDuckGo (even when using !g)…
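For anyone who wants to repeat the modprobe.d check, something like the following should list every config line that mentions a GPU-related module. It is a rough sketch under my own assumptions: scan_module_conf is a hypothetical helper name, the pattern (nouveau, nvidia, vfio, bbswitch) and the list of paths are my guesses at what is relevant after a VFIO and bumblebee attempt, and some of those paths may not exist on every install.

```shell
#!/bin/sh
# List lines in module config files/directories that mention GPU-related modules.
# Arguments: any number of files or directories to search.
scan_module_conf() {
    # -r recurse, -n show line numbers, -s stay quiet about missing paths,
    # -E extended regex; fall back to a message when nothing matches
    grep -rnsE 'nouveau|nvidia|vfio|bbswitch' "$@" || echo "no matches"
}

# Paths assumed to be the usual places for module options and early-loaded modules
scan_module_conf /etc/modprobe.d /usr/lib/modprobe.d /etc/modules-load.d /etc/mkinitcpio.conf

# The kernel command line can also carry vfio-pci.ids= or modprobe.blacklist= entries
if [ -r /proc/cmdline ]; then
    tr ' ' '\n' < /proc/cmdline | grep -E 'vfio|nouveau|nvidia|blacklist' \
        || echo "nothing GPU-related on the kernel command line"
fi
```

A leftover "options vfio-pci ids=..." line or a vfio entry in the MODULES array of /etc/mkinitcpio.conf would be the kind of thing I’d expect this to surface if the passthrough setup was not fully reverted.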
So for now, the only thing that works reliably is uninstalling every video driver. Only the monitors connected to my IGP work in that case, though, so I get nothing from my GPU.
My specs right now (with default drivers):
System: Host: Main-PC Kernel: 4.15.0-1-MANJARO x86_64 bits: 64 gcc: 7.2.1 Desktop: N/A Distro: Manjaro Linux
Machine: Device: desktop Mobo: Gigabyte model: H270-Gaming 3 v: x.x serial: N/A UEFI: American Megatrends v: F9b date: 01/10/2018
CPU: Quad core Intel Core i7-7700 (-MT-MCP-) arch: Skylake rev.9 cache: 8192 KB flags: (lm nx sse sse2 sse3 sse4_1 sse4_2 ssse3 vmx) bmips: 28808
  clock speeds: max: 4200 MHz 1: 800 MHz 2: 800 MHz 3: 800 MHz 4: 800 MHz 5: 800 MHz 6: 800 MHz 7: 800 MHz 8: 800 MHz
Graphics: Card-1: Intel HD Graphics 630 bus-ID: 00:02.0
  Card-2: NVIDIA GP104 [GeForce GTX 1070] bus-ID: 01:00.0
  Display Server: N/A driver: modesetting tty size: 80x24
Audio: Card-1 NVIDIA GP104 High Def. Audio Controller driver: vfio-pci bus-ID: 01:00.1
  Card-2 Intel 200 Series PCH HD Audio driver: snd_hda_intel bus-ID: 00:1f.3
  Sound: Advanced Linux Sound Architecture v: k4.15.0-1-MANJARO
Network: Card: Qualcomm Atheros Killer E2500 Gigabit Ethernet Controller driver: alx port: d000 bus-ID: 06:00.0
  IF: enp6s0 state: up speed: 100 Mbps duplex: full mac: <filter>
Drives: HDD Total Size: 3525.7GB (55.2% used)
  ID-1: /dev/sdb model: ST1000DM010 size: 1000.2GB
  ID-2: /dev/sda model: Crucial_CT525MX3 size: 525.1GB
  ID-3: USB /dev/sdc model: My_Passport_0820 size: 2000.4GB
Partition: ID-1: / size: 96G used: 8.1G (9%) fs: ext4 dev: /dev/sda3
  ID-2: /home size: 48G used: 4.7G (11%) fs: ext4 dev: /dev/sda1
  ID-3: /var size: 48G used: 6.3G (14%) fs: ext4 dev: /dev/sdb1
  ID-4: swap-1 size: 33.55GB used: 0.00GB (0%) fs: swap dev: /dev/sdb3
Sensors: System Temperatures: cpu: 29.8C mobo: 27.8C
  Fan Speeds (in rpm): cpu: N/A
Info: Processes: 244 Uptime: 2:13 Memory: 2805.6/15990.5MB Init: systemd Gcc sys: 7.2.1 Client: Shell (bash 4.4.121) inxi: 2.3.56
I’ll be able to continue troubleshooting tomorrow night, but I wanted to share the logs right away so that anyone willing to help has them (I’m guessing this is mostly how we troubleshoot?). I’m still learning, but I learn fast!
Thank you very much in advance!