Xorg-Server 1.20.9 crashes with Nvidia 450xx drivers

Bad news.
I get a core dump from Xorg, and rtkit-daemon should not be running this many times.

août 27 12:13:10 mjro rtkit-daemon[3993]: Supervising 8 threads of 6 processes of 1 users.
août 27 12:13:10 mjro rtkit-daemon[3993]: Supervising 8 threads of 6 processes of 1 users.
août 27 12:19:49 mjro rtkit-daemon[3993]: Supervising 6 threads of 4 processes of 1 users.
août 27 12:19:49 mjro rtkit-daemon[3993]: Supervising 6 threads of 4 processes of 1 users.
août 27 12:19:51 mjro rtkit-daemon[3993]: Supervising 6 threads of 4 processes of 1 users.
août 27 12:19:51 mjro rtkit-daemon[3993]: Supervising 6 threads of 4 processes of 1 users.
août 27 12:20:06 mjro rtkit-daemon[3993]: Supervising 4 threads of 2 processes of 1 users.
août 27 12:20:06 mjro rtkit-daemon[3993]: Supervising 4 threads of 2 processes of 1 users.
août 27 12:20:07 mjro rtkit-daemon[3993]: Supervising 4 threads of 2 processes of 1 users.
août 27 12:20:07 mjro rtkit-daemon[3993]: Supervising 4 threads of 2 processes of 1 users.
août 27 12:20:08 mjro rtkit-daemon[3993]: Supervising 4 threads of 2 processes of 1 users.
août 27 12:20:08 mjro rtkit-daemon[3993]: Supervising 4 threads of 2 processes of 1 users.
août 27 12:20:09 mjro rtkit-daemon[3993]: Supervising 4 threads of 2 processes of 1 users.
août 27 12:20:09 mjro rtkit-daemon[3993]: Supervising 4 threads of 2 processes of 1 users.
août 27 12:20:10 mjro rtkit-daemon[3993]: Supervising 4 threads of 2 processes of 1 users.
août 27 12:20:10 mjro rtkit-daemon[3993]: Supervising 4 threads of 2 processes of 1 users.
août 27 12:20:11 mjro rtkit-daemon[3993]: Supervising 4 threads of 2 processes of 1 users.
août 27 12:20:11 mjro rtkit-daemon[3993]: Supervising 4 threads of 2 processes of 1 users.
août 27 12:21:26 mjro rtkit-daemon[3993]: Supervising 4 threads of 2 processes of 1 users.
août 27 12:21:26 mjro rtkit-daemon[3993]: Supervising 4 threads of 2 processes of 1 users.
août 27 12:21:27 mjro rtkit-daemon[3993]: Supervising 4 threads of 2 processes of 1 users.
août 27 12:21:27 mjro rtkit-daemon[3993]: Supervising 4 threads of 2 processes of 1 users.
août 27 12:25:41 mjro rtkit-daemon[3993]: Supervising 4 threads of 2 processes of 1 users.
août 27 12:25:41 mjro rtkit-daemon[3993]: Supervising 4 threads of 2 processes of 1 users.
août 27 12:25:41 mjro rtkit-daemon[3993]: Supervising 4 threads of 2 processes of 1 users.
août 27 12:25:41 mjro rtkit-daemon[3993]: Supervising 4 threads of 2 processes of 1 users.
août 27 12:25:41 mjro rtkit-daemon[3993]: Successfully made thread 110757 of process 108228 owned by '1000' RT at priority 10.
août 27 12:25:41 mjro rtkit-daemon[3993]: Supervising 5 threads of 3 processes of 1 users.
août 27 12:32:36 mjro rtkit-daemon[3993]: Supervising 5 threads of 3 processes of 1 users.
août 27 12:32:36 mjro rtkit-daemon[3993]: Supervising 5 threads of 3 processes of 1 users.
août 27 12:32:37 mjro rtkit-daemon[3993]: Supervising 5 threads of 3 processes of 1 users.
août 27 12:32:37 mjro rtkit-daemon[3993]: Supervising 5 threads of 3 processes of 1 users.
août 27 12:33:01 mjro audit[492]: ANOM_ABEND auid=4294967295 uid=0 gid=0 ses=4294967295 pid=492 comm="Xorg" exe="/usr/lib/Xorg" sig=6 res=1
août 27 12:33:01 mjro kernel: audit: type=1701 audit(1598524381.476:116): auid=4294967295 uid=0 gid=0 ses=4294967295 pid=492 comm="Xorg" exe="/usr/lib/Xorg" sig=6 res=1
août 27 12:33:01 mjro systemd[1]: Created slice system-systemd\x2dcoredump.slice.
août 27 12:33:01 mjro audit: BPF prog-id=18 op=LOAD
août 27 12:33:01 mjro audit: BPF prog-id=19 op=LOAD
août 27 12:33:01 mjro systemd[1]: Started Process Core Dump (PID 113398/UID 0).
août 27 12:33:01 mjro audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=systemd-coredump@0-113398-0 comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=>
août 27 12:33:01 mjro kernel: audit: type=1334 audit(1598524381.490:117): prog-id=18 op=LOAD
août 27 12:33:01 mjro kernel: audit: type=1334 audit(1598524381.490:118): prog-id=19 op=LOAD
août 27 12:33:01 mjro kernel: audit: type=1130 audit(1598524381.490:119): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=systemd-coredump@0-113398-0 comm="systemd" exe="/usr/lib/syste>
août 27 12:33:01 mjro xfce4-notifyd[4692]: xfce4-notifyd: Fatal IO error 11 (Ressource temporairement non disponible) on X server :0.
août 27 12:33:01 mjro systemd[3594]: xfce4-notifyd.service: Main process exited, code=exited, status=1/FAILURE
août 27 12:33:01 mjro polkitd[527]: Unregistered Authentication Agent for unix-session:2 (system bus name :1.37, object path /org/gnome/PolicyKit1/AuthenticationAgent, locale fr_FR.utf8) (d>
août 27 12:33:01 mjro at-spi-bus-launcher[3766]: X connection to :0 broken (explicit kill or server shutdown).
août 27 12:33:01 mjro pulseaudio[3870]: X connection to :0.0 broken (explicit kill or server shutdown).
août 27 12:33:01 mjro systemd[3594]: xfce4-notifyd.service: Failed with result 'exit-code'.
août 27 12:33:01 mjro systemd[3594]: pulseaudio.service: Main process exited, code=exited, status=1/FAILURE
août 27 12:33:01 mjro systemd[3594]: pulseaudio.service: Failed with result 'exit-code'.
août 27 12:33:01 mjro audit[1062]: USER_LOGOUT pid=1062 uid=0 auid=1000 ses=2 msg='op=logout id=1000 exe="/usr/bin/lightdm" hostname=mjro addr=? terminal=/dev/tty7 res=success'
août 27 12:33:01 mjro lightdm[1062]: pam_unix(lightdm:session): session closed for user scoin
août 27 12:33:01 mjro audit[1062]: USER_END pid=1062 uid=0 auid=1000 ses=2 msg='op=PAM:session_close grantors=pam_loginuid,pam_keyinit,pam_limits,pam_unix,pam_permit,pam_mail,pam_systemd,pa>
août 27 12:33:01 mjro kernel: audit: type=1113 audit(1598524381.833:120): pid=1062 uid=0 auid=1000 ses=2 msg='op=logout id=1000 exe="/usr/bin/lightdm" hostname=mjro addr=? terminal=/dev/tty>
août 27 12:33:01 mjro kernel: audit: type=1106 audit(1598524381.833:121): pid=1062 uid=0 auid=1000 ses=2 msg='op=PAM:session_close grantors=pam_loginuid,pam_keyinit,pam_limits,pam_unix,pam_>
août 27 12:33:01 mjro audit[1062]: CRED_DISP pid=1062 uid=0 auid=1000 ses=2 msg='op=PAM:setcred grantors=pam_shells,pam_faillock,pam_permit,pam_faillock,pam_gnome_keyring acct="scoin" exe=">
août 27 12:33:01 mjro systemd-logind[437]: Session 2 logged out. Waiting for processes to exit.
août 27 12:33:01 mjro kernel: audit: type=1104 audit(1598524381.833:122): pid=1062 uid=0 auid=1000 ses=2 msg='op=PAM:setcred grantors=pam_shells,pam_faillock,pam_permit,pam_faillock,pam_gno>
août 27 12:33:02 mjro systemd[3594]: pulseaudio.service: Scheduled restart job, restart counter is at 1.
août 27 12:33:02 mjro systemd[3594]: Stopped Sound Service.
août 27 12:33:02 mjro systemd[3594]: Starting Sound Service...
août 27 12:33:02 mjro rtkit-daemon[3993]: Successfully made thread 113403 of process 113403 owned by '1000' high priority at nice level -11.
août 27 12:33:02 mjro rtkit-daemon[3993]: Supervising 1 threads of 1 processes of 1 users.
août 27 12:33:02 mjro pulseaudio[113403]: W: [pulseaudio] pid.c: Stale PID file, overwriting.
août 27 12:33:02 mjro rtkit-daemon[3993]: Supervising 1 threads of 1 processes of 1 users.
août 27 12:33:02 mjro rtkit-daemon[3993]: Successfully made thread 113430 of process 113403 owned by '1000' RT at priority 5.
août 27 12:33:02 mjro rtkit-daemon[3993]: Supervising 2 threads of 1 processes of 1 users.
août 27 12:33:02 mjro rtkit-daemon[3993]: Supervising 2 threads of 1 processes of 1 users.
août 27 12:33:02 mjro rtkit-daemon[3993]: Successfully made thread 113431 of process 113403 owned by '1000' RT at priority 5.
août 27 12:33:02 mjro rtkit-daemon[3993]: Supervising 3 threads of 1 processes of 1 users.
août 27 12:33:02 mjro dbus-daemon[433]: [system] Activating via systemd: service name='org.bluez' unit='dbus-org.bluez.service' requested by ':1.1540' (uid=1000 pid=113403 comm="/usr/bin/pu>
août 27 12:33:02 mjro dbus-daemon[433]: [system] Activation via systemd failed for unit 'dbus-org.bluez.service': Unit dbus-org.bluez.service is masked.
août 27 12:33:02 mjro systemd[3594]: Started Sound Service.
août 27 12:33:02 mjro pulseaudio[113403]: E: [pulseaudio] bluez5-util.c: GetManagedObjects() failed: org.freedesktop.systemd1.UnitMasked: Unit dbus-org.bluez.service is masked.
août 27 12:33:02 mjro systemd-coredump[113399]: Process 492 (Xorg) of user 0 dumped core.
                                                 
                                                 Stack trace of thread 492:
                                                 #0  0x00007fbd6c672615 raise (libc.so.6 + 0x3d615)
                                                 #1  0x00007fbd6c65b862 abort (libc.so.6 + 0x26862)
                                                 #2  0x000055fc71f2e38a OsAbort (Xorg + 0x14a38a)
                                                 #3  0x000055fc71f2fe51 FatalError (Xorg + 0x14be51)
                                                 #4  0x000055fc71f35aa9 n/a (Xorg + 0x151aa9)
                                                 #5  0x00007fbd6c6726a0 __restore_rt (libc.so.6 + 0x3d6a0)
                                                 #6  0x000055fc71e62f03 n/a (Xorg + 0x7ef03)
                                                 #7  0x000055fc71e8a03b ChangeWindowDeviceCursor (Xorg + 0xa603b)
                                                 #8  0x000055fc71ef05fd n/a (Xorg + 0x10c5fd)
                                                 #9  0x000055fc71e1e165 n/a (Xorg + 0x3a165)
                                                 #10 0x00007fbd6c65d152 __libc_start_main (libc.so.6 + 0x28152)
                                                 #11 0x000055fc71e1e5ae _start (Xorg + 0x3a5ae)
août 27 12:33:02 mjro systemd[1]: systemd-coredump@0-113398-0.service: Succeeded.

Second reboot.
After the screen goes to sleep, the login and password are accepted, but the Xorg server fails.

août 27 20:33:49 mjro lightdm[182845]: gkr-pam: unable to locate daemon control file
août 27 20:34:04 mjro systemd[670]: xfce4-notifyd.service: Failed with result 'exit-code'.
août 27 20:34:04 mjro systemd[670]: pulseaudio.service: Failed with result 'exit-code'.
août 27 20:34:05 mjro systemd-coredump[191858]: Process 541 (Xorg) of user 0 dumped core.
                                                 
                                                 Stack trace of thread 541:
                                                 #0  0x00007f271a0f6615 raise (libc.so.6 + 0x3d615)
                                                 #1  0x00007f271a0df862 abort (libc.so.6 + 0x26862)
                                                 #2  0x0000565539b6738a OsAbort (Xorg + 0x14a38a)
                                                 #3  0x0000565539b68e51 FatalError (Xorg + 0x14be51)
                                                 #4  0x0000565539b6eaa9 n/a (Xorg + 0x151aa9)
                                                 #5  0x00007f271a0f66a0 __restore_rt (libc.so.6 + 0x3d6a0)
                                                 #6  0x0000565539a6f384 n/a (Xorg + 0x52384)
                                                 #7  0x0000565539a66deb mieqProcessInputEvents (Xorg + 0x49deb)
                                                 #8  0x0000565539b7d5c9 ProcessInputEvents (Xorg + 0x1605c9)
                                                 #9  0x0000565539a568c8 n/a (Xorg + 0x398c8)
                                                 #10 0x00007f271a0e1152 __libc_start_main (libc.so.6 + 0x28152)
                                                 #11 0x0000565539a575ae _start (Xorg + 0x3a5ae)
août 27 20:34:18 mjro lightdm[191974]: gkr-pam: unable to locate daemon control file
août 27 20:34:18 mjro kernel: kauditd_printk_skb: 3 callbacks suppressed
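
For anyone comparing traces like the two above, here is a small shell helper, offered only as a sketch. It prints the first named function after the signal trampoline (`__restore_rt`), which is the caller closest to the crash that still has a symbol. The name `faulting_caller` is made up for this example, and it assumes the systemd-coredump frame layout shown in this thread (`#N address symbol (module + offset)`).

```shell
# Print the first named frame that follows __restore_rt in a
# systemd-coredump stack trace read from stdin.
# Assumes frames look like: "#7  0x... symbol (module + 0xoffset)".
faulting_caller() {
    awk '
        # The signal trampoline marks where the crashing code begins.
        /__restore_rt/ { after_signal = 1; next }
        # Skip unresolved frames ("n/a"); stop at the first named one.
        after_signal && $1 ~ /^#[0-9]+$/ && $3 != "n/a" { print $3; exit }
    '
}
```

Fed the first trace above it prints `ChangeWindowDeviceCursor`, and for the second one `mieqProcessInputEvents`. Piping `coredumpctl info Xorg` into it should work too, as long as that prints the trace in the same layout.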

@stephane can you try with the nvidia-440xx driver or nouveau?

Which package versions are involved here?

I have:

rtkit 0.13-1
xorg-server 1.20.9-1
nvidia-450xx-utils 450.66-1
linux58-nvidia-450xx 450.66-4

And all is working fine.

From testing:

  • Xorg 1.20.9 + 450xx drivers: fails when the display wakes from sleep (login works, Xorg crashes afterwards)
  • Xorg 1.20.8 + 450xx drivers: OK
  • Xorg 1.20.9 + 440xx drivers: fails when the display wakes from sleep (login works, Xorg crashes afterwards)

sudo downgrade xorg-server

Available packages:

-   1)  xorg-server    1.20.4  1  x86_64  (remote)
-   2)  xorg-server    1.20.5  1  x86_64  (remote)
-   3)  xorg-server    1.20.5  2  x86_64  (remote)
    4)  xorg-server    1.20.5  3  x86_64  (remote)
    5)  xorg-server    1.20.5  4  x86_64  (remote)
-   6)  xorg-server    1.20.6  1  x86_64  (remote)
-   7)  xorg-server    1.20.6  2  x86_64  (remote)
-   8)  xorg-server    1.20.6  3  x86_64  (remote)
-   9)  xorg-server    1.20.7  1  x86_64  (remote)
-  10)  xorg-server    1.20.8  1  x86_64  (remote)
-  11)  xorg-server    1.20.8  2  x86_64  (remote)
-  12)  xorg-server    1.20.8  3  x86_64  (remote)
-  13)  xorg-server    1.20.8  4  x86_64  (local)   (OK)
+  14)  xorg-server    1.20.9  1  x86_64  (remote)
+  15)  xorg-server    1.20.9  1  x86_64  (local)   (crashes)

I’m experiencing the Xorg crashes too. They typically occur after suspend. I have a multi-monitor setup and the 450xx drivers. After rolling back to 1.20.8, everything seems stable again.

Aug 28 17:18:35 raven-lpc systemd-coredump[726165]: Process 1337 (Xorg) of user 0 dumped core.

                                                Stack trace of thread 1337:
                                                #0  0x00007f0145bff615 raise (libc.so.6 + 0x3d615)
                                                #1  0x00007f0145be8862 abort (libc.so.6 + 0x26862)
                                                #2  0x0000564899c1138a OsAbort (Xorg + 0x14a38a)
                                                #3  0x0000564899c12e51 FatalError (Xorg + 0x14be51)
                                                #4  0x0000564899c18aa9 n/a (Xorg + 0x151aa9)
                                                #5  0x00007f0145bff6a0 __restore_rt (libc.so.6 + 0x3d6a0)
                                                #6  0x0000564899b45f03 n/a (Xorg + 0x7ef03)
                                                #7  0x0000564899b68faf ChangeWindowAttributes (Xorg + 0xa1faf)
                                                #8  0x0000564899b36a58 n/a (Xorg + 0x6fa58)
                                                #9  0x0000564899b01165 n/a (Xorg + 0x3a165)
                                                #10 0x00007f0145bea152 __libc_start_main (libc.so.6 + 0x28152)
                                                #11 0x0000564899b015ae _start (Xorg + 0x3a5ae)
                                                
                                                Stack trace of thread 1344:
                                                #0  0x00007f0145ab76a2 pthread_cond_wait@@GLIBC_2.3.2 (libpthread.so.0 + 0xf6a2)
                                                #1  0x00007f014027990c n/a (iris_dri.so + 0x4ae90c)
                                                #2  0x00007f0140278308 n/a (iris_dri.so + 0x4ad308)
                                                #3  0x00007f0145ab13e9 start_thread (libpthread.so.0 + 0x93e9)
                                                #4  0x00007f0145cc2293 __clone (libc.so.6 + 0x100293)
                                                
                                                Stack trace of thread 1346:
                                                #0  0x00007f0145ab76a2 pthread_cond_wait@@GLIBC_2.3.2 (libpthread.so.0 + 0xf6a2)
                                                #1  0x00007f014027990c n/a (iris_dri.so + 0x4ae90c)
                                                #2  0x00007f0140278308 n/a (iris_dri.so + 0x4ad308)
                                                #3  0x00007f0145ab13e9 start_thread (libpthread.so.0 + 0x93e9)
                                                #4  0x00007f0145cc2293 __clone (libc.so.6 + 0x100293)
                                                
                                                Stack trace of thread 1345:
                                                #0  0x00007f0145ab76a2 pthread_cond_wait@@GLIBC_2.3.2 (libpthread.so.0 + 0xf6a2)
                                                #1  0x00007f014027990c n/a (iris_dri.so + 0x4ae90c)
                                                #2  0x00007f0140278308 n/a (iris_dri.so + 0x4ad308)
                                                #3  0x00007f0145ab13e9 start_thread (libpthread.so.0 + 0x93e9)
                                                #4  0x00007f0145cc2293 __clone (libc.so.6 + 0x100293)
                                                
                                                Stack trace of thread 1347:
                                                #0  0x00007f0145ab76a2 pthread_cond_wait@@GLIBC_2.3.2 (libpthread.so.0 + 0xf6a2)
                                                #1  0x00007f014027990c n/a (iris_dri.so + 0x4ae90c)
                                                #2  0x00007f0140278308 n/a (iris_dri.so + 0x4ad308)
                                                #3  0x00007f0145ab13e9 start_thread (libpthread.so.0 + 0x93e9)
                                                #4  0x00007f0145cc2293 __clone (libc.so.6 + 0x100293)

Aug 28 17:18:35 raven-lpc systemd[1]: systemd-coredump@2-726164-0.service: Succeeded.

I am on testing, have updated Xorg, switched to the 450xx drivers, then switched to Nvidia mode. Everything works, and I can’t find any crashes or core dumps like those described here. However, I only tried this in a single-monitor setup. Maybe things will change when I plug in another monitor.

That gives me an idea. Maybe PRIME synchronization got broken again by the 1.20.9 update. It got fixed with the 450xx drivers so I never had to turn it off again when I updated to those drivers. Maybe if I disable PRIME sync it will stabilize again. Will test and get back to this.

Nope. After 3 suspends xorg crashed again. Rolling back to 1.20.8 for now.

It seems this also occurs on Arch, and someone there has already reported it upstream as issue 1073. Those who are experiencing this issue will have to hold off on 1.20.9.

I have the same problem on an AMD-only system:

So it should work fine again if I downgrade to 1.20.8?

Yes. Just downgrade xorg-server and xorg-server-common. I’m currently bisecting the xorg-server source to find the commit responsible for this so maybe we can have the Manjaro devs revert it for our Manjaro package while upstream solves it for the next official release.
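
A minimal sketch of that rollback, assuming the `downgrade` helper used earlier in this thread is installed. The function name `rollback_xorg` and the `DRY_RUN` switch are invented for this example; with `DRY_RUN=1` the command is only printed, so it can be checked before anything is changed.

```shell
# Roll back the given packages with the `downgrade` helper.
# With no arguments, defaults to xorg-server and xorg-server-common.
# DRY_RUN=1 prints the command instead of running it.
rollback_xorg() {
    [ "$#" -gt 0 ] || set -- xorg-server xorg-server-common
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "sudo downgrade $*"
    else
        # downgrade is interactive: pick the 1.20.8 entry from its list.
        sudo downgrade "$@"
    fi
}
```

Calling `rollback_xorg xorg-server` limits the rollback to the server package alone.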

Or simply ignore the packages in upcoming updates until this is fixed.
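
Ignoring packages is normally done with an `IgnorePkg` line in `/etc/pacman.conf`, for example `IgnorePkg = xorg-server xorg-server-common`. The helper below is just a sketch for checking whether such a line is already in place. `is_ignored` is a made-up name, and it only looks at plain `IgnorePkg` lines in the one file it is given, so included config files and glob patterns are not handled.

```shell
# Succeed when package $1 appears on an active (uncommented)
# IgnorePkg line of the pacman config file $2.
is_ignored() {
    grep -E '^[[:space:]]*IgnorePkg[[:space:]]*=' "$2" \
        | sed 's/^[^=]*=//' \
        | tr -s '[:space:]' '\n' \
        | grep -Fxq -- "$1"
}
```

For example, `is_ignored xorg-server /etc/pacman.conf && echo "held back"`. Remember to remove the entry again once a fixed package is available, otherwise the package stays pinned.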

The bug has been reported on the Arch Linux tracker:
https://bugs.archlinux.org/task/67774?project=1&string=xorg-server

Downgrade only xorg-server; xorg-server-common seems to be OK (unless failures appear).

Just an update: the issue has been marked as a duplicate and closed, since the commit that broke things has been found and a fix is slated for the 1.20.10 release. Maybe my corroboration regarding that commit confirmed that it was essentially the same issue as an earlier one? :stuck_out_tongue: For now, those who updated to 1.20.9 can either wait for 1.20.10 or temporarily downgrade to 1.20.8.

@philm, check Xorg version 1.20.10

There’s no 1.20.10 yet. The issue is just tagged to be fixed in 1.20.10, since they seem to already have a fix that does not involve reverting the commit. It was just an oversight that needs to be addressed in another patch. :slight_smile:

As of today, the commit fixing this issue has been merged to master. Now we’ll have to wait for either 1.20.10 or for Arch/Manjaro devs to cherry-pick the commit (919f1f46fc67dae93b2b3f278fcbfc77af34ec58) to our 1.20.9 packages. :slight_smile:

Tagging @philm since Arch has updated their Xorg with the patch fixing this regression in their 1.20.9-2 release.

It's already in unstable …
I installed the update 50 minutes ago.

xorg-server 1.20.9-2 is already in the Testing branch. I no longer have crashes.
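
To see where an installed version falls relative to what was reported in this thread (1.20.8-4 fine, 1.20.9-1 crashing, 1.20.9-2 patched), something like the sketch below may help. The names `version_ge` and `xorg_status` are invented here. It uses GNU `sort -V` as a stand-in for pacman's own version comparison, and it assumes that everything older than 1.20.9-1 is unaffected and everything from 1.20.9-2 onwards carries the fix; the thread itself only confirms the three versions just listed.

```shell
# Succeed when version $1 sorts at or after version $2 under `sort -V`.
# (An approximation of pacman's comparison, good enough for these strings.)
version_ge() {
    [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | tail -n 1)" = "$1" ]
}

# Classify an xorg-server version against the reports in this thread.
xorg_status() {
    if version_ge "$1" "1.20.9-2"; then
        echo fixed          # assumed: patched release or newer
    elif version_ge "$1" "1.20.9-1"; then
        echo affected       # the release reported as crashing
    else
        echo unaffected     # assumed: older than the regression
    fi
}
```

On a Manjaro or Arch system the installed version can be passed in with `xorg_status "$(pacman -Q xorg-server | awk '{print $2}')"`.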