Gnome Session crashes due to X.org segfault (Nvidia)

Recently I started getting X.org crashes (which kill my whole session). So far I could provoke this by working for a bit with PyCharm. The screen suddenly is black and it takes a moment to get back to GDM where I can login again. But looking at the logs it seems this is not related to PyCharm but rather some problem with the drivers.

Any ideas how to fix this?

Kernel 5.10.61
Gnome 40.4
Nvidia 470.63.01

Sep 09 16:13:35 david-pc1 /usr/lib/gdm-x-session[9960]: (EE)
Sep 09 16:13:35 david-pc1 /usr/lib/gdm-x-session[9960]: (EE) Backtrace:
Sep 09 16:13:35 david-pc1 /usr/lib/gdm-x-session[9960]: (EE) 0: /usr/lib/Xorg (xorg_backtrace+0x89) [0x5555ae19e659]
Sep 09 16:13:35 david-pc1 /usr/lib/gdm-x-session[9960]: (EE) 1: /usr/lib/Xorg (0x5555ae054000+0x1554b9) [0x5555ae1a94b9]
Sep 09 16:13:35 david-pc1 /usr/lib/gdm-x-session[9960]: (EE) 2: /usr/lib/libc.so.6 (0x7fc2063cb000+0x3cda0) [0x7fc206407da0]
Sep 09 16:13:35 david-pc1 /usr/lib/gdm-x-session[9960]: (EE) 3: /usr/lib/nvidia/xorg/libglxserver_nvidia.so (0x7fc203f76000+0x8a0c40) [0x7fc204816c40]
Sep 09 16:13:35 david-pc1 /usr/lib/gdm-x-session[9960]: (EE)
Sep 09 16:13:35 david-pc1 /usr/lib/gdm-x-session[9960]: (EE) Segmentation fault at address 0x10
Sep 09 16:13:35 david-pc1 /usr/lib/gdm-x-session[9960]: (EE)
Sep 09 16:13:35 david-pc1 /usr/lib/gdm-x-session[9960]: Fatal server error:
Sep 09 16:13:35 david-pc1 /usr/lib/gdm-x-session[9960]: (EE) Caught signal 11 (Segmentation fault). Server aborting
Sep 09 16:13:35 david-pc1 /usr/lib/gdm-x-session[9960]: (EE)
Sep 09 16:13:35 david-pc1 /usr/lib/gdm-x-session[9960]: (EE)
Sep 09 16:13:35 david-pc1 /usr/lib/gdm-x-session[9960]: Please consult the The X.Org Foundation support
Sep 09 16:13:35 david-pc1 /usr/lib/gdm-x-session[9960]:          at http://wiki.x.org
Sep 09 16:13:35 david-pc1 /usr/lib/gdm-x-session[9960]:  for help.
Sep 09 16:13:35 david-pc1 /usr/lib/gdm-x-session[9960]: (EE) Please also check the log file at "/var/log/Xorg.1.log" for additional information.
Sep 09 16:13:35 david-pc1 /usr/lib/gdm-x-session[9960]: (EE)
Sep 09 16:13:40 david-pc1 /usr/lib/gdm-x-session[9960]: (WW) NVIDIA: Wait for channel idle timed out.
Sep 09 16:13:41 david-pc1 /usr/lib/gdm-x-session[9960]: (EE) Server terminated with error (1). Closing log file.
Sep 09 16:13:41 david-pc1 audit[9960]: ANOM_ABEND auid=1000 uid=1000 gid=1001 ses=7 subj==unconfined pid=9960 comm="Xorg" exe="/usr/lib/Xorg" sig=6 res=1
Sep 09 16:13:41 david-pc1 systemd-coredump[17782]: Process 9960 (Xorg) of user 1000 dumped core.
                                                   
                                                   Stack trace of thread 9960:
                                                   #0  0x00007fc206407d22 raise (libc.so.6 + 0x3cd22)
                                                   #1  0x00007fc2063f1862 abort (libc.so.6 + 0x26862)
                                                   #2  0x00005555ae1a1d50 OsAbort (Xorg + 0x14dd50)
                                                   #3  0x00005555ae1a3865 FatalError (Xorg + 0x14f865)
                                                   #4  0x00005555ae1a951a n/a (Xorg + 0x15551a)
                                                   #5  0x00007fc206407da0 __restore_rt (libc.so.6 + 0x3cda0)
                                                   #6  0x00007fc204816c40 n/a (libglxserver_nvidia.so + 0x8a0c40)

Did you do that? :wink:

Of course. It’s exactly the same as in the journalctl output I have posted above.

[ 10908.607] (EE) 
[ 10908.607] (EE) Backtrace:
[ 10908.608] (EE) 0: /usr/lib/Xorg (xorg_backtrace+0x89) [0x5555ae19e659]
[ 10908.608] (EE) 1: /usr/lib/Xorg (0x5555ae054000+0x1554b9) [0x5555ae1a94b9]
[ 10908.608] (EE) 2: /usr/lib/libc.so.6 (0x7fc2063cb000+0x3cda0) [0x7fc206407da0]
[ 10908.608] (EE) 3: /usr/lib/nvidia/xorg/libglxserver_nvidia.so (0x7fc203f76000+0x8a0c40) [0x7fc204816c40]
[ 10908.608] (EE) 
[ 10908.608] (EE) Segmentation fault at address 0x10
[ 10908.608] (EE) 
Fatal server error:
[ 10908.608] (EE) Caught signal 11 (Segmentation fault). Server aborting
[ 10908.608] (EE) 
[ 10908.608] (EE) 
Please consult the The X.Org Foundation support 
	 at http://wiki.x.org
 for help. 
[ 10908.608] (EE) Please also check the log file at "/var/log/Xorg.1.log" for additional information.
[ 10908.608] (EE) 
[ 10913.742] (WW) NVIDIA: Wait for channel idle timed out.
[ 10913.880] (EE) Server terminated with error (1). Closing log file.

:+1: Welcome to Manjaro! :+1:

  1. Please read this:
    How to provide good information
    and press the three dots below your post and press the :pencil2: to give us more information so we can see what’s really going on.
    Now we know the symptom of the disease, but we need some more probing to know where the origin lies… :grin:
  2. An inxi --admin --verbosity=7 --filter --no-host --width would be the minimum required information for us to be able to help you. (Personally Identifiable Information like serial numbers and MAC addresses will be filtered out by the above command)
    Also, please copy-paste that output in-between 3 backticks ``` at the beginning and end of the code/text.
  3. Does this happen under a newly created user as well? (E.G. munguph2)

:+1:

P.S. If you want to notify me that you did provide this information, please :heart: this message and I’ll come back and have another look.

Just a quick update on this: Before posting this thread this has happened multiple times per day. Afterwards: Not once :sweat_smile: No idea what’s going on. That’s why I wasn’t able to deliver some more logs…

Could be a hardware problem (Overheating?): clean your computer with a can of compressed air and/or vacuum it out…

:crossed_fingers:

1 Like

Thanks for the suggestion but I don’t think that was it. I’ve been doing some pretty heavy calculations the last two weeks and no problems whatsoever :man_shrugging:
Let’s just hope it keeps working :smiley:

1 Like

This topic was automatically closed 2 days after the last reply. New replies are no longer allowed.