Xorg and Kworker blocked for more than 122 seconds

Hi everyone, randomly after some time of use Xorg freezes while the rest of the linux system Manjaro Xfce continues to work fine in the background, so it makes my system unusable. This desktop PC has a GTX 1080Ti and I think this bug is related to the “nouveau” driver. I think it is the fault of the “nouveau” driver because my old laptop with an intel iGPU and a second nvidia GPU had a similar problem with the same Manjaro Xfce. To solve the problem on my laptop, I had simply uninstalled the “nouveau” driver because my intel iGPU was enough for me at that time. Except that now, this desktop PC has only one GPU and I would like to make “new” work properly, because I don’t know any good alternative. And, as said, I’m not sure it’s really the fault of “nouveau”. Finally, two error messages are displayed one after the other; the first one is with Xorg and the second one with Kworker; both must be related I think. Here they are:

nov. 02 17:07:41 local-network kernel: INFO: task Xorg:1160 blocked for more than 122 seconds.
nov. 02 17:07:41 local-network kernel:       Not tainted 5.19.17-1-MANJARO #1
nov. 02 17:07:41 local-network kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
nov. 02 17:07:41 local-network kernel: task:Xorg            state:D stack:    0 pid: 1160 ppid:  1154 flags:0x00000006
nov. 02 17:07:41 local-network kernel: Call Trace:
nov. 02 17:07:41 local-network kernel:  <TASK>
nov. 02 17:07:41 local-network kernel:  __schedule+0x356/0x11a0
nov. 02 17:07:41 local-network kernel:  ? update_load_avg+0x7e/0x730
nov. 02 17:07:41 local-network kernel:  schedule+0x5e/0xd0
nov. 02 17:07:41 local-network kernel:  schedule_preempt_disabled+0x15/0x30
nov. 02 17:07:41 local-network kernel:  __ww_mutex_lock.constprop.0+0x577/0x9f0
nov. 02 17:07:41 local-network kernel:  drm_modeset_lock+0x91/0xe0
nov. 02 17:07:41 local-network kernel:  drm_crtc_get_sequence_ioctl+0xec/0x1a0
nov. 02 17:07:41 local-network kernel:  ? drm_wait_vblank_ioctl+0x740/0x740
nov. 02 17:07:41 local-network kernel:  drm_ioctl_kernel+0xcd/0x170
nov. 02 17:07:41 local-network kernel:  drm_ioctl+0x231/0x410
nov. 02 17:07:41 local-network kernel:  ? drm_wait_vblank_ioctl+0x740/0x740
nov. 02 17:07:41 local-network kernel:  nouveau_drm_ioctl+0x5a/0xb0 [nouveau e33404538c3ec87e1252c32c1da056fac116d741]
nov. 02 17:07:41 local-network kernel:  __x64_sys_ioctl+0x94/0xd0
nov. 02 17:07:42 local-network kernel:  do_syscall_64+0x5f/0x90
nov. 02 17:07:42 local-network kernel:  ? syscall_exit_to_user_mode+0x1b/0x40
nov. 02 17:07:42 local-network kernel:  ? do_syscall_64+0x6b/0x90
nov. 02 17:07:42 local-network kernel:  ? do_syscall_64+0x6b/0x90
nov. 02 17:07:42 local-network kernel:  ? do_syscall_64+0x6b/0x90
nov. 02 17:07:42 local-network kernel:  ? do_syscall_64+0x6b/0x90
nov. 02 17:07:42 local-network kernel:  ? do_syscall_64+0x6b/0x90
nov. 02 17:07:42 local-network kernel:  entry_SYSCALL_64_after_hwframe+0x63/0xcd
nov. 02 17:07:42 local-network kernel: RIP: 0033:0x7fc888f9ac0f
nov. 02 17:07:42 local-network kernel: RSP: 002b:00007ffd30e7fcd0 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
nov. 02 17:07:42 local-network kernel: RAX: ffffffffffffffda RBX: 00007ffd30e7fdc0 RCX: 00007fc888f9ac0f
nov. 02 17:07:42 local-network kernel: RDX: 00007ffd30e7fd60 RSI: 00000000c018643b RDI: 0000000000000010
nov. 02 17:07:42 local-network kernel: RBP: 00007ffd30e7fd60 R08: 0000555e1b187e70 R09: 0000000000000000
nov. 02 17:07:42 local-network kernel: R10: 0000555e1b10bd00 R11: 0000000000000246 R12: 00000000c018643b
nov. 02 17:07:42 local-network kernel: R13: 0000000000000010 R14: 00007ffd30e7fdc0 R15: 0000555e1b074750
nov. 02 17:07:42 local-network kernel:  </TASK>
nov. 02 17:07:42 local-network kernel: INFO: task kworker/u64:7:9735 blocked for more than 122 seconds.
nov. 02 17:07:42 local-network kernel:       Not tainted 5.19.17-1-MANJARO #1
nov. 02 17:07:42 local-network kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
nov. 02 17:07:42 local-network kernel: task:kworker/u64:7   state:D stack:    0 pid: 9735 ppid:     2 flags:0x00004000
nov. 02 17:07:42 local-network kernel: Workqueue: events_unbound nv50_disp_atomic_commit_work [nouveau]
nov. 02 17:07:42 local-network kernel: Call Trace:
nov. 02 17:07:42 local-network kernel:  <TASK>
nov. 02 17:07:42 local-network kernel:  __schedule+0x356/0x11a0
nov. 02 17:07:42 local-network kernel:  ? gk104_fifo_uevent_init+0x1e/0x40 [nouveau e33404538c3ec87e1252c32c1da056fac116d741]
nov. 02 17:07:42 local-network kernel:  ? nvkm_notify_get+0x5c/0x70 [nouveau e33404538c3ec87e1252c32c1da056fac116d741]
nov. 02 17:07:42 local-network kernel:  schedule+0x5e/0xd0
nov. 02 17:07:42 local-network kernel:  schedule_timeout+0x11c/0x150
nov. 02 17:07:42 local-network kernel:  dma_fence_default_wait+0x1d3/0x270
nov. 02 17:07:42 local-network kernel:  ? __bpf_trace_dma_fence+0x10/0x10
nov. 02 17:07:42 local-network kernel:  dma_fence_wait_timeout+0xe7/0x110
nov. 02 17:07:42 local-network kernel:  drm_atomic_helper_wait_for_fences+0x86/0xe0
nov. 02 17:07:42 local-network kernel:  nv50_disp_atomic_commit_tail+0x8e/0x7e0 [nouveau e33404538c3ec87e1252c32c1da056fac116d741]
nov. 02 17:07:42 local-network kernel:  process_one_work+0x1c7/0x380
nov. 02 17:07:42 local-network kernel:  worker_thread+0x51/0x390
nov. 02 17:07:42 local-network kernel:  ? rescuer_thread+0x3b0/0x3b0
nov. 02 17:07:42 local-network kernel:  kthread+0xde/0x110
nov. 02 17:07:42 local-network kernel:  ? kthread_complete_and_exit+0x20/0x20
nov. 02 17:07:42 local-network kernel:  ret_from_fork+0x22/0x30
nov. 02 17:07:42 local-network kernel:  </TASK>

Thanks to all!

just install proper drivers, nouveau is not good, as you can see…
post output from:
inxi -G && mhwd -l -li

Thanks a lot for your help and sorry for my delay, I was looking into it.
Should I do this :

  1. uninstall “xf86-video-nouveau” first (to avoid a conflict between drivers)
  2. reboot
  3. install proper driver: “pacman -S nvidia-lts” (Because I use a linux LTS kernel and just “pacman -S nvidia” is only for non LTS ones. Is that right?)
  4. I think it’s optional but I can also install “lib32-nvidia-utils” for 3D acceleration. Is this optional?
  5. done
    Thanks

Post output of the commands… If you would do the things you want to, you would end up with a black screen…

That’s what I thought, I did well to wait for you…

inxi -G && mhwd -l -li
Graphics:
  Device-1: NVIDIA GP102 [GeForce GTX 1080 Ti] driver: nouveau v: kernel
  Display: x11 server: X.Org v: 21.1.4 driver: X: loaded: modesetting
    dri: nouveau gpu: nouveau resolution: 3440x1440~60Hz
  API: OpenGL Message: Unable to show GL data. Required tool glxinfo
    missing.
> Installed PCI configs:
--------------------------------------------------------------------------------
                  NAME               VERSION          FREEDRIVER           TYPE
--------------------------------------------------------------------------------
           video-linux            2018.05.04                true            PCI


Warning: No installed USB configs!
> 0000:26:00.0 (0300:10de:1b06) Display controller nVidia Corporation:
--------------------------------------------------------------------------------
                  NAME               VERSION          FREEDRIVER           TYPE
--------------------------------------------------------------------------------
          video-nvidia            2021.11.04               false            PCI
    video-nvidia-470xx            2021.11.04               false            PCI
    video-nvidia-390xx            2021.11.26               false            PCI
           video-linux            2018.05.04                true            PCI
     video-modesetting            2020.01.13                true            PCI
            video-vesa            2017.03.12                true            PCI


> 0000:22:00.0 (0200:10ec:8168) Network controller Realtek Semiconductor Co., Ltd.:
--------------------------------------------------------------------------------
                  NAME               VERSION          FREEDRIVER           TYPE
--------------------------------------------------------------------------------
         network-r8168            2016.04.20                true            PCI


so first add this parameter:
sudo nano /etc/default/grub
and in this line: GRUB_CMDLINE_LINUX_DEFAULT inside the quotes add this parameter:
ibt=off
dont remove anything from there!, just add it to existing parameters, save the file, update grub:
sudo update-grub
its possible that you would not be able to boot without this parameter, so we added it just in case…


now uninstall this:
sudo mhwd -r pci video-linux
and install nvidia drivers:
sudo mhwd -i pci video-nvidia
reboot

1 Like

Really, thank you, you are top notch!

So I did all your instructions and I got a success message for the uninstallation and then also for the installation. After rebooting, this is what I get with the command “inxi -G && mhwd -l -li” :

inxi -G && mhwd -l -li
Graphics:
  Device-1: NVIDIA GP102 [GeForce GTX 1080 Ti] driver: nvidia v: 520.56.06
  Display: x11 server: X.Org v: 21.1.4 driver: N/A
    resolution: 3440x1440~60Hz
  API: OpenGL Message: Unable to show GL data. Required tool glxinfo
    missing.
> Installed PCI configs:
--------------------------------------------------------------------------------
                  NAME               VERSION          FREEDRIVER           TYPE
--------------------------------------------------------------------------------
          video-nvidia            2021.11.04               false            PCI


Warning: No installed USB configs!
> 0000:26:00.0 (0300:10de:1b06) Display controller nVidia Corporation:
--------------------------------------------------------------------------------
                  NAME               VERSION          FREEDRIVER           TYPE
--------------------------------------------------------------------------------
          video-nvidia            2021.11.04               false            PCI
    video-nvidia-470xx            2021.11.04               false            PCI
    video-nvidia-390xx            2021.11.26               false            PCI
           video-linux            2018.05.04                true            PCI
     video-modesetting            2020.01.13                true            PCI
            video-vesa            2017.03.12                true            PCI


> 0000:22:00.0 (0200:10ec:8168) Network controller Realtek Semiconductor Co., Ltd.:
--------------------------------------------------------------------------------
                  NAME               VERSION          FREEDRIVER           TYPE
--------------------------------------------------------------------------------
         network-r8168            2016.04.20                true            PCI


Oh yes, when I successfully installed “video-nvidia”, I got this information:

If you run into trouble with CUDA not being available, run nvidia-modprobe first.
If you use GDM on Wayland, you might have to run systemctl enable --now nvidia-resume.service

However, I think that these messages do not concern me. Do we agree?


It seems that we are done?

Thanks a lot.

GDM is gnome display manager, and since you are on xfce, it doesnt concern you, and the same goes for cuda if you are not using it…
so yes, thats it…

Ok, thanks. For the moment everything seems to be fine with the nvidia drivers I installed with you. As I said in my first message, my error is random, so I’ll have to wait a bit before resolving this topic. I promise to come back and resolve it if it turns out to be the case.
Thanks again for your help.

This topic was automatically closed 2 days after the last reply. New replies are no longer allowed.