I’ve tried blacklisting nvidia_uvm and while this has improved the situation slightly, I still get kernel panic and random freezes. The issue with kernel panic is that I loose the logs (sysrq does not work) and log entries and file system gets corrupted. Worse, when the panic happens during shutdown/reboot, I can only see the ending of traceback which lists modules loaded and the final
kernel: note: systemd[993] exited with irqs disabled
It was pure luck that I managed to grab the logs during a startup and had the terminal open which led me to figure out it’s an Nvidia issue.
I searched through the forums and a general consensus seems to be that Nvidia drivers cannot be downgraded[0], however Downgrade nvidia 470.86 to 470.63 - #10 by philm from 3 years ago makes it seem like it’s possible. Is it still valid?
How do I downgrade (from 550 → 545 or earlier if this does not work out)?
Once 555 is release restore things back the way it is currently i.e. move back to the latest branch (and any associated packages such as linux-nvidia*)
There are no nvidia-545 packages in the Manjaro or Arch repos, or even in the AUR.
This means that you cannot use mhwd or any downgrade tool to recover/switch to 545, nor can you simply install it.
If you happen to have certain packages in your cache then maybe … otherwise you will have to write/edit a PKGBUILD yourself.
If we are to assume you somehow managed to install nvidia-545 packages then you would re-perform regular mhwd selection. ex:
sudo mhwd -i pci video-nvidia
Though it could be likely that you will have dependencies to manage before that will go through.
In such an event you may have to remove some of the offending packages beforehand. You may also benefit from the force flag:
I managed to get 545 and will let it run for a week to see if it helps. philm’s instructions are clear and I had no issues building the packages. I needed to downgrade the following:
I picked the last 545 commit for these and built the packages. Also added these to ignore package in pacman.conf
I suspect a package could be missed here as the GUI app (it’s called Nvidia Server, I think) is not shown in application menu – pacman does not complain so whatever it is, is not a dependency. For now downgrading the driver, running the system normally and seeing if I continue to observe kernel panic are my next steps.
When the next version of drivers come out I can test it out by removing ignore package. I have btrfs + snapshots, so can go back and forth as needed.
It should also be considered that Linux 6.7.12 [EOL] so please change to supported kernel ASAP.
linux66 or linux68 is valid choices.
Noted. I’ve been using 6.6 and 6.1 LTS for the moment.
I wonder whether you could try another kernel instead of linux67. Maybe, you could solve your problem.
I’ve used 6.1, 6.6, 6.7 & 6.8 so far and have had issues with all of them. Frankly, downgrading Nvidia packages is in itself a test, but the symptoms plus the only journal I managed to capture match the Arch forum post.
In order to help you, this post will be useful to any other posts.
‘[HowTo] Request support’
Here is what I have in the only journal I managed to retain, kernel panic (6.8) on VT and sysrq not working don’t help:
Because you did not post the output of the following terminal command: inxi -v8azy
Thanks for trying to help, I understand that my question is not in the usual forum format where I’m describing a symptom and looking for cause and solutions. In this case I’ve identified the issue as being with Nvidia 550 drivers and have a solution. It’s more of a procedural question of “How do I do … on Manjaro”.
But your point brings to mind an associated idea that forums are not necessarily to solve the problem of an individual but also to act as a reference for future posters who may have the same/similar question.
The hardware I have this issue on is ASUS TUF Gaming F15 (2022), Intel Core i7-12700H 12th Gen, RTX 3050 Ti 4GB Graphics FX577ZE-HN056W. The issue is with Nvidia 550 drivers which causes kernel panic (caps lock flashing) due to some weird interaction with what looks like cgroups during reboot/shutdown and sometimes during start.
I had a hard time figuring out what was going on because panic during reboot/shutdown only shows the last part of the trace i.e. from modules loaded due to limited screen size and being unable to scroll up. Sysrq does not work after kernel panic.
To solve this I downgraded to Nvidia 545 and the kernel panics have disappeared. There is a post on the Nvidia forums and from the people on the post it looks like it’s Arch Linux (and those based on it who are affected - even found a Manjaro user there). Nvidia have a bug open Series 550 freezes laptop - #27 by amrits - Linux - NVIDIA Developer Forums