Blackscreen (monitor lost signal) issue and sometimes wont wake up, only happends after demanding Steam Proton gaming and idle time

After I had this issue, i also saw this 2 new errors in journal that showed up:

Mai 11 02:25:15 koboldx-z170 kernel: [drm:drm_new_set_master] *ERROR* [nvidia-drm] [GPU ID 0x00000100] Failed to grab modeset ownership
Mai 11 02:26:26 koboldx-z170 kernel: [drm:drm_new_set_master] *ERROR* [nvidia-drm] [GPU ID 0x00000100] Failed to grab modeset ownership

When i had this issue the first time, i was total shocked because i though my GPU breaked.

But after moving my mouse, the Monitor comes back life, like it was in a sleeping state, but i had deactivated all kind of Energy Savings in KDE.

The Nvidia-DRM issue that showed up in Journal makes me think, that it was linked 1 month old Nvidia Flickering Issue that i fixed with the workaround “nvidia_drm.modeset=1” in Grub Boot that i had to add in Grub in April after we had to reinstall nvidia-settings. The1 month old Topic (that was solved at this time because of the workaround) but still not fixed from nvidia and today i see this drawbacks from this workaround.

Yesterday, i had the second time this black screen issue (signal was total lost) and pressing keyboard or moving mouse wont do anything.

I saw no other way then pressing “quick” the power button… for a normal shutdown.
And 1-2sec just befor the shutdown, i saw the screen again befor the PC finally shutdown.

That makes me thinking that maybe i can restore the Monitor signal and get a picture with crtl+alt+f2 (TTY) and back to desktop with crtl+alt+f1, but i still very confused why this all happends and if switching to TTY really helps here or not :grimacing:

Also what makes it even more harder that this new stable update comes out, 1 day after i refreshed my Thermal Paste at my GPU, because of a heat issue that is fixed now, in this situations a rolling release can we a big problem :frowning:

Im actually updated today my LTSC Kernel from 5.15 to 6.01 but i dont have high expectation and just waiting for the next blackscreen, that maybe happening 1 time in around 30hours in my limited experience with that issue… so its a really rare situation.

inxi --admin --verbosity=7 --filter --no-host --width:

Summary
System:
  Kernel: 5.15.109-1-MANJARO arch: x86_64 bits: 64 compiler: gcc v: 12.2.1
    parameters: BOOT_IMAGE=/vmlinuz-5.15-x86_64
    root=UUID=eb235aa7-d461-413d-800e-ea57385703fb rw quiet apparmor=1
    sysrq_always_enabled=1 retbleed=off security=apparmor nvidia_drm.modeset=1
    resume=UUID=717b267e-7322-4bf9-a840-f1210d422d1a udev.log_priority=3
  Desktop: KDE Plasma v: 5.27.4 tk: Qt v: 5.15.9 wm: kwin_x11 vt: 1 dm: SDDM
    Distro: Manjaro Linux base: Arch Linux
Machine:
  Type: Desktop System: Gigabyte product: Z170X-UD3 v: N/A
    serial: <superuser required>
  Mobo: Gigabyte model: Z170X-UD3-CF v: x.x serial: <superuser required>
    UEFI-[Legacy]: American Megatrends v: F23d date: 12/01/2017
Memory:
  System RAM: available: 15.58 GiB used: 2.6 GiB (16.7%)
  RAM Report: permissions: Unable to run dmidecode. Root privileges required.
CPU:
  Info: model: Intel Core i7-6700K bits: 64 type: MT MCP arch: Skylake-S
    gen: core 6 level: v3 note: check built: 2015 process: Intel 14nm family: 6
    model-id: 0x5E (94) stepping: 3 microcode: 0xF0
  Topology: cpus: 1x cores: 4 tpc: 2 threads: 8 smt: enabled cache:
    L1: 256 KiB desc: d-4x32 KiB; i-4x32 KiB L2: 1024 KiB desc: 4x256 KiB
    L3: 8 MiB desc: 1x8 MiB
  Speed (MHz): avg: 4501 high: 4505 min/max: 800/4700 scaling:
    driver: intel_pstate governor: performance cores: 1: 4502 2: 4505 3: 4502
    4: 4500 5: 4504 6: 4500 7: 4501 8: 4499 bogomips: 64026
  Flags: 3dnowprefetch abm acpi adx aes aperfmperf apic arat
    arch_capabilities arch_perfmon art avx avx2 bmi1 bmi2 bts clflush
    clflushopt cmov constant_tsc cpuid cpuid_fault cx16 cx8 de ds_cpl dtes64
    dtherm dts ept ept_ad erms est f16c flexpriority flush_l1d fma fpu
    fsgsbase fxsr ht hwp hwp_act_window hwp_epp hwp_notify ibpb ibrs ida
    intel_pt invpcid invpcid_single lahf_lm lm mca mce md_clear mmx monitor
    movbe mpx msr mtrr nonstop_tsc nopl nx pae pat pbe pcid pclmulqdq pdcm
    pdpe1gb pebs pge pln pni popcnt pse pse36 pti pts rdrand rdseed rdtscp
    rep_good sdbg sep smap smep ss ssbd sse sse2 sse4_1 sse4_2 ssse3 stibp
    syscall tm tm2 tpr_shadow tsc tsc_adjust tsc_deadline_timer vme vmx vnmi
    vpid x2apic xgetbv1 xsave xsavec xsaveopt xsaves xtopology xtpr
  Vulnerabilities:
  Type: itlb_multihit status: KVM: VMX disabled
  Type: l1tf mitigation: PTE Inversion; VMX: conditional cache flushes, SMT
    vulnerable
  Type: mds mitigation: Clear CPU buffers; SMT vulnerable
  Type: meltdown mitigation: PTI
  Type: mmio_stale_data mitigation: Clear CPU buffers; SMT vulnerable
  Type: retbleed status: Vulnerable
  Type: spec_store_bypass mitigation: Speculative Store Bypass disabled via
    prctl and seccomp
  Type: spectre_v1 mitigation: usercopy/swapgs barriers and __user pointer
    sanitization
  Type: spectre_v2 mitigation: Retpolines, IBPB: conditional, IBRS_FW,
    STIBP: conditional, RSB filling, PBRSB-eIBRS: Not affected
  Type: srbds mitigation: Microcode
  Type: tsx_async_abort mitigation: TSX disabled
Graphics:
  Device-1: NVIDIA TU102 [GeForce RTX 2080 Ti Rev. A] vendor: Micro-Star MSI
    driver: nvidia v: 530.41.03 alternate: nouveau,nvidia_drm non-free: 530.xx+
    status: current (as of 2023-05) arch: Turing code: TUxxx
    process: TSMC 12nm FF built: 2018-22 pcie: gen: 1 speed: 2.5 GT/s lanes: 16
    link-max: gen: 3 speed: 8 GT/s ports: active: none off: DP-3 empty: DP-1,
    DP-2, HDMI-A-1, Unknown-1 bus-ID: 01:00.0 chip-ID: 10de:1e07 class-ID: 0300
  Display: x11 server: X.Org v: 21.1.8 compositor: kwin_x11 driver: X:
    loaded: nvidia gpu: nvidia,nvidia-nvswitch display-ID: :0 screens: 1
  Screen-1: 0 s-res: 2560x1440 s-dpi: 122 s-size: 532x302mm (20.94x11.89")
    s-diag: 612mm (24.08")
  Monitor-1: DP-3 mapped: DP-4 note: disabled model: Dell S2417DG
    serial: <filter> built: 2018 res: 2560x1440 dpi: 123 gamma: 1.2
    size: 527x296mm (20.75x11.65") diag: 604mm (23.8") ratio: 16:9 modes:
    max: 2560x1440 min: 640x480
  API: OpenGL v: 4.6.0 NVIDIA 530.41.03 renderer: NVIDIA GeForce RTX 2080
    Ti/PCIe/SSE2 direct-render: Yes
Audio:
  Device-1: Intel 100 Series/C230 Series Family HD Audio vendor: Gigabyte
    driver: snd_hda_intel v: kernel bus-ID: 00:1f.3 chip-ID: 8086:a170
    class-ID: 0403
  Device-2: NVIDIA TU102 High Definition Audio vendor: Micro-Star MSI
    driver: snd_hda_intel v: kernel pcie: gen: 3 speed: 8 GT/s lanes: 16
    bus-ID: 01:00.1 chip-ID: 10de:10f7 class-ID: 0403
  Device-3: Creative Labs CA0132 Sound Core3D [Sound Blaster Recon3D /
    Z-Series BlasterX AE-5 Plus] driver: snd_hda_intel v: kernel pcie: gen: 1
    speed: 2.5 GT/s lanes: 1 bus-ID: 0b:00.0 chip-ID: 1102:0012 class-ID: 0403
  API: ALSA v: k5.15.109-1-MANJARO status: kernel-api with: aoss
    type: oss-emulator tools: alsactl,alsamixer,amixer
  Server-1: JACK v: 1.9.22 status: off tools: N/A
  Server-2: PipeWire v: 0.3.70 status: off with: pipewire-media-session
    status: active tools: pw-cli
  Server-3: PulseAudio v: 16.1 status: active with: 1: pulseaudio-alsa
    type: plugin 2: pulseaudio-jack type: module tools: pacat,pactl
Network:
  Device-1: Intel Ethernet I219-V vendor: Gigabyte driver: e1000e v: kernel
    port: N/A bus-ID: 00:1f.6 chip-ID: 8086:15b8 class-ID: 0200
  IF: enp0s31f6 state: up speed: 1000 Mbps duplex: full mac: <filter>
  IP v4: <filter> type: dynamic noprefixroute scope: global
    broadcast: <filter>
  IP v6: <filter> type: noprefixroute scope: link
  WAN IP: <filter>
Bluetooth:
  Device-1: Cambridge Silicon Radio Bluetooth Dongle (HCI mode) driver: btusb
    v: 0.8 type: USB rev: 2.0 speed: 12 Mb/s lanes: 1 mode: 1.1 bus-ID: 1-3:5
    chip-ID: 0a12:0001 class-ID: e001
  Report: rfkill ID: hci0 rfk-id: 0 state: down bt-service: enabled,running
    rfk-block: hardware: no software: yes address: see --recommends
Logical:
  Message: No logical block device data found.
RAID:
  Message: No RAID data found.
Drives:
  Local Storage: total: 4.57 TiB used: 198.35 GiB (4.2%)
  SMART Message: Unable to run smartctl. Root privileges required.
  ID-1: /dev/nvme0n1 maj-min: 259:0 vendor: Samsung model: SSD 960 EVO 500GB
    size: 465.76 GiB block-size: physical: 512 B logical: 512 B speed: 31.6 Gb/s
    lanes: 4 tech: SSD serial: <filter> fw-rev: 2B7QCXE7 temp: 23.9 C
    scheme: GPT
  ID-2: /dev/sda maj-min: 8:0 vendor: Samsung model: SSD 860 PRO 1TB
    size: 953.87 GiB block-size: physical: 512 B logical: 512 B speed: 6.0 Gb/s
    tech: SSD serial: <filter> fw-rev: 2B6Q scheme: MBR
  ID-3: /dev/sdb maj-min: 8:16 vendor: HGST (Hitachi) model: HDN724030ALE640
    size: 2.73 TiB block-size: physical: 4096 B logical: 512 B speed: 6.0 Gb/s
    tech: HDD rpm: 7200 serial: <filter> fw-rev: A5E0 scheme: GPT
  ID-4: /dev/sdc maj-min: 8:32 vendor: Samsung model: Portable SSD T5
    size: 465.76 GiB block-size: physical: 512 B logical: 512 B type: USB
    rev: 3.1 spd: 5 Gb/s lanes: 1 mode: 3.2 gen-1x1 tech: SSD serial: <filter>
    scheme: MBR
  Message: No optical or floppy data found.
Partition:
  ID-1: / raw-size: 88.61 GiB size: 86.66 GiB (97.80%) used: 32.57 GiB (37.6%)
    fs: ext4 dev: /dev/sdc1 maj-min: 8:33 label: N/A
    uuid: eb235aa7-d461-413d-800e-ea57385703fb
  ID-2: /boot raw-size: 200 MiB size: 188.2 MiB (94.09%)
    used: 70.5 MiB (37.5%) fs: ext3 dev: /dev/sdc3 maj-min: 8:35 label: N/A
    uuid: 26eda82e-b403-49b8-abca-202167417020
  ID-3: /home raw-size: 332.03 GiB size: 325.75 GiB (98.11%)
    used: 27.13 GiB (8.3%) fs: ext4 dev: /dev/sdc4 maj-min: 8:36 label: N/A
    uuid: ada4a6a2-bd0a-4652-b386-7c637bba7ee9
  ID-4: /media/linux-games raw-size: 196.58 GiB size: 192.43 GiB (97.89%)
    used: 133.63 GiB (69.4%) fs: ext4 dev: /dev/sda3 maj-min: 8:3
    label: Linux-games uuid: dd5af583-9d00-4017-adf7-e1d8876486e7
  ID-5: /media/temp raw-size: 63.48 GiB size: 62.18 GiB (97.96%)
    used: 4.94 GiB (7.9%) fs: ext4 dev: /dev/sdb1 maj-min: 8:17 label: temp
    uuid: 1e81b7c2-3438-438a-b572-ff8a966a78e1
Swap:
  Kernel: swappiness: 10 (default 60) cache-pressure: 100 (default)
  ID-1: swap-1 type: partition size: 7.81 GiB used: 2.8 MiB (0.0%)
    priority: -2 dev: /dev/sdc2 maj-min: 8:34 label: N/A
    uuid: 717b267e-7322-4bf9-a840-f1210d422d1a
Unmounted:
  ID-1: /dev/nvme0n1p1 maj-min: 259:1 size: 442.47 GiB fs: ntfs label: ssm
    uuid: AE7EDC0696B158FD
  ID-2: /dev/sda1 maj-min: 8:1 size: 50 MiB fs: ntfs label: System-reserviert
    uuid: B2286A122869D5BF
  ID-3: /dev/sda2 maj-min: 8:2 size: 97.09 GiB fs: ntfs label: win10
    uuid: 5E60C09860C077F3
  ID-4: /dev/sda4 maj-min: 8:4 size: 585.94 GiB fs: ntfs label: games
    uuid: 165692E31D7ADAF2
  ID-5: /dev/sdb2 maj-min: 8:18 size: 1.57 TiB fs: <superuser required>
    label: N/A uuid: N/A
  ID-6: /dev/sdb3 maj-min: 8:19 size: 1.09 TiB fs: <superuser required>
    label: N/A uuid: N/A
USB:
  Hub-1: 1-0:1 info: hi-speed hub with single TT ports: 16 rev: 2.0
    speed: 480 Mb/s (57.2 MiB/s) lanes: 1 mode: 2.0 chip-ID: 1d6b:0002
    class-ID: 0900
  Device-1: 1-3:5 info: Cambridge Silicon Radio Bluetooth Dongle (HCI mode)
    type: bluetooth driver: btusb interfaces: 2 rev: 2.0
    speed: 12 Mb/s (1.4 MiB/s) lanes: 1 mode: 1.1 chip-ID: 0a12:0001
    class-ID: e001
  Device-2: 1-9:2 info: Endor AG ClubSportPedal type: HID
    driver: hid-generic,usbhid interfaces: 1 rev: 2.0 speed: 12 Mb/s (1.4 MiB/s)
    lanes: 1 mode: 1.1 power: 100mA chip-ID: 0eb7:183b class-ID: 0300
  Device-3: 1-13:3 info: A4Tech XL-730K / XL-750BK XL-755BK Mice
    type: keyboard,mouse driver: hid-generic,usbhid interfaces: 2 rev: 1.1
    speed: 12 Mb/s (1.4 MiB/s) lanes: 1 mode: 1.1 power: 100mA
    chip-ID: 09da:9090 class-ID: 0301
  Hub-2: 2-0:1 info: super-speed hub ports: 10 rev: 3.0
    speed: 5 Gb/s (596.0 MiB/s) lanes: 1 mode: 3.2 gen-1x1 chip-ID: 1d6b:0003
    class-ID: 0900
  Device-1: 2-5:2 info: Samsung Portable SSD T5 type: mass storage
    driver: uas interfaces: 1 rev: 3.1 speed: 5 Gb/s (596.0 MiB/s) lanes: 1
    mode: 3.2 gen-1x1 power: 896mA chip-ID: 04e8:61f5 class-ID: 0806
    serial: <filter>
  Hub-3: 3-0:1 info: hi-speed hub with single TT ports: 2 rev: 2.0
    speed: 480 Mb/s (57.2 MiB/s) lanes: 1 mode: 2.0 chip-ID: 1d6b:0002
    class-ID: 0900
  Hub-4: 4-0:1 info: super-speed hub ports: 4 rev: 3.1
    speed: 10 Gb/s (1.16 GiB/s) lanes: 1 mode: 3.2 gen-2x1 chip-ID: 1d6b:0003
    class-ID: 0900
  Hub-5: 5-0:1 info: hi-speed hub with single TT ports: 2 rev: 2.0
    speed: 480 Mb/s (57.2 MiB/s) lanes: 1 mode: 2.0 chip-ID: 1d6b:0002
    class-ID: 0900
  Hub-6: 6-0:1 info: super-speed hub ports: 2 rev: 3.1
    speed: 10 Gb/s (1.16 GiB/s) lanes: 1 mode: 3.2 gen-2x1 chip-ID: 1d6b:0003
    class-ID: 0900
Sensors:
  System Temperatures: cpu: 33.0 C mobo: N/A gpu: nvidia temp: 34 C
  Fan Speeds (RPM): N/A gpu: nvidia fan: 25%
Info:
  Processes: 243 Uptime: 11h 53m wakeups: 1 Init: systemd v: 252
  default: graphical tool: systemctl Compilers: gcc: 12.2.1 clang: 15.0.7
  Packages: 1494 pm: pacman pkgs: 1488 libs: 425 tools: pamac pm: flatpak
  pkgs: 6 Shell: Bash v: 5.1.16 running-in: konsole inxi: 3.3.27

I just got the third time the blackscreen issue and it happens after i was afk for maybe around 20 min. My suggestions was right, i switched to TTY and my blinking Monitor (LED) restored the signal and my Desktop was working again after i switched to TTY and then back to Desktop.

This time, no new errors showed up in Journal… like nothing happends :thinking:

What is also strange, it always happends after i played very demanding GPU Title, Steam Proton game (Monster Hunter World), but around 4-5hours delay after i closed the game and till the blackscreen showed up.

I had played also several hours Linux Native game (Beyond all Reason) where nothing happends after wards. :man_shrugging: :face_with_monocle:

Edit1:
4 days (15.May) later now and no blackscreen because i stopped played Monster Hunter World (Steam Proton) but everyday i played RTS game (native Linux game).

Can somebody explain why this happends?

Edit2: After another absines for 9 days (24.May) from MHW i still dont have any Blackscreen issue.

I still playing RTS and use Video Encoding… sometimes even at the same time, but no problem.

So when i play this game or another very demanding GPU Title, my Monitor will run into a lost signal… but i can play the same title (or VR 5k Resolution games, so even more demanding) in Windows10 and dont run into the issue.

Edit3: Yesterday (26.May) i started playing MHW again and after having absolute no issues
for atleast 11 days and daily/long time using the PC and playing Linux Native Games.

So i decided to played MHW for a few hours. After i closed the game and did nothing with the PC. I decided to wait for the blackscreen and straight after around 10min without any workload (idle desktop) and no mouse/keyboard usage my PC Monitor showed a blackscreen, i moved my mouse and the Screen showed a picture again but no nvidia error in journal log.

I would bet if i didnt moved my mouse, straight after the black screen, the screen would stayed black. Till im switching to TTY… :roll_eyes:

Anyone have a idea how to fix this? Should i report it to Nvidia?

HI, There!

It seems that we have a different set up, BUT the same is happening here!

My reaction was the same! GPU!!!

I have an AMD JUNNIPER!

It happens using Google Meet only! Totally random!

So the problem is not about the GPU only!

Still looking for any solution!

Ricardo

You said the same happening, can you specific that?

Did you play something demanding before the Monitor lost signal?

Did you used keyboard/mouse and was your PC idle as your Monitor went black?

Edit: I played MHW today and i had blackscreen (lost signal) after my PC went idle for exactly 15 Minutes.