ASUS GL702ZC: Kernel Oops with AMDGPU when standby

I’m currently running Manjaro XFCE on an ASUS ROG Strix GL702ZC (Ryzen 7 1700) with latest packages. While everything works, I’m having some issues.

  • The built-in touchpad is not functional (although I mostly use a mouse, at times I might be powering it on without one for temporary maintenance such as updating packages, during which I’m completely on keyboard).
  • The system may hard-freeze with a black screen when it attempts to go to sleep (once this also caused the fan to rev at full speed). Disabling sleep (only for plugged in at present) mitigated the issue, but the system can also hard-freeze when trying to lock the screen.

At present, both 4.14 and 4.15 RC kernels exhibit these problems.

This one (originally for the 4.14.0 kernel) apparently incorporates some unofficial kernel patches specific to this laptop. I might try it out to see if it addresses some of the issues I’m having, but I don’t know if there’s a proper way to build and maintain a patched kernel on Arch/Manjaro (plus other kernel modules).

While the issue is nothing major, it can be problematic as I also use the laptop at work, and I need to avoid doing anything that could lead to a hard freeze, since that can cause data loss. A freeze was possibly responsible for an endless loop of core dumps from lightdm-gtk-greeter when I first started installing and configuring Manjaro on the laptop a few months ago; the system could no longer boot and I was forced to reinstall.

UPDATE: I haven’t had the time to look into getting an unofficial kernel, but I found something useful when I looked into journalctl. The crash is caused by AMDGPU, and is actually a kernel Oops (a NULL pointer dereference). There was nothing suspicious in the log prior to the Oops itself.

Feb 03 08:28:23 linuxsys kernel: BUG: unable to handle kernel NULL pointer dereference at (null)
Feb 03 08:28:23 linuxsys kernel: IP: dce110_vblank_set+0x4f/0xb0 [amdgpu]
Feb 03 08:28:23 linuxsys kernel: PGD 7d8afd067 P4D 7d8afd067 PUD 7d8b00067 PMD 0
Feb 03 08:28:23 linuxsys kernel: Oops: 0000 [#1] PREEMPT SMP NOPTI
Feb 03 08:28:23 linuxsys kernel: Modules linked in: vmw_vsock_vmci_transport vsock cmac rfcomm fuse bnep vmnet(O) amdkfd amd_iommu_v2 amdgpu arc4 iwlmvm btusb btrtl btbcm btintel edac_mce_amd mac80211 kvm_amd wmi_bmof serio bluetooth ax8817
Feb 03 08:28:23 linuxsys kernel: sysfillrect rfkill sysimgblt rng_core fb_sys_fops soundcore tpm_crb shpchp wmi tpm_tis thermal ac tpm_tis_core battery video tpm i2c_hid asus_wireless button acpi_cpufreq sch_fq_codel vmmon(O) vmw_vmci vbox
Feb 03 08:28:23 linuxsys kernel: CPU: 8 PID: 1492 Comm: xfwm4 Tainted: G O 4.15.0-1-MANJARO #1
Feb 03 08:28:23 linuxsys kernel: Hardware name: ASUSTeK COMPUTER INC. GL702ZC/GL702ZC, BIOS GL702ZC.303 12/15/2017
Feb 03 08:28:23 linuxsys kernel: RIP: 0010:dce110_vblank_set+0x4f/0xb0 [amdgpu]
Feb 03 08:28:23 linuxsys kernel: RSP: 0018:ffff9ee388c7bbe0 EFLAGS: 00010002
Feb 03 08:28:23 linuxsys kernel: RAX: ffff90d8ddbb0000 RBX: 0000000000000001 RCX: 0000000000000000
Feb 03 08:28:23 linuxsys kernel: RDX: 0000000000000000 RSI: 000000000000000c RDI: 0000000000000000
Feb 03 08:28:23 linuxsys kernel: RBP: ffff90db7178a880 R08: 0000000000000000 R09: 0000000000000000
Feb 03 08:28:23 linuxsys kernel: R10: 00007ffdbffc9720 R11: ffff90db791c3b00 R12: ffff90db6a806700
Feb 03 08:28:23 linuxsys kernel: R13: ffffffffc0e99ba0 R14: ffff90db69200000 R15: ffff90db6a37ab68
Feb 03 08:28:23 linuxsys kernel: FS: 00007f22ddf93980(0000) GS:ffff90db7e800000(0000) knlGS:0000000000000000
Feb 03 08:28:23 linuxsys kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Feb 03 08:28:23 linuxsys kernel: CR2: 0000000000000000 CR3: 00000007f9bf6000 CR4: 00000000003406e0
Feb 03 08:28:23 linuxsys kernel: Call Trace:
Feb 03 08:28:23 linuxsys kernel: amdgpu_dm_set_crtc_irq_state+0x31/0x60 [amdgpu]
Feb 03 08:28:23 linuxsys kernel: amdgpu_irq_update+0x55/0x90 [amdgpu]
Feb 03 08:28:23 linuxsys kernel: drm_vblank_enable+0x84/0x100 [drm]
Feb 03 08:28:23 linuxsys kernel: drm_vblank_get+0x8d/0xb0 [drm]
Feb 03 08:28:23 linuxsys kernel: drm_wait_vblank_ioctl+0x12a/0x690 [drm]
Feb 03 08:28:23 linuxsys kernel: ? unix_stream_recvmsg+0x53/0x70
Feb 03 08:28:23 linuxsys kernel: ? drm_legacy_modeset_ctl_ioctl+0x100/0x100 [drm]
Feb 03 08:28:23 linuxsys kernel: drm_ioctl_kernel+0x5b/0xb0 [drm]
Feb 03 08:28:23 linuxsys kernel: drm_ioctl+0x2d5/0x370 [drm]
Feb 03 08:28:23 linuxsys kernel: ? drm_legacy_modeset_ctl_ioctl+0x100/0x100 [drm]
Feb 03 08:28:23 linuxsys kernel: ? do_iter_write+0xdc/0x190
Feb 03 08:28:23 linuxsys kernel: ? vfs_writev+0xb9/0x110
Feb 03 08:28:23 linuxsys kernel: amdgpu_drm_ioctl+0x49/0x80 [amdgpu]
Feb 03 08:28:23 linuxsys kernel: do_vfs_ioctl+0xa4/0x630
Feb 03 08:28:23 linuxsys kernel: ? __sys_recvmsg+0x4e/0x90
Feb 03 08:28:23 linuxsys kernel: ? __sys_recvmsg+0x7d/0x90
Feb 03 08:28:23 linuxsys kernel: SyS_ioctl+0x74/0x80
Feb 03 08:28:23 linuxsys kernel: entry_SYSCALL_64_fastpath+0x20/0x83
Feb 03 08:28:23 linuxsys kernel: RIP: 0033:0x7f22da5dfd87
Feb 03 08:28:23 linuxsys kernel: RSP: 002b:00007ffdbffc9768 EFLAGS: 00000246
Feb 03 08:28:23 linuxsys kernel: Code: e8 17 20 04 00 83 e8 4e 0f b6 d0 48 89 d0 48 c1 e0 05 48 01 d0 48 c1 e0 05 49 03 86 60 01 00 00 84 db 48 8b b8 78 02 00 00 74 18 <48> 8b 07 be 02 00 00 00 48 8b 80 d8 00 00 00 e8 6d 63 c0 cf 84
Feb 03 08:28:23 linuxsys kernel: RIP: dce110_vblank_set+0x4f/0xb0 [amdgpu] RSP: ffff9ee388c7bbe0
Feb 03 08:28:23 linuxsys kernel: CR2: 0000000000000000
Feb 03 08:28:23 linuxsys kernel: ---[ end trace 0e2fa94fe7517dd5 ]---

This usually happens when the system goes to standby (perhaps blank screen would do), and it’s becoming easier to reproduce recently.
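For anyone who wants to pull the faulting function and the call chain out of a journalctl dump like the one above, here is a minimal Python sketch. It only assumes the line layout seen in this log (a `kernel:` prefix, an `IP:` or `RIP: 0010:` line, then `Call Trace:`). The function name `summarize_oops` and the regexes are my own and not part of any tool, and frames marked with `?` are skipped because the kernel flags them as unreliable.

```python
import re

# Text after the "kernel:" prefix of a journalctl line.
KERNEL_MSG = re.compile(r"\bkernel:\s*(.*)$")
# A symbolized address such as "dce110_vblank_set+0x4f/0xb0 [amdgpu]".
SYMBOL = r"([A-Za-z_][\w.]*)\+0x[0-9a-f]+/0x[0-9a-f]+(?:\s+\[(\w+)\])?"
# "IP: sym+off/size [mod]" or "RIP: 0010:sym+off/size [mod]".
FAULT = re.compile(r"^(?:IP|RIP):\s+(?:[0-9a-f]{4}:)?" + SYMBOL)
# One call-trace entry, optionally prefixed with "? ".
FRAME = re.compile(r"^(\?\s+)?" + SYMBOL + r"$")


def summarize_oops(lines):
    """Return (faulting_symbol, module, call_chain) for the first oops found.

    `lines` is any iterable of journalctl kernel log lines.
    """
    fault = module = None
    chain = []
    in_trace = False
    for line in lines:
        m = KERNEL_MSG.search(line)
        if not m:
            continue
        msg = m.group(1).strip()
        if in_trace:
            frame = FRAME.match(msg)
            if frame is None:
                break  # first non-frame line ends the call trace
            if frame.group(1) is None:  # skip unreliable "?" entries
                chain.append(frame.group(2))
            continue
        if msg == "Call Trace:":
            in_trace = True
            continue
        if fault is None:
            hit = FAULT.match(msg)
            if hit:
                fault, module = hit.group(1), hit.group(2)
    return fault, module, chain
```

Pass it any iterable of lines, for example an open file holding saved journalctl output. On the trace above it should pick out dce110_vblank_set in the amdgpu module as the faulting symbol.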

And I found this… I don’t know if the issue is related.

PS: The issue existed since day one, starting from 4.14 kernel, and the modules are unlikely to be involved in the issue as I have observed it happening on a fresh install. The only mitigation right now is to disable power saving features both on battery and plugged in, and avoid locking the screen if possible.

UPDATE 2: I guess this really is a problem… it’s 100% reproducible here. Lock the screen, let the screen go blank for a while, then try waking it up by moving the mouse. It seems not entirely related to the bug I mentioned, but I was able to easily reproduce.

UPDATE 3: I believe this issue (closed and was supposed to be fixed) is a closer match to the one I’m having.

I think this is the question:

?

If so,

  1. Download the PKGBUILD file
  2. Edit it
  3. Build it

I’m not sure about the details at present, just that I fear using a 3rd-party patched kernel may cause issues with mhwd, and that someone back then suggested not to use kernels and drivers from AUR so I’ve been reluctant…

At present I’d also prefer official kernels to be able to check if issues might have been solved officially as well.

I’ll try to look for a PKGBUILD intended for building a kernel package when I have time.

BUMP: Added an update in the first post regarding the detail of the crash, and edited the title.

Facing the same issue. Using kernel 4.15 (it was the same with 4.14) with AMDGPU for an AMD RX580 8GB GPU. The CPU is an Intel Xeon Processor E5-2660 v4. The system freezes when switching off the display.
Relevant kernel logs:

Feb 09 09:40:59 hostname kernel: BUG: unable to handle kernel NULL pointer dereference at           (null)
Feb 09 09:40:59 hostname kernel: IP: dce110_vblank_set+0x4f/0xb0 [amdgpu]
Feb 09 09:40:59 hostname kernel: PGD 80000004636b8067 P4D 80000004636b8067 PUD 46aff3067 PMD 0
Feb 09 09:40:59 hostname kernel: Oops: 0000 [#1] PREEMPT SMP PTI
Feb 09 09:40:59 hostname kernel: Modules linked in: tun xt_recent ip6t_REJECT nf_reject_ipv6 nf_log_ipv6 xt_hl ip6t_rt nf_conntrack_ipv6 nf_defrag_ipv6 ipt_REJECT nf_reject_ipv4 nf_log_ipv4 nf_log_com
Feb 09 09:40:59 hostname kernel:  snd_usbmidi_lib drm_kms_helper aesni_intel snd_rawmidi btusb snd_hwdep snd_seq_device aes_x86_64 btrtl crypto_simd glue_helper btbcm snd_pcm cryptd btintel intel_csta
Feb 09 09:40:59 hostname kernel: CPU: 15 PID: 1384 Comm: xfwm4 Not tainted 4.15.0-1-MANJARO #1
Feb 09 09:40:59 hostname kernel: Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./EPC612D4I, BIOS P1.20 07/13/2015
Feb 09 09:40:59 hostname kernel: RIP: 0010:dce110_vblank_set+0x4f/0xb0 [amdgpu]
Feb 09 09:40:59 hostname kernel: RSP: 0018:ffffaa3c826bfbe0 EFLAGS: 00010002
Feb 09 09:40:59 hostname kernel: RAX: ffff904c240b8000 RBX: 0000000000000001 RCX: 0000000000000000
Feb 09 09:40:59 hostname kernel: RDX: 0000000000000000 RSI: 000000000000000c RDI: 0000000000000000
Feb 09 09:40:59 hostname kernel: RBP: ffff904c20a914c0 R08: 0000000000000000 R09: 0000000000000000
Feb 09 09:40:59 hostname kernel: R10: 00007ffcee9139a0 R11: ffff904c24258000 R12: ffff904c16f5dc80
Feb 09 09:40:59 hostname kernel: R13: ffffffffc116dba0 R14: ffff904c1f008000 R15: ffff904c22693368
Feb 09 09:40:59 hostname kernel: FS:  00007f595d5c1980(0000) GS:ffff904c2f1c0000(0000) knlGS:0000000000000000
Feb 09 09:40:59 hostname kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Feb 09 09:40:59 hostname kernel: CR2: 0000000000000000 CR3: 0000000468f4a003 CR4: 00000000001606e0
Feb 09 09:40:59 hostname kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Feb 09 09:40:59 hostname kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Feb 09 09:40:59 hostname kernel: Call Trace:
Feb 09 09:40:59 hostname kernel:  amdgpu_dm_set_crtc_irq_state+0x31/0x60 [amdgpu]
Feb 09 09:40:59 hostname kernel:  amdgpu_irq_update+0x55/0x90 [amdgpu]
Feb 09 09:40:59 hostname kernel:  drm_vblank_enable+0x84/0x100 [drm]
Feb 09 09:40:59 hostname kernel:  drm_vblank_get+0x8d/0xb0 [drm]
Feb 09 09:40:59 hostname kernel:  drm_wait_vblank_ioctl+0x12a/0x690 [drm]
Feb 09 09:40:59 hostname kernel:  ? unix_stream_recvmsg+0x53/0x70
Feb 09 09:40:59 hostname kernel:  ? drm_legacy_modeset_ctl_ioctl+0x100/0x100 [drm]
Feb 09 09:40:59 hostname kernel:  drm_ioctl_kernel+0x5b/0xb0 [drm]
Feb 09 09:40:59 hostname kernel:  drm_ioctl+0x2d5/0x370 [drm]
Feb 09 09:40:59 hostname kernel:  ? drm_legacy_modeset_ctl_ioctl+0x100/0x100 [drm]
Feb 09 09:40:59 hostname kernel:  ? do_iter_write+0xdc/0x190
Feb 09 09:40:59 hostname kernel:  ? vfs_writev+0xb9/0x110
Feb 09 09:40:59 hostname kernel:  amdgpu_drm_ioctl+0x49/0x80 [amdgpu]
Feb 09 09:40:59 hostname kernel:  do_vfs_ioctl+0xa4/0x630
Feb 09 09:40:59 hostname kernel:  ? __sys_recvmsg+0x4e/0x90
Feb 09 09:40:59 hostname kernel:  ? __sys_recvmsg+0x7d/0x90
Feb 09 09:40:59 hostname kernel:  SyS_ioctl+0x74/0x80
Feb 09 09:40:59 hostname kernel:  entry_SYSCALL_64_fastpath+0x20/0x83
Feb 09 09:40:59 hostname kernel: RIP: 0033:0x7f5959c0bd87
Feb 09 09:40:59 hostname kernel: RSP: 002b:00007ffcee9139e8 EFLAGS: 00000246
Feb 09 09:40:59 hostname kernel: Code: e8 17 20 04 00 83 e8 4e 0f b6 d0 48 89 d0 48 c1 e0 05 48 01 d0 48 c1 e0 05 49 03 86 60 01 00 00 84 db 48 8b b8 78 02 00 00 74 18 <48> 8b 07 be 02 00 00 00 48 8b
Feb 09 09:40:59 hostname kernel: RIP: dce110_vblank_set+0x4f/0xb0 [amdgpu] RSP: ffffaa3c826bfbe0
Feb 09 09:40:59 hostname kernel: CR2: 0000000000000000
Feb 09 09:40:59 hostname kernel: ---[ end trace a4dc8aafeb19f667 ]---
Feb 09 09:40:59 hostname kernel: note: xfwm4[1384] exited with preempt_count 3
Feb 09 09:42:48 hostname kernel: [UFW BLOCK] IN=enp2s0 OUT= MAC=01:00:5e:00:00:01:38:10:d5:19:3e:e8:08:00 SRC=192.168.178.1 DST=224.0.0.1 LEN=36 TOS=0x00 PREC=0xC0 TTL=1 ID=20096 DF PROTO=2
Feb 09 09:44:01 hostname kernel: [drm] {2560x1440, 2720x1481@241500Khz}
Feb 09 09:44:01 hostname kernel: usb 3-10.3: USB disconnect, device number 5
Feb 09 09:44:23 hostname kernel: watchdog: BUG: soft lockup - CPU#7 stuck for 23s! [panel-16-cpufre:1541]
Feb 09 09:44:23 hostname kernel: Modules linked in: tun xt_recent ip6t_REJECT nf_reject_ipv6 nf_log_ipv6 xt_hl ip6t_rt nf_conntrack_ipv6 nf_defrag_ipv6 ipt_REJECT nf_reject_ipv4 nf_log_ipv4 nf_log_com
Feb 09 09:44:23 hostname kernel:  snd_usbmidi_lib drm_kms_helper aesni_intel snd_rawmidi btusb snd_hwdep snd_seq_device aes_x86_64 btrtl crypto_simd glue_helper btbcm snd_pcm cryptd btintel intel_csta
Feb 09 09:44:23 hostname kernel: CPU: 7 PID: 1541 Comm: panel-16-cpufre Tainted: G      D          4.15.0-1-MANJARO #1
Feb 09 09:44:23 hostname kernel: Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./EPC612D4I, BIOS P1.20 07/13/2015
Feb 09 09:44:23 hostname kernel: RIP: 0010:smp_call_function_single+0x98/0x120
Feb 09 09:44:23 hostname kernel: RSP: 0018:ffffaa3c82dd7be0 EFLAGS: 00000202 ORIG_RAX: ffffffffffffff11
Feb 09 09:44:23 hostname kernel: RAX: ffff904c2efe06c0 RBX: 00000000000003e7 RCX: 0000000000000001
Feb 09 09:44:23 hostname kernel: RDX: 0000000000000000 RSI: ffffffff8703f1a0 RDI: 000000000000000e
Feb 09 09:44:23 hostname kernel: RBP: ffffaa3c82dd7c30 R08: ffffffffffffc000 R09: 000000000fffffff
Feb 09 09:44:23 hostname kernel: R10: ffffaa3c82dd7c48 R11: 0000000000000000 R12: 000004e15b159670
Feb 09 09:44:23 hostname kernel: R13: 0000000000000001 R14: ffffffff87267530 R15: 0000000000000000
Feb 09 09:44:23 hostname kernel: FS:  00007f8c46cbbe00(0000) GS:ffff904c2efc0000(0000) knlGS:0000000000000000
Feb 09 09:44:23 hostname kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Feb 09 09:44:23 hostname kernel: CR2: 00007ff0a8d48000 CR3: 0000000454aec006 CR4: 00000000001606e0
Feb 09 09:44:23 hostname kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Feb 09 09:44:23 hostname kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Feb 09 09:44:23 hostname kernel: Call Trace:
Feb 09 09:44:23 hostname kernel:  ? recalibrate_cpu_khz+0x10/0x10
Feb 09 09:44:23 hostname kernel:  ? aperfmperf_snapshot_cpu+0x5a/0x70
Feb 09 09:44:23 hostname kernel:  aperfmperf_snapshot_cpu+0x5a/0x70
Feb 09 09:44:23 hostname kernel:  arch_freq_prepare_all+0x3d/0x80
Feb 09 09:44:23 hostname kernel:  cpuinfo_open+0xe/0x20
Feb 09 09:44:23 hostname kernel:  proc_reg_open+0x70/0x120
Feb 09 09:44:23 hostname kernel:  ? proc_reg_llseek+0x70/0x70
Feb 09 09:44:23 hostname kernel:  do_dentry_open+0x1b3/0x2d0
Feb 09 09:44:23 hostname kernel:  path_openat+0x4f9/0x12e0
Feb 09 09:44:23 hostname kernel:  do_filp_open+0x9b/0x110
Feb 09 09:44:23 hostname kernel:  ? fsnotify_grab_connector+0x3c/0x60
Feb 09 09:44:23 hostname kernel:  ? __check_object_size+0xaf/0x1b0
Feb 09 09:44:23 hostname kernel:  ? __alloc_fd+0xaf/0x160
Feb 09 09:44:23 hostname kernel:  ? do_sys_open+0x1bd/0x250
Feb 09 09:44:23 hostname kernel:  do_sys_open+0x1bd/0x250
Feb 09 09:44:23 hostname kernel:  entry_SYSCALL_64_fastpath+0x20/0x83
Feb 09 09:44:23 hostname kernel: RIP: 0033:0x7f8c44a06180
Feb 09 09:44:23 hostname kernel: RSP: 002b:00007ffff46edd70 EFLAGS: 00000293
Feb 09 09:44:23 hostname kernel: Code: 58 0f 1f 44 00 00 f6 c4 02 74 7e 85 c9 75 57 48 c7 c0 c0 06 02 00 65 48 03 05 0d 0d f0 78 8b 48 18 83 e1 01 74 0a f3 90 8b 48 18 <83> e1 01 75 f6 83 48 18 01 48
Feb 09 09:44:51 hostname kernel: watchdog: BUG: soft lockup - CPU#7 stuck for 23s! [panel-16-cpufre:1541]
Feb 09 09:44:51 hostname kernel: Modules linked in: tun xt_recent ip6t_REJECT nf_reject_ipv6 nf_log_ipv6 xt_hl ip6t_rt nf_conntrack_ipv6 nf_defrag_ipv6 ipt_REJECT nf_reject_ipv4 nf_log_ipv4 nf_log_com
Feb 09 09:44:51 hostname kernel:  snd_usbmidi_lib drm_kms_helper aesni_intel snd_rawmidi btusb snd_hwdep snd_seq_device aes_x86_64 btrtl crypto_simd glue_helper btbcm snd_pcm cryptd btintel intel_csta
Feb 09 09:44:51 hostname kernel: CPU: 7 PID: 1541 Comm: panel-16-cpufre Tainted: G      D      L   4.15.0-1-MANJARO #1
Feb 09 09:44:51 hostname kernel: Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./EPC612D4I, BIOS P1.20 07/13/2015
Feb 09 09:44:51 hostname kernel: RIP: 0010:smp_call_function_single+0x95/0x120
Feb 09 09:44:51 hostname kernel: RSP: 0018:ffffaa3c82dd7be0 EFLAGS: 00000202 ORIG_RAX: ffffffffffffff11
Feb 09 09:44:51 hostname kernel: RAX: ffff904c2efe06c0 RBX: 00000000000003e7 RCX: 0000000000000001
Feb 09 09:44:51 hostname kernel: RDX: 0000000000000000 RSI: ffffffff8703f1a0 RDI: 000000000000000e
Feb 09 09:44:51 hostname kernel: RBP: ffffaa3c82dd7c30 R08: ffffffffffffc000 R09: 000000000fffffff
Feb 09 09:44:51 hostname kernel: R10: ffffaa3c82dd7c48 R11: 0000000000000000 R12: 000004e15b159670
Feb 09 09:44:51 hostname kernel: R13: 0000000000000001 R14: ffffffff87267530 R15: 0000000000000000
Feb 09 09:44:51 hostname kernel: FS:  00007f8c46cbbe00(0000) GS:ffff904c2efc0000(0000) knlGS:0000000000000000
Feb 09 09:44:51 hostname kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Feb 09 09:44:51 hostname kernel: CR2: 00007ff0a8d48000 CR3: 0000000454aec006 CR4: 00000000001606e0
Feb 09 09:44:51 hostname kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Feb 09 09:44:51 hostname kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Feb 09 09:44:51 hostname kernel: Call Trace:
Feb 09 09:44:51 hostname kernel:  ? recalibrate_cpu_khz+0x10/0x10
Feb 09 09:44:51 hostname kernel:  ? aperfmperf_snapshot_cpu+0x5a/0x70
Feb 09 09:44:51 hostname kernel:  aperfmperf_snapshot_cpu+0x5a/0x70
Feb 09 09:44:51 hostname kernel:  arch_freq_prepare_all+0x3d/0x80
Feb 09 09:44:51 hostname kernel:  cpuinfo_open+0xe/0x20
Feb 09 09:44:51 hostname kernel:  proc_reg_open+0x70/0x120
Feb 09 09:44:51 hostname kernel:  ? proc_reg_llseek+0x70/0x70
Feb 09 09:44:51 hostname kernel:  do_dentry_open+0x1b3/0x2d0
Feb 09 09:44:51 hostname kernel:  path_openat+0x4f9/0x12e0
Feb 09 09:44:51 hostname kernel:  do_filp_open+0x9b/0x110
Feb 09 09:44:51 hostname kernel:  ? fsnotify_grab_connector+0x3c/0x60
Feb 09 09:44:51 hostname kernel:  ? __check_object_size+0xaf/0x1b0
Feb 09 09:44:51 hostname kernel:  ? __alloc_fd+0xaf/0x160
Feb 09 09:44:51 hostname kernel:  ? do_sys_open+0x1bd/0x250
Feb 09 09:44:51 hostname kernel:  do_sys_open+0x1bd/0x250
Feb 09 09:44:51 hostname kernel:  entry_SYSCALL_64_fastpath+0x20/0x83
Feb 09 09:44:51 hostname kernel: RIP: 0033:0x7f8c44a06180
Feb 09 09:44:51 hostname kernel: RSP: 002b:00007ffff46edd70 EFLAGS: 00000293
Feb 09 09:44:51 hostname kernel: Code: 73 0c 9c 58 0f 1f 44 00 00 f6 c4 02 74 7e 85 c9 75 57 48 c7 c0 c0 06 02 00 65 48 03 05 0d 0d f0 78 8b 48 18 83 e1 01 74 0a f3 90 <8b> 48 18 83 e1 01 75 f6 83 48
Feb 09 09:44:53 hostname kernel: [UFW BLOCK] IN=enp2s0 OUT= MAC=01:00:5e:00:00:01:38:10:d5:19:3e:e8:08:00 SRC=192.168.178.1 DST=224.0.0.1 LEN=36 TOS=0x00 PREC=0xC0 TTL=1 ID=26419 DF PROTO=2
Feb 09 09:44:56 hostname kernel: INFO: rcu_preempt detected stalls on CPUs/tasks:
Feb 09 09:44:56 hostname kernel:         13-...0: (1 GPs behind) idle=c6e/140000000000000/0 softirq=78498/78498 fqs=4884
Feb 09 09:44:56 hostname kernel:         (detected by 4, t=18002 jiffies, g=187112, c=187111, q=563)
Feb 09 09:44:56 hostname kernel: Sending NMI from CPU 4 to CPUs 13:
Feb 09 09:44:56 hostname kernel: NMI backtrace for cpu 13
Feb 09 09:44:56 hostname kernel: CPU: 13 PID: 650 Comm: Xorg Tainted: G      D      L   4.15.0-1-MANJARO #1
Feb 09 09:44:56 hostname kernel: Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./EPC612D4I, BIOS P1.20 07/13/2015
Feb 09 09:44:56 hostname kernel: RIP: 0010:native_queued_spin_lock_slowpath+0x12d/0x190
Feb 09 09:44:56 hostname kernel: RSP: 0018:ffffaa3c82b8fa78 EFLAGS: 00000002
Feb 09 09:44:56 hostname kernel: Code: 89 c2 89 d0 66 31 c0 41 39 c0 74 ea 4d 85 c9 c6 07 01 74 28 41 c7 41 08 01 00 00 00 eb 96 83 fa 01 74 98 8b 07 84 c0 74 08 f3 90 <8b> 07 84 c0 75 f8 b8 01 00 00
Feb 09 09:45:19 hostname kernel: watchdog: BUG: soft lockup - CPU#7 stuck for 23s! [panel-16-cpufre:1541]
Feb 09 09:45:19 hostname kernel: Modules linked in: tun xt_recent ip6t_REJECT nf_reject_ipv6 nf_log_ipv6 xt_hl ip6t_rt nf_conntrack_ipv6 nf_defrag_ipv6 ipt_REJECT nf_reject_ipv4 nf_log_ipv4 nf_log_com
Feb 09 09:45:19 hostname kernel:  snd_usbmidi_lib drm_kms_helper aesni_intel snd_rawmidi btusb snd_hwdep snd_seq_device aes_x86_64 btrtl crypto_simd glue_helper btbcm snd_pcm cryptd btintel intel_csta
Feb 09 09:45:19 hostname kernel: CPU: 7 PID: 1541 Comm: panel-16-cpufre Tainted: G      D      L   4.15.0-1-MANJARO #1
Feb 09 09:45:19 hostname kernel: Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./EPC612D4I, BIOS P1.20 07/13/2015
Feb 09 09:45:19 hostname kernel: RIP: 0010:smp_call_function_single+0x95/0x120
Feb 09 09:45:19 hostname kernel: RSP: 0018:ffffaa3c82dd7be0 EFLAGS: 00000202 ORIG_RAX: ffffffffffffff11
Feb 09 09:45:19 hostname kernel: RAX: ffff904c2efe06c0 RBX: 00000000000003e7 RCX: 0000000000000001
Feb 09 09:45:19 hostname kernel: RDX: 0000000000000000 RSI: ffffffff8703f1a0 RDI: 000000000000000e
Feb 09 09:45:19 hostname kernel: RBP: ffffaa3c82dd7c30 R08: ffffffffffffc000 R09: 000000000fffffff
Feb 09 09:45:19 hostname kernel: R10: ffffaa3c82dd7c48 R11: 0000000000000000 R12: 000004e15b159670
Feb 09 09:45:19 hostname kernel: R13: 0000000000000001 R14: ffffffff87267530 R15: 0000000000000000
Feb 09 09:45:19 hostname kernel: FS:  00007f8c46cbbe00(0000) GS:ffff904c2efc0000(0000) knlGS:0000000000000000
Feb 09 09:45:19 hostname kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Feb 09 09:45:19 hostname kernel: CR2: 00007ff0a8d48000 CR3: 0000000454aec006 CR4: 00000000001606e0
Feb 09 09:45:19 hostname kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Feb 09 09:45:19 hostname kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Feb 09 09:45:19 hostname kernel: Call Trace:
Feb 09 09:45:19 hostname kernel:  ? recalibrate_cpu_khz+0x10/0x10
Feb 09 09:45:19 hostname kernel:  ? aperfmperf_snapshot_cpu+0x5a/0x70
Feb 09 09:45:19 hostname kernel:  aperfmperf_snapshot_cpu+0x5a/0x70
Feb 09 09:45:19 hostname kernel:  arch_freq_prepare_all+0x3d/0x80
Feb 09 09:45:19 hostname kernel:  cpuinfo_open+0xe/0x20
Feb 09 09:45:19 hostname kernel:  proc_reg_open+0x70/0x120
Feb 09 09:45:19 hostname kernel:  ? proc_reg_llseek+0x70/0x70
Feb 09 09:45:19 hostname kernel:  do_dentry_open+0x1b3/0x2d0
Feb 09 09:45:19 hostname kernel:  path_openat+0x4f9/0x12e0
Feb 09 09:45:19 hostname kernel:  do_filp_open+0x9b/0x110
Feb 09 09:45:19 hostname kernel:  ? fsnotify_grab_connector+0x3c/0x60
Feb 09 09:45:19 hostname kernel:  ? __check_object_size+0xaf/0x1b0
Feb 09 09:45:19 hostname kernel:  ? __alloc_fd+0xaf/0x160
Feb 09 09:45:19 hostname kernel:  ? do_sys_open+0x1bd/0x250
Feb 09 09:45:19 hostname kernel:  do_sys_open+0x1bd/0x250
Feb 09 09:45:19 hostname kernel:  entry_SYSCALL_64_fastpath+0x20/0x83
Feb 09 09:45:19 hostname kernel: RIP: 0033:0x7f8c44a06180
Feb 09 09:45:19 hostname kernel: RSP: 002b:00007ffff46edd70 EFLAGS: 00000293
Feb 09 09:45:19 hostname kernel: Code: 73 0c 9c 58 0f 1f 44 00 00 f6 c4 02 74 7e 85 c9 75 57 48 c7 c0 c0 06 02 00 65 48 03 05 0d 0d f0 78 8b 48 18 83 e1 01 74 0a f3 90 <8b> 48 18 83 e1 01 75 f6 83 48

CPUs 2 and 7 then constantly get stuck, and I need to reboot the system using the hardware button.
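To see at a glance which CPUs and tasks keep getting stuck, the watchdog lines can be tallied. This is only a rough Python sketch that assumes the `soft lockup - CPU#N stuck for Ns! [task:pid]` wording seen in the log above; `count_soft_lockups` is a made-up helper name. Note that the excerpt posted here only shows CPU#7.

```python
import re
from collections import Counter

# Matches e.g. "watchdog: BUG: soft lockup - CPU#7 stuck for 23s! [panel-16-cpufre:1541]"
LOCKUP = re.compile(r"soft lockup - CPU#(\d+) stuck for (\d+)s! \[(.+?):(\d+)\]")


def count_soft_lockups(lines):
    """Count soft-lockup reports per (cpu, task name) pair."""
    counts = Counter()
    for line in lines:
        m = LOCKUP.search(line)
        if m:
            counts[(int(m.group(1)), m.group(3))] += 1
    return counts
```

Calling `count_soft_lockups(...).most_common()` on the saved log lists the worst offenders first.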

Null pointer dereference? That’s a bug in the kernel and/or driver which we can’t fix. Please report upstream.

Already reported here.

Given that there were a couple of closed/fixed reports of similar issues in earlier kernel versions, I believe this is a regression.

UPDATE: A couple of patches were made available in the bug report which fixed the crashing issue as far as I have tested.

Note that the patches were based on amd-staging-next, whose amdgpu_dm.c differs by one line from the 4.15 version. That difference causes Patch 1 to be rejected when building the 4.15 kernel using the PKGBUILD file. You need to edit the patch file manually so that it matches the corresponding block in the 4.15 version (by removing the additional line that caused the rejection, check here) to get it accepted so the kernel can be built.
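If you would rather not hand-edit the hunk, the idea can be sketched in Python: drop the offending line from the hunk body and recount the `@@ -start,count +start,count @@` header so the line counts stay consistent. This is only an illustration of the technique. `drop_hunk_line` is a hypothetical helper, the hunk in the usage example is invented and is not the real amdgpu_dm.c change, and every body line is assumed to keep its leading ` `, `-` or `+` marker.

```python
import re

# "@@ -start[,count] +start[,count] @@ optional section text"
HEADER = re.compile(r"^@@ -(\d+)(?:,\d+)? \+(\d+)(?:,\d+)? @@(.*)$")


def drop_hunk_line(hunk, text):
    """Remove the first body line whose content equals `text`, then fix the header.

    `hunk` is a list of lines: the "@@" header followed by the hunk body.
    """
    head = HEADER.match(hunk[0])
    if head is None:
        raise ValueError("first line is not a unified-diff hunk header")
    body = list(hunk[1:])
    for i, line in enumerate(body):
        if line[:1] in (" ", "-", "+") and line[1:] == text:
            del body[i]
            break
    else:
        raise ValueError("line not found in hunk")
    # Old-file length counts context and removed lines;
    # new-file length counts context and added lines.
    old = sum(1 for l in body if l[:1] in (" ", "-"))
    new = sum(1 for l in body if l[:1] in (" ", "+"))
    header = f"@@ -{head.group(1)},{old} +{head.group(2)},{new} @@{head.group(3)}"
    return [header] + body


# Hypothetical hunk, for illustration only.
hunk = [
    "@@ -10,4 +10,5 @@ void example(void)",
    "     int a = 0;",
    "     int extra = 1;",
    "+    int b = 2;",
    "     return;",
    " }",
]
fixed = drop_hunk_line(hunk, "    int extra = 1;")
```

The start line numbers of later hunks are left alone here; `patch` normally tolerates small offsets, but treat that as something to verify on your own build.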


BUMP: It seems the 4.16 kernel included some of the patches. However, the issue remains on a vanilla build, and you still need this patch (Patch 3) to get the problem fixed. Patches 1, 2 and 4 are already included in the 4.16 kernel and do not need to be applied again.

Putting this here as it's a forum search result for GL702ZC.

Asus have just made BIOS version 306 available ("Improve system stability").

If anyone is still having issues with spurious freezes and reboots then a BIOS update is worth trying.

It's been quite a while since I reported the issue, and many Linux kernel versions have passed since then (it's now at 5.2 at the time of writing this reply).

However, I've since disabled most of the power-saving features and avoid locking the screen (the lock screen doesn't honor the settings and would still blank the display after a few minutes), because I still encounter the issue from time to time on my laptop, although it's no longer 100% reproducible as it was when I reported it.

On my other desktops at home, which also use AMD video cards, I haven't encountered this yet (those systems are all on kernels released after the fixes). It's also possible that this is because I'm using Cinnamon instead of XFCE on those systems, since I have a 4K monitor and XFCE doesn't scale as well as Cinnamon on it.

I do suspect some other factors might be contributing, such as the board's own power-saving mechanisms, the built-in monitor itself, as well as potential issues with virtualization software such as VMware/VirtualBox, since I also run a Windows VM at work for some Windows-specific development workloads.


I've been having some reboots and black screens over the past few days; possibly something changed in the latest 5.2 and 5.3rc8 releases. 5.3rc7 seemed OK, so I need to test 5.3 final.
