Problems using Kernel 4.20 (Startup, Shutdown, Running)

manjaro
intelgpu
kernel420

#1

Hello,
I have a lot of problems, using another kernel excepts Linux manjaro 4.14.101-1-MANJARO.

I’m more or less an linux newbie, when it’s getting more detailed.

The Problems:

  • Boot on 4.20 takes long
  • System hangs, slow down, can’t do anything anymore
  • System won’t shutdown

I hope you can help me, I’ll give you guys any information needed.

Maybe there’s something wrong with my GPU-Drivers? Installed via Architect.
Like you can see, I have an Intel UHD Graphics 620 and an GeForce MX150

inxi:

System:    Host: manjaro Kernel: 4.20.10-1-MANJARO x86_64 bits: 64 compiler: gcc v: 8.2.1 
           Desktop: Awesome 4.3 Distro: Manjaro Linux 
Machine:   Type: Laptop System: LENOVO product: 20L8S02E00 v: ThinkPad T480s serial: <filter> 
           Mobo: LENOVO model: 20L8S02E00 serial: <filter> UEFI: LENOVO v: N22ET35W (1.12 ) 
           date: 04/09/2018 
Battery:   ID-1: BAT0 charge: 51.6 Wh condition: 53.5/57.0 Wh (94%) model: LGC 01AV478 
           status: Unknown 
CPU:       Topology: Quad Core model: Intel Core i7-8550U bits: 64 type: MT MCP 
           arch: Kaby Lake rev: A L2 cache: 8192 KiB 
           flags: lm nx pae sse sse2 sse3 sse4_1 sse4_2 ssse3 vmx bogomips: 31880 
           Speed: 901 MHz min/max: 400/4000 MHz Core speeds (MHz): 1: 900 2: 900 3: 900 
           4: 900 5: 900 6: 902 7: 900 8: 900 
Graphics:  Device-1: Intel UHD Graphics 620 vendor: Lenovo driver: i915 v: kernel 
           bus ID: 00:02.0 
           Device-2: NVIDIA GP108M [GeForce MX150] vendor: Lenovo driver: nouveau v: kernel 
           bus ID: 01:00.0 
           Display: x11 server: X.org 1.20.3 driver: intel,modesetting 
           resolution: <xdpyinfo missing> 
           Message: Unable to show advanced data. Required tool glxinfo missing. 
Audio:     Device-1: Intel Sunrise Point-LP HD Audio vendor: Lenovo driver: snd_hda_intel 
           v: kernel bus ID: 00:1f.3 
           Sound Server: ALSA v: k4.20.10-1-MANJARO 
Network:   Device-1: Intel Ethernet I219-V vendor: Lenovo driver: e1000e v: 3.2.6-k 
           port: efa0 bus ID: 00:1f.6 
           IF: enp0s31f6 state: down mac: <filter> 
           Device-2: Intel Wireless 8265 / 8275 driver: iwlwifi v: kernel port: d000 
           bus ID: 3d:00.0 
           IF: wlp61s0 state: up mac: <filter> 
Drives:    Local Storage: total: 476.94 GiB used: 124.01 GiB (26.0%) 
           ID-1: /dev/nvme0n1 vendor: Samsung model: MZVLB512HAJQ-000L7 size: 476.94 GiB 
Partition: ID-1: / size: 452.21 GiB used: 124.01 GiB (27.4%) fs: ext4 dev: /dev/nvme0n1p2 
           ID-2: swap-1 size: 16.00 GiB used: 0 KiB (0.0%) fs: swap dev: /dev/nvme0n1p3 
Sensors:   System Temperatures: cpu: 64.0 C mobo: N/A gpu: nouveau temp: 511 C 
           Fan Speeds (RPM): cpu: 3393 
Info:      Processes: 193 Uptime: 1m Memory: 15.56 GiB used: 422.9 MiB (2.7%) Init: systemd 
           Compilers: gcc: 8.2.1 Shell: zsh v: 5.7.1 inxi: 3.0.30 

Last lines of dmesg, after shutdown via. holding powerbutton and reboot on 4.20:

[   70.385862] ------------[ cut here ]------------
[   70.385864] nouveau 0000:01:00.0: timeout
[   70.385899] WARNING: CPU: 1 PID: 245 at drivers/gpu/drm/nouveau/nvkm/subdev/mmu/vmmgf100.c:207 gf100_vmm_flush_+0x14e/0x190 [nouveau]
[   70.385900] Modules linked in: fuse ccm snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic joydev mousedev elan_i2c arc4 i915 nouveau iwlmvm intel_rapl x86_pkg_temp_thermal intel_powerclamp kvm_intel kvmgt vfio_mdev mdev vfio_iommu_type1 vfio mac80211 kvm snd_soc_skl snd_soc_hdac_hda snd_hda_ext_core snd_soc_skl_ipc snd_soc_sst_ipc snd_soc_sst_dsp mxm_wmi snd_soc_acpi_intel_match snd_soc_acpi ttm crct10dif_pclmul irqbypass i2c_algo_bit crc32_pclmul snd_soc_core iwlwifi ghash_clmulni_intel nls_iso8859_1 drm_kms_helper nls_cp437 snd_compress vfat ac97_bus snd_pcm_dmaengine fat snd_hda_intel snd_hda_codec wmi_bmof iTCO_wdt iTCO_vendor_support drm cfg80211 snd_hda_core intel_wmi_thunderbolt snd_hwdep aesni_intel aes_x86_64 crypto_simd cryptd snd_pcm glue_helper intel_cstate thinkpad_acpi intel_uncore intel_gtt mei_me agpgart snd_timer nvram syscopyarea idma64 snd sysfillrect intel_rapl_perf e1000e psmouse input_leds processor_thermal_device int3403_thermal tpm_crb mei
[   70.385921]  intel_soc_dts_iosf sysimgblt intel_lpss_pci fb_sys_fops rfkill i2c_i801 intel_lpss intel_pch_thermal wmi soundcore battery int340x_thermal_zone ac tpm_tis tpm_tis_core evdev mac_hid tpm rng_core int3400_thermal acpi_thermal_rel pcc_cpufreq coretemp msr acpi_call(OE) crypto_user ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 fscrypto serio_raw atkbd libps2 xhci_pci crc32c_intel xhci_hcd i8042 serio
[   70.385934] CPU: 1 PID: 245 Comm: kworker/1:2 Tainted: G        W  OE     4.20.10-1-MANJARO #1
[   70.385934] Hardware name: LENOVO 20L8S02E00/20L8S02E00, BIOS N22ET35W (1.12 ) 04/09/2018
[   70.385938] Workqueue: pm pm_runtime_work
[   70.385956] RIP: 0010:gf100_vmm_flush_+0x14e/0x190 [nouveau]
[   70.385956] Code: 5f e9 c6 87 7b dd 48 8b 7d 10 48 8b 5f 50 48 85 db 74 4c e8 04 ed 51 dd 48 89 da 48 c7 c7 e6 ee ff c0 48 89 c6 e8 3c 08 fe dc <0f> 0b eb c2 48 8b 7d 10 48 8b 5f 50 48 85 db 74 1e e8 dc ec 51 dd
[   70.385957] RSP: 0018:ffff988dc1f177e0 EFLAGS: 00010282
[   70.385958] RAX: 0000000000000000 RBX: ffff89016c653db0 RCX: 0000000000000000
[   70.385958] RDX: 0000000000000007 RSI: ffffffff9eeaad36 RDI: 00000000ffffffff
[   70.385959] RBP: ffff890167424800 R08: 0000000000000001 R09: 000000000000069b
[   70.385960] R10: 0000000000000003 R11: 0000000000000000 R12: ffff89016b341060
[   70.385960] R13: 0000000fec46bb20 R14: ffff89015d140420 R15: ffff8901575c6a00
[   70.385961] FS:  0000000000000000(0000) GS:ffff89016f240000(0000) knlGS:0000000000000000
[   70.385962] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   70.385962] CR2: 00005613dc38b77c CR3: 00000002dd80a004 CR4: 00000000003606e0
[   70.385963] Call Trace:
[   70.385982]  nvkm_vmm_map+0x143/0x3b0 [nouveau]
[   70.385997]  ? gp100_vmm_pgt_mem+0x180/0x180 [nouveau]
[   70.386000]  ? __switch_to_asm+0x40/0x70
[   70.386001]  ? __switch_to_asm+0x34/0x70
[   70.386014]  nvkm_vram_map+0x56/0x80 [nouveau]
[   70.386030]  nvkm_uvmm_mthd+0x5f8/0x8d0 [nouveau]
[   70.386040]  nvkm_ioctl+0xd8/0x170 [nouveau]
[   70.386049]  nvif_object_mthd+0x108/0x130 [nouveau]
[   70.386052]  ? trace_event_raw_event_workqueue_queue_work+0x90/0xc0
[   70.386054]  ? dma_fence_wait_timeout+0x120/0x120
[   70.386063]  nvif_vmm_map+0x118/0x130 [nouveau]
[   70.386065]  ? reservation_object_wait_timeout_rcu+0x1f2/0x370
[   70.386083]  nouveau_mem_map+0x81/0xf0 [nouveau]
[   70.386101]  nouveau_vma_map+0x44/0x70 [nouveau]
[   70.386118]  nouveau_bo_move_ntfy+0x70/0xd0 [nouveau]
[   70.386122]  ttm_bo_handle_move_mem+0x3a6/0x4f0 [ttm]
[   70.386124]  ttm_bo_evict+0x151/0x1c0 [ttm]
[   70.386142]  ? nouveau_bo_invalidate_caches+0x10/0x10 [nouveau]
[   70.386152]  ? drm_vma_offset_add+0x53/0x60 [drm]
[   70.386153]  ? rwsem_down_write_failed_killable+0xd2/0x2f0
[   70.386156]  ttm_mem_evict_first+0x190/0x210 [ttm]
[   70.386158]  ttm_bo_force_list_clean+0x9f/0x170 [ttm]
[   70.386160]  ? pci_has_legacy_pm_support+0x60/0x60
[   70.386177]  nouveau_do_suspend+0x7f/0x2c0 [nouveau]
[   70.386179]  ? __switch_to_asm+0x40/0x70
[   70.386196]  nouveau_pmops_runtime_suspend+0x42/0xa0 [nouveau]
[   70.386198]  pci_pm_runtime_suspend+0x5f/0x180
[   70.386199]  ? __switch_to_asm+0x40/0x70
[   70.386200]  __rpm_callback+0x75/0x1b0
[   70.386202]  ? __switch_to_asm+0x40/0x70
[   70.386203]  ? pci_has_legacy_pm_support+0x60/0x60
[   70.386204]  rpm_callback+0x1f/0x70
[   70.386205]  ? pci_has_legacy_pm_support+0x60/0x60
[   70.386207]  rpm_suspend+0x138/0x6a0
[   70.386209]  ? finish_task_switch+0x84/0x2d0
[   70.386210]  ? __switch_to_asm+0x40/0x70
[   70.386211]  pm_runtime_work+0x7d/0xa0
[   70.386213]  process_one_work+0x1eb/0x410
[   70.386214]  worker_thread+0x2d/0x3d0
[   70.386216]  ? process_one_work+0x410/0x410
[   70.386217]  kthread+0x112/0x130
[   70.386218]  ? kthread_park+0x80/0x80
[   70.386219]  ret_from_fork+0x35/0x40
[   70.386221] ---[ end trace 40e30b02dd5f2922 ]---
[   70.386467] [TTM] Buffer eviction failed

#2

Then use the 4.14 kernel, if that works, and wait for a new update for 4.20 kernel and test it again.


#3

The Problem is, it doesn’t only appear on 4.20. When I got my T480s, back in August of 2018 I tried to install different Manjaro distributions, but everytime the same problems appear. After hours, I tried different Kernels, and only 4.14 is working for me.
I’ll try them from 4.14 up again and will figure out on which Kernel the problem appears. Maybe I can find something interesting in the Kernel realease information.


#4

I’m not an expert here, but maybe it’s not Kernels fault…

Looking at your video (which is pretty funky nvidia chip on it’s own GeForce MX150), and also considering the fact that it’s having Optimus with Intel card, and looking at logs…

I think it’s most likely video drivers / probably some bios issues.

Have you tried to update bios?
Maybe if your bios has advanced switches, try to search for switching off Optimus technology (so this way OS would think that you have 2 separate video cards) and then remove all video drivers, and install latest video-nvidia.

At least that what i would consider to try exculde Kernel out of equation.

P.S. Also system hangs / slowdowns could possibly indicate some hardware problems, have you tested this machine with Windows or some other Linux OS to exclude it? Though then it’s weird that 4.14 is fine…