NVME Resume from Sleep/Suspend - Missing or invalid SUBNQN Field

I am using a Laptop (Infinity / Tongfang W5; AMD 5900HX & NVIDIA RTX 3080 Max-Q) with dual SSD’s. Initially, I could not get my system to reliably wake from sleep, with the system either appearing in write-only mode or completely locking up.

After referring to Arch Wiki - NVMe Troubleshooting I was able to get the system to resume from suspend reliably, but still can not mount the second drive. I have hit a bit of a wall so any suggestions would be welcome. Update: I think the offending issue at its core is:

nvme nvme0: missing or invalid SUBNQN field.

The secondary drive behaviour is that it will appear in Nautilus, etc. for about 30 seconds after resume before disappearing from the system entirely.

  • System is on nvme1n1; Secondary drive is nvme0n1
  • Both drives contain encrypted partitions that are set up to mount via crypttab and fstab.
  • nvme0n1p1 has a LUKS and an unencrypted partition. After resume from suspend, neither mount.
  • Adding iommu=soft to kernel flags via GRUB does not solve the issue.
  • LTS and Latest Kernel installations behave in the same way.
  • Adding nvme_core.default_ps_max_latency_us=100 resolved R/W issues on nvme1n1 (System drive - Crucial SSD)
  • Determined value of nvme_core.default_ps_max_latency_us via sudo nvme get-feature /dev/ng1n1 -f 0x0c -H; also for /dev/ng0n1. Paths found using sudo nvme list.
  • Drives are a Crucial P5 Plus 2TB PCIe M.2 2280SS Gaming SSD (CT2000P5SSD8) and Seagate FireCuda 510 SSD.
  • After resuming from suspend /dev/nvme0n1 will not appear when invoking sudo nvme list.

My GRUB Options are:

GRUB_CMDLINE_LINUX_DEFAULT="quiet cryptdevice=UUID=a***8748:luks-a***8748 root=/dev/mapper/luks-a***8748 splash apparmor=1 security=apparmor resume=/dev/mapper/luks-4a***ccba1 udev.log_priority=3 nvme_core.default_ps_max_latency_us=100"

Results of lsblk -fa:

NAME   FSTYPE FSVER LABEL    UUID                                 FSAVAIL FSUSE% MOUNTPOINTS
loop0                                                                            
sda                                                                              
nvme0n1
                                                                                 
├─nvme0n1p1
│      crypto 2              d***fa06d                
│ └─luks-d***fa06d
│      ext4   1.0   Data - Crypt
│                            f***3be07  233.7G    69% /mnt/Data-Crypt
└─nvme0n1p2
       ext4   1.0   Data - Clear
                             6***c747  836.7G     4% /mnt/Data-Clear
nvme1n1
                                                                                 
├─nvme1n1p1
│      vfat   FAT32 NO_LABEL 12B0-2F49                             298.9M     0% /boot/efi
├─nvme1n1p2
│      crypto 1              a***8748                
│ └─luks-a***8748
│      ext4   1.0            d***a60be  698.1G    56% /
└─nvme1n1p3
       crypto 1              4***ccba1                
  └─luks-4***ccba1
       swap   1     swap     fa***11dd                [SWAP]

Results of journalctl -b0 -p4 --no-pager:

Oct 22 11:20:52 Laptop123 kernel: nvme nvme0: Device not ready; aborting reset, CSTS=0x3
Oct 22 11:20:52 Laptop123 kernel: nvme nvme0: Removing after probe failure status: -19
Oct 22 11:20:52 Laptop123 systemd-cryptsetup[41320]: device-mapper: remove ioctl on luks-d***fa06d  failed: Device or resource busy
Oct 22 11:20:52 Laptop123 kernel: Buffer I/O error on dev dm-2, logical block 0, lost sync page write
Oct 22 11:20:52 Laptop123 kernel: EXT4-fs (dm-2): I/O error while writing superblock
Oct 22 11:20:52 Laptop123 gnome-shell[4097]: Object .Gjs_drive-menu_gnome-shell-extensions_gcampax_github_com_extension_MountMenuItem (0x55ccac23f810), has been already disposed — impossible to set any property on it. This might be caused by the object having been destroyed from C code using something such as destroy(), dispose(), or remove() vfuncs.
Oct 22 11:20:52 Laptop123 gnome-shell[4097]: Object .Gjs_drive-menu_gnome-shell-extensions_gcampax_github_com_extension_MountMenuItem (0x55cca58357f0), has been already disposed — impossible to set any property on it. This might be caused by the object having been destroyed from C code using something such as destroy(), dispose(), or remove() vfuncs.
Oct 22 11:20:52 Laptop123 gnome-shell[4097]: Object .Gjs_drive-menu_gnome-shell-extensions_gcampax_github_com_extension_MountMenuItem (0x55cca58357f0), has been already disposed — impossible to set any property on it. This might be caused by the object having been destroyed from C code using something such as destroy(), dispose(), or remove() vfuncs.
Oct 22 11:20:52 Laptop123 gnome-shell[4097]: Object .Gjs_drive-menu_gnome-shell-extensions_gcampax_github_com_extension_MountMenuItem (0x55cca58357f0), has been already disposed — impossible to set any property on it. This might be caused by the object having been destroyed from C code using something such as destroy(), dispose(), or remove() vfuncs.
Oct 22 11:20:52 Laptop123 kernel: Buffer I/O error on dev dm-2, logical block 0, async page read
Oct 22 11:20:52 Laptop123 kernel: Buffer I/O error on dev dm-2, logical block 0, async page read
Oct 22 11:20:52 Laptop123 kernel: Buffer I/O error on dev dm-2, logical block 0, async page read
Oct 22 11:20:52 Laptop123 kernel: Buffer I/O error on dev dm-2, logical block 0, async page read
Oct 22 11:20:52 Laptop123 kernel: Buffer I/O error on dev dm-2, logical block 0, async page read
Oct 22 11:20:52 Laptop123 kernel: Buffer I/O error on dev dm-2, logical block 0, async page read
Oct 22 11:20:52 Laptop123 kernel: Buffer I/O error on dev dm-2, logical block 0, async page read
Oct 22 11:20:52 Laptop123 kernel: Buffer I/O error on dev dm-2, logical block 0, async page read

I will post my inxi below, but would appreciate any pointers on where to from here! Please let me know if I can provide any further diagnostic info.

Results of inxi -v7azy:

System:
  Kernel: 6.0.2-2-MANJARO arch: x86_64 bits: 64 compiler: gcc v: 12.2.0
    parameters: BOOT_IMAGE=/boot/vmlinuz-6.0-x86_64
    root=UUID=d***60be rw quiet
    cryptdevice=UUID=a***8748
    root=/dev/mapper/luks-a***8748 splash
    apparmor=1 security=apparmor
    resume=/dev/mapper/luks-4***ccba1
    udev.log_priority=3 nvme_core.default_ps_max_latency_us=100
  Desktop: GNOME v: 42.5 tk: GTK v: 3.24.34 wm: gnome-shell dm: GDM v: 42.0
    Distro: Manjaro Linux base: Arch Linux
Machine:
  Type: Laptop System: _COM1_ product: NBINF-W5-5R9R8N v: Standard
    serial: <superuser required>
  Mobo: Standard model: GM5ZG8W v: Standard serial: <superuser required>
    UEFI: American Megatrends LLC. v: N.1.14COM03 date: 03/20/2021
Battery:
  ID-1: BAT0 charge: 93.5 Wh (100.0%) condition: 93.5/93.5 Wh (100.0%)
    volts: 12.4 min: 11.4 model: standard type: Li-ion serial: <filter>
    status: full
  ID-2: hidpp_battery_0 charge: 93% condition: N/A volts: 4.1 min: N/A
    model: Logitech G502 LIGHTSPEED Wireless Gaming Mouse type: N/A
    serial: <filter> status: discharging
Memory:
  RAM: total: 30.76 GiB used: 1.77 GiB (5.7%)
  RAM Report: permissions: Unable to run dmidecode. Root privileges
    required.
CPU:
  Info: model: AMD Ryzen 9 5900HX with Radeon Graphics bits: 64 type: MT MCP
    arch: Zen 3 gen: 4 level: v3 note: check built: 2021-22 process: TSMC n7
    (7nm) family: 0x19 (25) model-id: 0x50 (80) stepping: 0
    microcode: 0xA50000C
  Topology: cpus: 1x cores: 8 tpc: 2 threads: 16 smt: enabled cache:
    L1: 512 KiB desc: d-8x32 KiB; i-8x32 KiB L2: 4 MiB desc: 8x512 KiB
    L3: 16 MiB desc: 1x16 MiB
  Speed (MHz): avg: 3300 min/max: 1200/4679 boost: enabled scaling:
    driver: acpi-cpufreq governor: schedutil cores: 1: 3300 2: 3300 3: 3300
    4: 3300 5: 3300 6: 3300 7: 3300 8: 3300 9: 3300 10: 3300 11: 3300 12: 3300
    13: 3300 14: 3300 15: 3300 16: 3300 bogomips: 105442
  Flags: 3dnowprefetch abm adx aes aperfmperf apic arat avic avx avx2 bmi1
    bmi2 bpext cat_l3 cdp_l3 clflush clflushopt clwb clzero cmov cmp_legacy
    constant_tsc cpb cppc cpuid cqm cqm_llc cqm_mbm_local cqm_mbm_total
    cqm_occup_llc cr8_legacy cx16 cx8 de decodeassists erms extapic
    extd_apicid f16c flushbyasid fma fpu fsgsbase fsrm fxsr fxsr_opt ht
    hw_pstate ibpb ibrs ibs invpcid irperf lahf_lm lbrv lm mba mca mce
    misalignsse mmx mmxext monitor movbe msr mtrr mwaitx nonstop_tsc nopl npt
    nrip_save nx ospke osvw overflow_recov pae pat pausefilter pclmulqdq
    pdpe1gb perfctr_core perfctr_llc perfctr_nb pfthreshold pge pku pni popcnt
    pse pse36 rapl rdpid rdpru rdrand rdseed rdt_a rdtscp rep_good sep sha_ni
    skinit smap smca smep ssbd sse sse2 sse4_1 sse4_2 sse4a ssse3 stibp succor
    svm svm_lock syscall tce topoext tsc tsc_scale umip v_spec_ctrl
    v_vmsave_vmload vaes vgif vmcb_clean vme vmmcall vpclmulqdq wbnoinvd wdt
    xgetbv1 xsave xsavec xsaveerptr xsaveopt xsaves
  Vulnerabilities:
  Type: itlb_multihit status: Not affected
  Type: l1tf status: Not affected
  Type: mds status: Not affected
  Type: meltdown status: Not affected
  Type: mmio_stale_data status: Not affected
  Type: retbleed status: Not affected
  Type: spec_store_bypass mitigation: Speculative Store Bypass disabled via
    prctl
  Type: spectre_v1 mitigation: usercopy/swapgs barriers and __user pointer
    sanitization
  Type: spectre_v2 mitigation: Retpolines, IBPB: conditional, IBRS_FW,
    STIBP: always-on, RSB filling, PBRSB-eIBRS: Not affected
  Type: srbds status: Not affected
  Type: tsx_async_abort status: Not affected
Graphics:
  Device-1: NVIDIA GA104M [GeForce RTX 3080 Mobile / Max-Q 8GB/16GB]
    vendor: Tongfang Hongkong driver: nvidia v: 520.56.06
    alternate: nouveau,nvidia_drm non-free: 515.xx+ status: current (as of
    2022-10) arch: Ampere code: GAxxx process: TSMC n7 (7nm) built: 2020-22
    pcie: gen: 1 speed: 2.5 GT/s lanes: 8 link-max: gen: 4 speed: 16 GT/s
    lanes: 16 ports: active: none empty: DP-1,HDMI-A-1,eDP-1 bus-ID: 01:00.0
    chip-ID: 10de:249c class-ID: 0300
  Device-2: AMD Cezanne vendor: Tongfang Hongkong driver: amdgpu v: kernel
    arch: GCN-5.1 code: Vega-2 process: TSMC n7 (7nm) built: 2018-21 pcie:
    gen: 3 speed: 8 GT/s lanes: 16 link-max: gen: 4 speed: 16 GT/s ports:
    active: eDP-2 empty: none bus-ID: 06:00.0 chip-ID: 1002:1638
    class-ID: 0300 temp: 49.0 C
  Device-3: Chicony HD Webcam type: USB driver: uvcvideo bus-ID: 1-4:4
    chip-ID: 04f2:b71a class-ID: 0e02 serial: <filter>
  Display: x11 server: X.org v: 1.21.1.4 with: Xwayland v: 22.1.3
    compositor: gnome-shell driver: X: loaded: amdgpu,nvidia
    unloaded: modesetting,nouveau,radeon alternate: fbdev,nv,vesa
    dri: radeonsi gpu: amdgpu display-ID: :1 screens: 1
  Screen-1: 0 s-res: 2560x1440 s-size: <missing: xdpyinfo>
  Monitor-1: eDP-2 mapped: eDP-1 model: BOE Display 0x0974 built: 2020
    res: 2560x1440 hz: 165 dpi: 189 gamma: 1.2 size: 344x194mm (13.54x7.64")
    diag: 395mm (15.5") ratio: 16:9 modes: max: 2560x1440 min: 640x480
  OpenGL: renderer: AMD RENOIR (LLVM 14.0.6 DRM 3.48 6.0.2-2-MANJARO) v: 4.6
    Mesa 22.1.7 direct render: Yes
Logical:
  Message: No logical block device data found.
  Device-1: luks-d***a06d maj-min: 254:2
    type: LUKS dm: dm-2 size: 931.31 GiB
  Components:
  p-1: nvme0n1p1 maj-min: 259:1 size: 931.32 GiB
  Device-2: luks-a***8748 maj-min: 254:0
    type: LUKS dm: dm-0 size: 1.79 TiB
  Components:
  p-1: nvme1n1p2 maj-min: 259:5 size: 1.79 TiB
  Device-3: luks-4***ccba1 maj-min: 254:1
    type: LUKS dm: dm-1 size: 33.83 GiB
  Components:
  p-1: nvme1n1p3 maj-min: 259:6 size: 33.83 GiB
RAID:
  Message: No RAID data found.
Drives:
  Local Storage: total: 3.64 TiB used: 1.64 TiB (45.0%)
  SMART Message: Required tool smartctl not installed. Check --recommends
  ID-1: /dev/nvme0n1 maj-min: 259:0 vendor: Seagate model: FireCuda 510 SSD
    ZP2000GM30001 size: 1.82 TiB block-size: physical: 512 B logical: 512 B
    speed: 31.6 Gb/s lanes: 4 type: SSD serial: <filter> rev: STOSC014
    temp: 41.9 C scheme: GPT
  ID-2: /dev/nvme1n1 maj-min: 259:3 vendor: Crucial model: CT2000P5SSD8
    size: 1.82 TiB block-size: physical: 512 B logical: 512 B speed: 31.6 Gb/s
    lanes: 4 type: SSD serial: <filter> rev: P4CR324 temp: 50.9 C scheme: GPT
  Message: No optical or floppy data found.
Partition:
  ID-1: / raw-size: 1.79 TiB size: 1.76 TiB (98.37%) used: 1009.55 GiB (56.1%)
    fs: ext4 dev: /dev/dm-0 maj-min: 254:0
    mapped: luks-a***8748 label: N/A
    uuid: d***60be
  ID-2: /boot/efi raw-size: 300 MiB size: 299.4 MiB (99.80%) used: 472 KiB
    (0.2%) fs: vfat dev: /dev/nvme1n1p1 maj-min: 259:4 label: NO_LABEL
    uuid: 12B0-2F49
  ID-3: /mnt/Data-Clear raw-size: 931.69 GiB size: 915.99 GiB (98.32%)
    used: 32.71 GiB (3.6%) fs: ext4 dev: /dev/nvme0n1p2 maj-min: 259:2 label: Data
    - Clear uuid: 6***c747
  ID-4: /mnt/Data-Crypt raw-size: 931.31 GiB size: 915.62 GiB (98.32%)
    used: 635.34 GiB (69.4%) fs: ext4 dev: /dev/dm-2 maj-min: 254:2
    mapped: luks-d***a06d label: Data - Crypt
    uuid: f***be07
Swap:
  Kernel: swappiness: 60 (default) cache-pressure: 100 (default)
  ID-1: swap-1 type: partition size: 33.83 GiB used: 0 KiB (0.0%)
    priority: -2 dev: /dev/dm-1 maj-min: 254:1
    mapped: luks-4***ccba1 label: swap
    uuid: f***11dd
Unmounted:
  Message: No unmounted partitions found.

Sensors:
  System Temperatures: cpu: 61.2 C mobo: N/A gpu: amdgpu temp: 52.0 C
  Fan Speeds (RPM): N/A
Info:
  Processes: 403 Uptime: 0m wakeups: 74 Init: systemd v: 251
  default: graphical tool: systemctl Compilers: gcc: 12.2.0 clang: 14.0.6
  Packages: 1566 pm: dpkg pkgs: 0 pm: pacman pkgs: 1532 libs: 453
  tools: gnome-software,pamac pm: flatpak pkgs: 34 Shell: Zsh v: 5.9

and of sudo dmesg | grep "nvme" / dmesg from last resume.


[    0.000000] Command line: BOOT_IMAGE=/boot/vmlinuz-6.0-x86_64 root=UUID=d84f8e09-b03e-42a6-a845-c98770ea60be rw quiet cryptdevice=UUID=a***8748:luks-a***8748 root=/dev/mapper/luks-a***8748 splash apparmor=1 security=apparmor resume=/dev/mapper/luks-4***cba1 udev.log_priority=3 nvme_core.default_ps_max_latency_us=100
[    0.040998] Kernel command line: BOOT_IMAGE=/boot/vmlinuz-6.0-x86_64 root=UUID=d84f8e09-b03e-42a6-a845-c98770ea60be rw quiet cryptdevice=UUID=a***8748:luks-a***8748 root=/dev/mapper/luks-a***8748 splash apparmor=1 security=apparmor resume=/dev/mapper/luks-4***cba1 udev.log_priority=3 nvme_core.default_ps_max_latency_us=100
[    0.657789] nvme 0000:03:00.0: platform quirk: setting simple suspend
[    0.657830] nvme nvme0: pci function 0000:03:00.0
[    0.664721] nvme nvme0: missing or invalid SUBNQN field.
[    0.664746] nvme nvme0: Shutdown timeout set to 10 seconds
[    0.666248] nvme nvme0: 8/0/0 default/read/poll queues
[    0.666588] nvme 0000:05:00.0: platform quirk: setting simple suspend
[    0.666644] nvme nvme1: pci function 0000:05:00.0
[    0.668050]  nvme0n1: p1 p2
[    0.708226] nvme nvme1: 16/0/0 default/read/poll queues
[    0.712685]  nvme1n1: p1 p2 p3
[   10.191858] EXT4-fs (nvme0n1p2): recovery complete
[   10.191869] EXT4-fs (nvme0n1p2): mounted filesystem with ordered data mode. Quota mode: none.
[ 1419.666856] nvme nvme1: 16/0/0 default/read/poll queues
[ 1450.156008] nvme nvme0: Device not ready; aborting reset, CSTS=0x3
[ 1450.156018] nvme nvme0: Removing after probe failure status: -19
[ 1450.181631] nvme0n1: detected capacity change from 3907029168 to 0
[ 1450.278499] EXT4-fs (nvme0n1p2): unmounting filesystem.

Resume from Suspend dmesg

[ 1421.112467] PM: suspend exit
(...)

[ 1450.156008] nvme nvme0: Device not ready; aborting reset, CSTS=0x3
[ 1450.156018] nvme nvme0: Removing after probe failure status: -19
[ 1450.181631] nvme0n1: detected capacity change from 3907029168 to 0
[ 1450.276509] EXT4-fs (dm-2): unmounting filesystem.
[ 1450.278119] Buffer I/O error on dev dm-2, logical block 0, lost sync page write
[ 1450.278125] EXT4-fs (dm-2): I/O error while writing superblock
[ 1450.278499] EXT4-fs (nvme0n1p2): unmounting filesystem.
[ 1450.281275] Buffer I/O error on dev dm-2, logical block 0, async page read
[ 1450.281280] Buffer I/O error on dev dm-2, logical block 0, async page read
[ 1450.281299] Buffer I/O error on dev dm-2, logical block 0, async page read
[ 1450.281301] Buffer I/O error on dev dm-2, logical block 0, async page read
[ 1450.287933] Buffer I/O error on dev dm-2, logical block 0, async page read
[ 1450.287938] Buffer I/O error on dev dm-2, logical block 0, async page read
[ 1450.287958] Buffer I/O error on dev dm-2, logical block 0, async page read
[ 1450.287960] Buffer I/O error on dev dm-2, logical block 0, async page read

Update 2:

I have found related posts on ArchBBS - The best solution I can see is tldr replace the SSD with a different brand:
nvme failed to set APST feature

On GentooForums, I found this, but it is over my head and I’m not sure how I would apply it in Manjaro:
SOLVED ! nvme0: missing or invalid SUBNQN field

AFAIK, you can ignore this warning, I have my two different SSDs that show the same warning as well.