Hi, everyone!
Note to future readers: This turned out to be a flaky PCI card that decided it didn’t want to work. After sitting powered off overnight, it started working again.
I’m looking for some help troubleshooting a problem that surfaced after the 2021-01-19 update (which I applied a few days ago). My system has one NIC built into the motherboard, and a PCI card providing four more, which I use for work-related VMs. After this update, my PCI card is no longer recognized.
The igb
module drives both—or at least it used to.
Here’s a snip from the kernel logs in a previous boot, in which the card was properly detected.
Previous Boot
[carl@kotoko ~]$ journalctl -k -b -2 | grep igb
Jan 27 08:00:28 kotoko kernel: igb: Intel(R) Gigabit Ethernet Network Driver - version 5.6.0-k
Jan 27 08:00:28 kotoko kernel: igb: Copyright (c) 2007-2014 Intel Corporation.
Jan 27 08:00:28 kotoko kernel: igb 0000:05:00.0: enabling device (0000 -> 0002)
Jan 27 08:00:28 kotoko kernel: igb 0000:05:00.0: added PHC on eth0
Jan 27 08:00:28 kotoko kernel: igb 0000:05:00.0: Intel(R) Gigabit Ethernet Network Connection
Jan 27 08:00:28 kotoko kernel: igb 0000:05:00.0: eth0: (PCIe:5.0Gb/s:Width x1) 00:1b:21:d3:86:20
Jan 27 08:00:28 kotoko kernel: igb 0000:05:00.0: eth0: PBA No: Unknown
Jan 27 08:00:28 kotoko kernel: igb 0000:05:00.0: Using MSI-X interrupts. 8 rx queue(s), 8 tx queue(s)
[... 3 more entries like this, which are the add-in card ...]
Jan 27 08:00:29 kotoko kernel: igb 0000:06:00.0: added PHC on eth4
Jan 27 08:00:29 kotoko kernel: igb 0000:06:00.0: Intel(R) Gigabit Ethernet Network Connection
Jan 27 08:00:29 kotoko kernel: igb 0000:06:00.0: eth4: (PCIe:2.5Gb/s:Width x1) 04:d4:c4:4a:3c:57
Jan 27 08:00:29 kotoko kernel: igb 0000:06:00.0: eth4: PBA No: FFFFFF-0FF
Jan 27 08:00:29 kotoko kernel: igb 0000:06:00.0: Using MSI-X interrupts. 2 rx queue(s), 2 tx queue(s)
[The above 5 lines are for the on-board NIC.]
Currently, the log shows this instead:
Current Boot
[carl@kotoko ~]$ journalctl -k | grep igb
Jan 31 11:08:34 kotoko kernel: igb: Intel(R) Gigabit Ethernet Network Driver - version 5.6.0-k
Jan 31 11:08:34 kotoko kernel: igb: Copyright (c) 2007-2014 Intel Corporation.
Jan 31 11:08:34 kotoko kernel: igb 0000:06:00.0: added PHC on eth0
Jan 31 11:08:34 kotoko kernel: igb 0000:06:00.0: Intel(R) Gigabit Ethernet Network Connection
Jan 31 11:08:34 kotoko kernel: igb 0000:06:00.0: eth0: (PCIe:2.5Gb/s:Width x1) 04:d4:c4:4a:3c:57
Jan 31 11:08:34 kotoko kernel: igb 0000:06:00.0: eth0: PBA No: FFFFFF-0FF
Jan 31 11:08:34 kotoko kernel: igb 0000:06:00.0: Using MSI-X interrupts. 2 rx queue(s), 2 tx queue(s)
Jan 31 11:08:35 kotoko kernel: igb 0000:06:00.0 enp6s0: renamed from eth0
Jan 31 11:08:43 kotoko kernel: igb 0000:06:00.0 enp6s0: igb: enp6s0 NIC Link is Up 1000 Mbps Full Duplex, Flow Control: RX
Jan 31 11:08:47 kotoko kernel: igb 0000:06:00.0 enp6s0: igb: enp6s0 NIC Link is Up 1000 Mbps Full Duplex, Flow Control: RX
Considering the igb
version is the same, I can only assume something has changed so that igb
no longer recognizes the card. Searching kernel log for the PCI device identifier (if that’s the correct term) shows that it was detected with the previous kernel but is not now:
Kernel log searches
journalctl -k -b -2 | grep "0000:05:00"
Jan 27 08:00:28 kotoko kernel: pci 0000:05:00.0: [8086:150e] type 00 class 0x020000
Jan 27 08:00:28 kotoko kernel: pci 0000:05:00.0: reg 0x10: [mem 0xfc500000-0xfc57ffff]
Jan 27 08:00:28 kotoko kernel: pci 0000:05:00.0: reg 0x1c: [mem 0xfc58c000-0xfc58ffff]
Jan 27 08:00:28 kotoko kernel: pci 0000:05:00.0: reg 0x30: [mem 0xfc480000-0xfc4fffff pref]
Jan 27 08:00:28 kotoko kernel: pci 0000:05:00.0: PME# supported from D0 D3hot D3cold
Jan 27 08:00:28 kotoko kernel: pci 0000:05:00.0: 4.000 Gb/s available PCIe bandwidth, limited by 5 GT/s x1 link at 0000:03:01.0 (capable of 16.000 Gb/s with 5 GT/s x4 link)
journalctl -k | grep "0000:05"
(Nothing returned.)
Using modprobe -r igb
and then reloading it does not produce any different results. At this point I have run out of ideas.
Here are my system details, in case that’s useful. Although I’m booted into 5.10 at this moment (as a troubleshooting step) I normally run 5.4 as shown below:
System Info
System: Host: kotoko Kernel: 5.4.89-1-MANJARO x86_64 bits: 64 Desktop: KDE Plasma 5.20.5 Distro: Manjaro Linux
Machine: Type: Desktop Mobo: ASUSTeK model: ROG CROSSHAIR VII HERO v: Rev 1.xx serial: <superuser required>
UEFI: American Megatrends v: 2501 date: 07/12/2019
CPU: Info: 8-Core model: AMD Ryzen 7 2700X bits: 64 type: MT MCP L2 cache: 4 MiB
Speed: 4098 MHz min/max: 2200/3700 MHz Core speeds (MHz): 1: 4098 2: 2063 3: 2058 4: 2060 5: 3474 6: 2088 7: 2090
8: 2079 9: 2065 10: 2062 11: 3913 12: 2060 13: 2068 14: 2082 15: 4121 16: 2054
Graphics: Device-1: Advanced Micro Devices [AMD/ATI] Ellesmere [Radeon RX 470/480/570/570X/580/580X/590] driver: amdgpu
v: kernel
Display: x11 server: X.Org 1.20.10 driver: loaded: amdgpu,ati unloaded: modesetting resolution: 2560x1440
OpenGL: renderer: Radeon RX 580 Series v: 4.6.13572 Core Profile Context
Audio: Device-1: AMD Ellesmere HDMI Audio [Radeon RX 470/480 / 570/580/590] driver: snd_hda_intel
Device-2: Advanced Micro Devices [AMD] Family 17h HD Audio driver: snd_hda_intel
Device-3: Sunplus Innovation Full HD webcam type: USB driver: snd-usb-audio,uvcvideo
Sound Server: ALSA v: k5.4.89-1-MANJARO
Network: Device-1: Intel Wireless-AC 9260 driver: iwlwifi
IF: wlp4s0 state: down mac: 7e:84:cf:70:02:b4
Device-2: Intel I211 Gigabit Network driver: igb
IF: enp6s0 state: up speed: 1000 Mbps duplex: full mac: 04:d4:c4:4a:3c:57
RAID: Device-1: datastore type: zfs status: ONLINE size: 2.72 TiB free: 1.1 TiB
Components: Online: N/A
Drives: Local Storage: total: raw: 3.18 TiB usable: 5.9 TiB used: 1011.42 GiB (16.7%)
ID-1: /dev/nvme0n1 vendor: Western Digital model: WDS500G3X0C-00SJG0 size: 465.76 GiB
ID-2: /dev/sda vendor: Crucial model: CT1000MX500SSD1 size: 931.51 GiB
ID-3: /dev/sdb vendor: Crucial model: CT1000MX500SSD1 size: 931.51 GiB
ID-4: /dev/sdc vendor: Crucial model: CT1000MX500SSD1 size: 931.51 GiB
Partition: ID-1: / size: 389.18 GiB used: 237.26 GiB (61.0%) fs: ext4 dev: /dev/nvme0n1p2
ID-2: /boot/efi size: 299.4 MiB used: 296 KiB (0.1%) fs: vfat dev: /dev/nvme0n1p1
Swap: ID-1: swap-1 type: partition size: 69.06 GiB used: 0 KiB (0.0%) dev: /dev/nvme0n1p3
Sensors: System Temperatures: cpu: 46.6 C mobo: N/A gpu: amdgpu temp: 61.0 C
Fan Speeds (RPM): N/A gpu: amdgpu fan: 785
Info: Processes: 477 Uptime: 10h 32m Memory: 62.78 GiB used: 13.16 GiB (21.0%) Shell: Bash inxi: 3.2.02
Any help will be greatly appreciated! On Monday, I will need to use this card again, and I’d prefer not to have to Timeshift the entire update away just for that.