after the last stable update I started getting those messages that I have never seen before:
aug 30 07:58:58 Zen kernel: pcieport 0000:00:01.3: AER: Corrected error received: 0000:00:00.0
aug 30 07:58:58 Zen kernel: pcieport 0000:00:01.3: AER: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Receiver ID)
aug 30 07:58:58 Zen kernel: pcieport 0000:00:01.3: AER: device [1022:1453] error status/mask=00000080/00006000
aug 30 07:58:58 Zen kernel: pcieport 0000:00:01.3: AER: [ 7] BadDLLP
aug 30 09:00:38 Zen kernel: pcieport 0000:00:01.3: AER: Corrected error received: 0000:00:00.0
aug 30 09:00:38 Zen kernel: pcieport 0000:00:01.3: AER: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
aug 30 09:00:38 Zen kernel: pcieport 0000:00:01.3: AER: device [1022:1453] error status/mask=00001000/00006000
aug 30 09:00:38 Zen kernel: pcieport 0000:00:01.3: AER: [12] Timeout
aug 30 09:27:28 Zen kernel: pcieport 0000:00:01.3: AER: Corrected error received: 0000:00:00.0
aug 30 09:27:28 Zen kernel: pcieport 0000:00:01.3: AER: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
aug 30 09:27:28 Zen kernel: pcieport 0000:00:01.3: AER: device [1022:1453] error status/mask=00001000/00006000
aug 30 09:27:28 Zen kernel: pcieport 0000:00:01.3: AER: [12] Timeout
aug 30 10:44:33 Zen kernel: pcieport 0000:00:01.3: AER: Corrected error received: 0000:00:00.0
aug 30 10:44:33 Zen kernel: pcieport 0000:00:01.3: AER: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Receiver ID)
aug 30 10:44:33 Zen kernel: pcieport 0000:00:01.3: AER: device [1022:1453] error status/mask=00000080/00006000
aug 30 10:44:33 Zen kernel: pcieport 0000:00:01.3: AER: [ 7] BadDLLP
aug 30 10:59:10 Zen kernel: pcieport 0000:00:01.3: AER: Corrected error received: 0000:00:00.0
aug 30 10:59:10 Zen kernel: pcieport 0000:00:01.3: AER: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Receiver ID)
aug 30 10:59:10 Zen kernel: pcieport 0000:00:01.3: AER: device [1022:1453] error status/mask=00000080/00006000
aug 30 10:59:10 Zen kernel: pcieport 0000:00:01.3: AER: [ 7] BadDLLP
aug 30 11:56:16 Zen kernel: pcieport 0000:00:01.3: AER: Corrected error received: 0000:00:00.0
aug 30 11:56:16 Zen kernel: pcieport 0000:00:01.3: AER: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Receiver ID)
aug 30 11:56:16 Zen kernel: pcieport 0000:00:01.3: AER: device [1022:1453] error status/mask=00000080/00006000
aug 30 11:56:16 Zen kernel: pcieport 0000:00:01.3: AER: [ 7] BadDLLP
aug 30 13:19:16 Zen kernel: pcieport 0000:00:01.3: AER: Corrected error received: 0000:00:00.0
aug 30 13:19:16 Zen kernel: pcieport 0000:00:01.3: AER: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Receiver ID)
aug 30 13:19:16 Zen kernel: pcieport 0000:00:01.3: AER: device [1022:1453] error status/mask=00000080/00006000
aug 30 13:19:16 Zen kernel: pcieport 0000:00:01.3: AER: [ 7] BadDLLP
aug 30 13:46:40 Zen kernel: pcieport 0000:00:01.3: AER: Corrected error received: 0000:00:00.0
aug 30 13:46:40 Zen kernel: pcieport 0000:00:01.3: AER: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
aug 30 13:46:40 Zen kernel: pcieport 0000:00:01.3: AER: device [1022:1453] error status/mask=00001000/00006000
aug 30 13:46:40 Zen kernel: pcieport 0000:00:01.3: AER: [12] Timeout
Although they are “corrected” I would at least like to know, what is the hardware piece being corrected.
the pcieport info I got from `lspci -vvv’:
But what is this device [1022:1453] and how to find out?
And is this yet another actual hardware problem is there something wrong with drivers or such (given, then “problem” started with update)?
I guess, you must have at least 2 PCIE devices plugged in. If one of them are working on a lower rate and the others are working normally on a higher rate than the other devices are fitting to the lower one. This is displayed somehow as error here which gets corrected. Due the UEFI implementation the kernel does this this more or less nicely.
I think thats the reason. Maybe i am wrong?
The kernel option pci=nommconf disables Memory-Mapped PCI Configuration Space, which is available in Linux since kernel 2.6. Very roughly, all PCI devices have an area that describe this device (which you see with lspci -vv ), and the originally method to access this area involves going through I/O ports, while PCIe allows this space to be mapped to memory for simpler access.
That means in this particular case, something goes wrong when the PCIe controller uses this method to access the configuraton space of a particular device. It may be a hardware bug in the device, in the PCIe root controller on the motherboard, in the specific interaction of those two, or something else.
By using pci=nommconf , the configuration space of all devices will be accessed in the original way, and changing the access methods works around this problem. So if you want, it’s both resolving and suppressing it.
Sadly it isn’t. Upgraded bios/uefi to last that the vendor doesn’t warn against using with first gen Ryzen.
added. will see if this fixes something.
But still puzzling, how this started with stable update… can kernel or something mess this up? Or the problem might have been there all along and now just somehow became “more visible”?
*ASRock do NOT recommend updating this BIOS if Pinnacle, Raven, Summit or Bristol Ridge CPU is being used on your system.
Have Ryzen 1700 => “Summit Ridge”. Basically the BIOS updates beyond and including 6.00 are for Ryzen 3000 series only.
Found another thing to try out in level1techs forum. Will swap soon the pci=nommconf for pci_aspm=off and see if this also works.
Although it has been errorless for about 7 hours with pci=nommconf, many say it may impact performance in some way or another. And I can always come back to this when the pcie_aspm=off doesn’t work.
edit: and wtf is “you can’t add links to your posts” ??? why I am blocked from adding sources?