Persistent Hard Drive Error Message

Hi all. I am getting a lot of these Logs. They have been present for a fair while, and are not going away like I hoped. Any ideas on how to address this. I am pretty sure it is /dev/sdc that this Manjaro system resides on - just to complicate things.

ata6.00: exception Emask 0x0 SAct 0x200802 SErr 0x50000 action 0x6 frozen
[107280.190712] ata6: SError: { PHYRdyChg CommWake }
[107280.190717] ata6.00: failed command: WRITE FPDMA QUEUED
[107280.190718] ata6.00: cmd 61/b8:08:18:9e:fe/01:00:00:00:00/40 tag 1 ncq dma 225280 out
                         res 40/00:01:00:00:00/00:00:00:00:00/e0 Emask 0x4 (timeout)
[107280.190724] ata6.00: status: { DRDY }
[107280.190726] ata6.00: failed command: READ FPDMA QUEUED
[107280.190728] ata6.00: cmd 60/08:58:38:6d:fc/00:00:0c:00:00/40 tag 11 ncq dma 4096 in
                         res 40/00:ff:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
[107280.190733] ata6.00: status: { DRDY }
[107280.190735] ata6.00: failed command: WRITE FPDMA QUEUED
[107280.190736] ata6.00: cmd 61/b8:a8:10:db:d7/00:00:06:00:00/40 tag 21 ncq dma 94208 out
                         res 40/00:01:00:00:00/00:00:00:00:00/e0 Emask 0x4 (timeout)
[107280.190741] ata6.00: status: { DRDY }
[107280.190744] ata6: hard resetting link
[107280.654069] ata6: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
[107280.675175] ata6.00: configured for UDMA/133
[107280.675624] ata6: EH complete
Filesystem           Type      Size  Used Avail Use% Mounted on
dev                  devtmpfs   16G     0   16G   0% /dev
run                  tmpfs      16G  2.1M   16G   1% /run
efivarfs             efivarfs  128K   47K   77K  38% /sys/firmware/efi/efivars
/dev/sdc2            ext4      110G   94G   11G  90% /
tmpfs                tmpfs      16G  6.7M   16G   1% /dev/shm
tmpfs                tmpfs     1.0M     0  1.0M   0% /run/credentials/systemd-journald.service
tmpfs                tmpfs      16G  2.1G   14G  14% /tmp
/dev/sdc1            vfat      300M  288K  300M   1% /boot/efi
tmpfs                tmpfs     100K     0  100K   0% /var/lib/lxd/shmounts
tmpfs                tmpfs     100K     0  100K   0% /var/lib/lxd/devlxd

It is pretty low on space.
10% and under can be considered in the danger zone ā€¦ and this is not even an extra large partition.
You should strive to free up some space on this partition/system.

To that end I may offer this;

But if there is any important information on the disk, with these errors, I would also check the SMART statusā€¦

smartctl -a /dev/sdc
2 Likes

:bangbang: Tip :bangbang:

When posting terminal output, copy the output and paste it here, wrapped in three (3) backticks, before AND after the pasted text. Like this:

```
pasted text
```

Or three (3) tilde signs, like this:

~~~
pasted text
~~~

This will just cause it to be rendered like this:

Sed
sollicitudin dolor
eget nisl elit id
condimentum
arcu erat varius
cursus sem quis eros.

Instead of like this:

Sed sollicitudin dolor eget nisl elit id condimentum arcu erat varius cursus sem quis eros.

Alternatively, paste the text you wish to format as terminal output, select all pasted text, and click the </> button on the taskbar. This will indent the whole pasted section with one TAB, causing it to render the same way as described above.

Thereby increasing legibility thus making it easier for those trying to provide assistance.

For more information, please see:

As Iā€™m a moderator on the forum, I have taken the liberty of doing this for you, this time.


:bangbang::bangbang: Additionally

If your language isnā€™t English, please prepend any and all terminal commands with LC_ALL=C. For example:

LC_ALL=C bluetoothctl

This will just cause the terminal output to be in English, making it easier to understand and debug.

Note that the above text is partially pre-prepared as a general introduction for new forum Users. Please take the time to to understand how it is done and encourage quality responses.[quote=ā€œbrm, post:1, topic:174430ā€]
failed command: WRITE FPDMA QUEUED
[107280.190718] ata6.00: cmd 61/b8:08:18:9e:fe/01:00:00:00:00/40 tag 1 ncq dma 225280 out
res 40/00:01:00:00:00/00:00:00:00:00/e0 Emask 0x4 (timeout)
[107280.190724] ata6.00: status: { DRDY }
[/quote]

1 Like
  1. SATA cable defective.
  2. Drive going dead.

I want the SMART status to be sure but it is likely.

In which case OP should stop using the disk immediately and begin backup procedures.

IE - No more booting or interacting with the disk at all except to clone.

2 Likes

And even that from a Live Environment.

1 Like

Hi, Smartctl results enclosed.

sudo smartctl -a /dev/sdc
[sudo] password for brm: 
smartctl 7.4 2023-08-01 r5530 [x86_64-linux-6.12.12-2-MANJARO] (local build)
Copyright (C) 2002-23, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     Intel 330/335 Series SSDs
Device Model:     INTEL SSDSC2CT120A3
Serial Number:    CVMP216607LR120BGN
LU WWN Device Id: 5 001517 bb28e8816
Firmware Version: 300i
User Capacity:    120,034,123,776 bytes [120 GB]
Sector Size:      512 bytes logical/physical
Rotation Rate:    Solid State Device
TRIM Command:     Available, deterministic
Device is:        In smartctl database 7.3/5528
ATA Version is:   ACS-2 T13/2015-D revision 3
SATA Version is:  SATA 3.0, 3.0 Gb/s (current: 3.0 Gb/s)
Local Time is:    Wed Feb 19 13:19:37 2025 ACDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x05) Offline data collection activity
                                        was aborted by an interrupting command from host.
                                        Auto Offline Data Collection: Disabled.
Self-test execution status:      (  35) The self-test routine was interrupted
                                        by the host with a hard or soft reset.
Total time to complete Offline 
data collection:                ( 1046) seconds.
Offline data collection
capabilities:                    (0x7f) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Abort Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine 
recommended polling time:        (   1) minutes.
Extended self-test routine
recommended polling time:        (  48) minutes.
Conveyance self-test routine
recommended polling time:        (   2) minutes.
SCT capabilities:              (0x0021) SCT Status supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  5 Reallocated_Sector_Ct   0x0032   100   100   000    Old_age   Always       -       0
  9 Power_On_Hours_and_Msec 0x0032   000   000   000    Old_age   Always       -       931780h+58m+46.170s
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       994
181 Program_Fail_Cnt_Total  0x0032   000   000   000    Old_age   Always       -       0
182 Erase_Fail_Count_Total  0x0032   000   000   000    Old_age   Always       -       0
192 Power-Off_Retract_Count 0x0032   100   100   000    Old_age   Always       -       988
225 Host_Writes_32MiB       0x0032   100   100   000    Old_age   Always       -       295016
232 Available_Reservd_Space 0x0033   100   100   010    Pre-fail  Always       -       0
233 Media_Wearout_Indicator 0x0032   100   100   000    Old_age   Always       -       0
241 Host_Writes_32MiB       0x0032   100   100   000    Old_age   Always       -       295016
242 Host_Reads_32MiB        0x0032   100   100   000    Old_age   Always       -       656244
249 NAND_Writes_1GiB        0x0013   100   100   000    Pre-fail  Always       -       11289

SMART Error Log not supported

SMART Self-test Log not supported

SMART Selective self-test log data structure revision number 0
Note: revision number not 1 implies that no selective self-test has ever been run
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

The above only provides legacy SMART information - try 'smartctl -x' for more

Hi. How do we go about cloning this disk? Can you point to any Manjaro docs?

Yeah.
I would back up everything (/anything important).
Then you can use it for a ā€˜useful but not depended onā€™ external or NAS?
Otherwise its for the recyclers.

Thanks, sounds like a plan.

that would be 106 years - unlikely

I donā€™t know about Manjaro docs, but this is just as good:

https://wiki.archlinux.org/title/Dd#Create_disk_image