Does this SMART selftest indicate a failing drive?

Solution 1:

Hopefully you've long since replaced the drive, but since no one has yet directly answered the question...

You ran two tests, both of which failed to read the same logical sector of the disk, as indicated by Completed: read failure and the same LBA in both tests. This does indeed indicate the disk has a defect, and you should be able to have it replaced under warranty. Attempting to store data in this sector may or may not cause the drive to notice it's defective during the write process and remap the sector, but if the drive doesn't notice, and can't read the data later on, you've lost it.

Solution 2:

Is your data worth risking on a suspect drive?

If it were me, I'd replace the drive and be thankful that SMART saved me a big headache.


Solution 3:

I want to add to the comments in the other answer, but I can't due to lack of rep, go figure.

You don't need to make a cron script, there is a smartd daemon in the smartmontools package that handles just what you want to do: regular checking of SMART status. All you need is to create a configuration and start the service. The smartmontools package also contains some sample scripts that smartd can call when something starts failing.


Solution 4:

What will I do in your situation?

First of all I find out which files are affected. There are some instructions how to do this https://www.smartmontools.org/wiki/BadBlockHowto. Yeah. In your case it is harder because you have an array. But it is possible. Than, ensure that this file is backuped, than write zeros to the failing sector. Two things can happen.

  1. The drive successfully writes zeroes to this sector. Current_Pending_Sector, Reallocated_Sector_Ct should be zeros afterwards.
  2. The drive fails to write to this sector. Than it remaps this sector to a "spare" area.

In any case you end up with a fixed drive. You should restore your file from backup (because you overwrote one sector of it). Also you should rerun en extended self-test to ensure that there are no more errors.

Stay healthy!

P.S. I know that this post is kind of old. But I goolged it. And I think it is a good idea to provide another good answer.