[TriLUG] smartmontools - selftest fails, Health status passed. ???

lfwelty at nc.rr.com lfwelty at nc.rr.com
Thu Jan 8 14:00:21 EST 2004


Hi y'all,

I have a hdd that is showing some seemingly (to me at least)
conflicting information. smartctl's health status shows the
hdd as PASSED, but it's failing the short and long selftests
at the same place.

- relevent smartctl output below.

If the health status were FAILED and I was seeing the errors
I would definately replace the hdd. But since they're conflicting,
I'm not sure if I need to replace it.

Thanks for the help,

- Frank.

tiresias|ROOT:lfwelty-2# smartctl -a /dev/hda
smartctl version 5.1-18 Copyright (C) 2002-3 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF INFORMATION SECTION ===
Device Model:     MAXTOR 6L080J4
Serial Number:    664204750210
Firmware Version: A93.0500
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   5
ATA Standard is:  ATA/ATAPI-5 T13 1321D revision 1
Local Time is:    Thu Jan  8 13:54:32 2004 EST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Off-line data collection status: (0x00) Offline data collection activity was
                                         never started.
                                         Auto Off-line Data Collection: Disabled.
Self-test execution status:      ( 112) The previous self-test completed having
                                         the read element of the test failed.
Total time to complete off-line
data collection:                 (  35) seconds.
Offline data collection
capabilities:                    (0x1b) SMART execute Offline immediate.
                                         Automatic timer ON/OFF support.
                                         Suspend Offline collection upon new
                                         command.
                                         Offline surface scan supported.
                                         Self-test supported.
                                         No Conveyance Self-test supported.
                                         No Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                         power-saving mode.
                                         Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                         No General Purpose Logging support.
Short self-test routine
recommended polling time:        (   2) minutes.
Extended self-test routine
recommended polling time:        (  40) minutes.

SMART Attributes Data Structure revision number: 11
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED 
RAW_VALUE
   1 Raw_Read_Error_Rate     0x0029   100   253   020    Pre-fail  Offline      -       0
   3 Spin_Up_Time            0x0027   068   065   020    Pre-fail  Always       -       4092
   4 Start_Stop_Count        0x0032   100   100   008    Old_age   Always       -       162
   5 Reallocated_Sector_Ct   0x0033   099   099   020    Pre-fail  Always       -       5
   7 Seek_Error_Rate         0x000b   100   100   023    Pre-fail  Always       -       0
   9 Power_On_Hours          0x0012   079   079   001    Old_age   Always       -       14083
  10 Spin_Retry_Count        0x0026   100   100   000    Old_age   Always       -       0
  11 Calibration_Retry_Count 0x0013   100   100   020    Pre-fail  Always       -       0
  12 Power_Cycle_Count       0x0032   100   100   008    Old_age   Always       -       62
  13 Read_Soft_Error_Rate    0x000b   100   093   023    Pre-fail  Always       -       0
194 Temperature_Celsius     0x0022   086   082   042    Old_age   Always       -       37
195 Hardware_ECC_Recovered  0x001a   100   001   000    Old_age   Always       - 
99292106
196 Reallocated_Event_Count 0x0010   100   100   020    Old_age   Offline      -       0
197 Current_Pending_Sector  0x0032   100   100   020    Old_age   Always       -       3
198 Offline_Uncorrectable   0x0010   100   253   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x001a   200   200   000    Old_age   Always       -       0

SMART Error Log Version: 1
ATA Error Count: 39 (device log contains only the most recent five errors)
         CR = Command Register [HEX]
         FR = Features Register [HEX]
         SC = Sector Count Register [HEX]
         SN = Sector Number Register [HEX]
         CL = Cylinder Low Register [HEX]
         CH = Cylinder High Register [HEX]
         DH = Device/Head Register [HEX]
         DC = Device Command Register [HEX]
         ER = Error register [HEX]
         ST = Status register [HEX]
Timestamp = decimal seconds since the previous disk power-on.
Note: timestamp "wraps" after 2^32 msec = 49.710 days.

Error 39 occurred at disk power-on lifetime: 10518 hours
   When the command that caused the error occurred, the device was in an unknown state.

   After command completion occurred, registers were:
   ER ST SC SN CL CH DH
   -- -- -- -- -- -- --
   10 59 06 e9 01 0b e0

   Commands leading to the command that caused the error were:
   CR FR SC SN CL CH DH DC   Timestamp  Command/Feature_Name
   -- -- -- -- -- -- -- --   ---------  --------------------
   c8 00 08 e7 01 0b e0 0b     217.381  READ DMA
   c8 00 08 0f 02 0b e0 0b     217.381  READ DMA
   c8 00 08 07 02 0b e0 0b     217.380  READ DMA
   c8 00 08 47 1a 0b e0 00     217.380  READ DMA
   c8 00 08 ff 01 0b e0 0b     217.365  READ DMA

Error 38 occurred at disk power-on lifetime: 10335 hours
   When the command that caused the error occurred, the device was in an unknown state.

   After command completion occurred, registers were:
   ER ST SC SN CL CH DH
   -- -- -- -- -- -- --
   40 59 06 e9 01 0b e0

   Commands leading to the command that caused the error were:
   CR FR SC SN CL CH DH DC   Timestamp  Command/Feature_Name
   -- -- -- -- -- -- -- --   ---------  --------------------
   c8 00 08 e7 01 0b e0 0b      85.432  READ DMA
   c8 00 08 0f 02 0b e0 0b      85.432  READ DMA
   c8 00 08 07 02 0b e0 0b      85.431  READ DMA
   c8 00 08 47 1a 0b e0 00      85.431  READ DMA
   c8 00 08 ff 01 0b e0 0b      85.423  READ DMA

Error 37 occurred at disk power-on lifetime: 10215 hours
   When the command that caused the error occurred, the device was in an unknown state.

   After command completion occurred, registers were:
   ER ST SC SN CL CH DH
   -- -- -- -- -- -- --
   40 59 06 e9 01 0b e0

   Commands leading to the command that caused the error were:
   CR FR SC SN CL CH DH DC   Timestamp  Command/Feature_Name
   -- -- -- -- -- -- -- --   ---------  --------------------
   c8 00 08 e7 01 0b e0 0b      59.635  READ DMA
   ca 00 10 cf 00 60 e0 60      59.635  WRITE DMA
   c8 00 08 0f 02 0b e0 00      59.634  READ DMA
   ca 00 10 af 00 60 e0 60      59.634  WRITE DMA
   c8 00 08 07 02 0b e0 00      59.633  READ DMA

Error 36 occurred at disk power-on lifetime: 9762 hours
   When the command that caused the error occurred, the device was in an unknown state.

   After command completion occurred, registers were:
   ER ST SC SN CL CH DH
   -- -- -- -- -- -- --
   10 59 06 e9 01 0b e0

   Commands leading to the command that caused the error were:
   CR FR SC SN CL CH DH DC   Timestamp  Command/Feature_Name
   -- -- -- -- -- -- -- --   ---------  --------------------
   c8 00 08 e7 01 0b e0 0b     139.350  READ DMA
   c8 00 08 0f 02 0b e0 0b     139.350  READ DMA
   c8 00 08 07 02 0b e0 0b     139.350  READ DMA
   c8 00 08 47 1a 0b e0 00     139.349  READ DMA
   c8 00 08 ff 01 0b e0 0b     139.333  READ DMA

Error 35 occurred at disk power-on lifetime: 9403 hours
   When the command that caused the error occurred, the device was in an unknown state.

   After command completion occurred, registers were:
   ER ST SC SN CL CH DH
   -- -- -- -- -- -- --
   40 59 06 e9 01 0b e0

   Commands leading to the command that caused the error were:
   CR FR SC SN CL CH DH DC   Timestamp  Command/Feature_Name
   -- -- -- -- -- -- -- --   ---------  --------------------
   c8 00 08 e7 01 0b e0 0b     291.066  READ DMA
   c8 00 08 0f 02 0b e0 0b     291.059  READ DMA
   c8 00 08 07 02 0b e0 0b     291.058  READ DMA
   c8 00 08 47 1a 0b e0 00     291.058  READ DMA
   c8 00 08 ff 01 0b e0 0b     291.042  READ DMA

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours) 
LBA_of_first_error
# 1  Extended off-line   Completed: read failure       90%     14063         0x000aea3d
# 2  Short off-line      Completed: read failure       40%     14063         0x000aea3d


-- 
----------------------------------------------------------------------
   Frank Welty         |   Earth is a beta site, I just wish that damn
   lfwelty at nc.rr.com   |   pink elephant would give me my mouse back.
----------------------------------------------------------------------




More information about the TriLUG mailing list