회사 PC에서 RAIDZ로 묶어서 작업공간으로 사용하고 있는 오래된 HDD 셋이 있습니다.
이 볼륨이 나날이 상태가 안좋아지고 있는데.. 현상은 I/O가 발생할때 가끔 몇 초 정도
freeze되는 것입니다. 이 현상이 점점 자주 발생하고 있습니다. 그래서 답답한 맘에
SMART 정보를 한번 꺼내봤는데.. 감이 안와서 문의 드립니다.
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 0
3 Spin_Up_Time 0x0027 165 165 021 Pre-fail Always - 4733
4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 211
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
7 Seek_Error_Rate 0x002e 100 253 000 Old_age Always - 0
9 Power_On_Hours 0x0032 017 017 000 Old_age Always - 60780
10 Spin_Retry_Count 0x0032 100 100 000 Old_age Always - 0
11 Calibration_Retry_Count 0x0032 100 100 000 Old_age Always - 0
12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 207
192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 153
193 Load_Cycle_Count 0x0032 200 200 000 Old_age Always - 211
194 Temperature_Celsius 0x0022 105 100 000 Old_age Always - 42
196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0
197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0030 200 200 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x0032 200 199 000 Old_age Always - 6763
200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age Offline - 0
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 0
3 Spin_Up_Time 0x0027 163 162 021 Pre-fail Always - 4850
4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 212
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
7 Seek_Error_Rate 0x002e 100 253 000 Old_age Always - 0
9 Power_On_Hours 0x0032 017 017 000 Old_age Always - 60783
10 Spin_Retry_Count 0x0032 100 100 000 Old_age Always - 0
11 Calibration_Retry_Count 0x0032 100 100 000 Old_age Always - 0
12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 208
192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 153
193 Load_Cycle_Count 0x0032 200 200 000 Old_age Always - 212
194 Temperature_Celsius 0x0022 105 100 000 Old_age Always - 42
196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0
197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0030 200 200 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0
200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age Offline - 0
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 0
3 Spin_Up_Time 0x0027 176 152 021 Pre-fail Always - 4175
4 Start_Stop_Count 0x0032 098 098 000 Old_age Always - 2329
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
7 Seek_Error_Rate 0x002e 200 200 051 Old_age Always - 0
9 Power_On_Hours 0x0032 091 091 000 Old_age Always - 7001
10 Spin_Retry_Count 0x0032 100 100 051 Old_age Always - 0
11 Calibration_Retry_Count 0x0032 100 100 051 Old_age Always - 0
12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 650
192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 292
193 Load_Cycle_Count 0x0032 200 200 000 Old_age Always - 2329
194 Temperature_Celsius 0x0022 106 087 000 Old_age Always - 41
196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0
197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0030 200 200 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0
200 Multi_Zone_Error_Rate 0x0008 200 200 051 Old_age Offline - 0
여기서 어떤 값이 문제의 증상을 설명하고 있을까요?
ù° ũ UDMA_CRC ö ִµ, ϴ ̺ ¸ Ȯغž ϴ.
ġȯǰų ũ ʴ .. ׳ ð ũ ...
( ĵ ũ SMART ϴٰ ϴ... io ִٸ ü غô° Űƿ)
̳.
ִ ũ Ǻϰ ʹٸ smartctl -t long /dev/sdX ˻غ.