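On my work PC I have three old HDDs pooled as RAIDZ and used as a workspace.
This volume is getting worse by the day: when there is I/O, it occasionally freezes
for a few seconds, and this is happening more and more often. Out of frustration
I pulled the SMART data, but I can't make sense of it, so I'm asking here.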
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 0
3 Spin_Up_Time 0x0027 165 165 021 Pre-fail Always - 4733
4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 211
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
7 Seek_Error_Rate 0x002e 100 253 000 Old_age Always - 0
9 Power_On_Hours 0x0032 017 017 000 Old_age Always - 60780
10 Spin_Retry_Count 0x0032 100 100 000 Old_age Always - 0
11 Calibration_Retry_Count 0x0032 100 100 000 Old_age Always - 0
12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 207
192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 153
193 Load_Cycle_Count 0x0032 200 200 000 Old_age Always - 211
194 Temperature_Celsius 0x0022 105 100 000 Old_age Always - 42
196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0
197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0030 200 200 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x0032 200 199 000 Old_age Always - 6763
200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age Offline - 0
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 0
3 Spin_Up_Time 0x0027 163 162 021 Pre-fail Always - 4850
4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 212
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
7 Seek_Error_Rate 0x002e 100 253 000 Old_age Always - 0
9 Power_On_Hours 0x0032 017 017 000 Old_age Always - 60783
10 Spin_Retry_Count 0x0032 100 100 000 Old_age Always - 0
11 Calibration_Retry_Count 0x0032 100 100 000 Old_age Always - 0
12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 208
192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 153
193 Load_Cycle_Count 0x0032 200 200 000 Old_age Always - 212
194 Temperature_Celsius 0x0022 105 100 000 Old_age Always - 42
196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0
197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0030 200 200 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0
200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age Offline - 0
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 0
3 Spin_Up_Time 0x0027 176 152 021 Pre-fail Always - 4175
4 Start_Stop_Count 0x0032 098 098 000 Old_age Always - 2329
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
7 Seek_Error_Rate 0x002e 200 200 051 Old_age Always - 0
9 Power_On_Hours 0x0032 091 091 000 Old_age Always - 7001
10 Spin_Retry_Count 0x0032 100 100 051 Old_age Always - 0
11 Calibration_Retry_Count 0x0032 100 100 051 Old_age Always - 0
12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 650
192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 292
193 Load_Cycle_Count 0x0032 200 200 000 Old_age Always - 2329
194 Temperature_Celsius 0x0022 106 087 000 Old_age Always - 41
196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0
197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0030 200 200 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0
200 Multi_Zone_Error_Rate 0x0008 200 200 051 Old_age Offline - 0
Which of these values explains the symptom?
Also, the UDMA_CRC value on the first disk is somewhat elevated, so I think you should check the cable connection first.
Since there is no significant problem with reallocated or pending sectors, this may simply be the aging of disks with many power-on hours...
(Aged disks like these can look fine in SMART and still die without warning one day... If you are seeing I/O delays, it would be a good idea to consider replacing them.)
Other than the value above, nothing looks problematic.
If you want to actively identify which disk is at fault, run a test with the smartctl -t long /dev/sdX command.
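As a rough illustration of the reading given in the replies (a non-zero UDMA_CRC count on the first disk, zero reallocated and pending sectors), here is a minimal Python sketch that scans pasted attribute rows and reports the watched attributes with a non-zero raw value. The function name `flag_attributes` and the choice of watched IDs are assumptions made for this example, not part of smartmontools.

```python
# Attribute IDs singled out in this thread: reallocated (5), pending (197),
# offline-uncorrectable (198) sectors and interface CRC errors (199).
# This selection is an assumption for the example, not an official list.
WATCHED_IDS = {5, 197, 198, 199}


def flag_attributes(table_text):
    """Return {attribute_name: raw_value} for watched attributes whose raw value is non-zero."""
    flagged = {}
    for line in table_text.splitlines():
        fields = line.split()
        # A data row has at least 10 columns and starts with a numeric ID;
        # header lines ("ID# ATTRIBUTE_NAME ...") are skipped here.
        if len(fields) < 10 or not fields[0].isdigit():
            continue
        attr_id, name, raw = int(fields[0]), fields[1], fields[9]
        if attr_id in WATCHED_IDS and raw.isdigit() and int(raw) > 0:
            flagged[name] = int(raw)
    return flagged


# A few rows copied from the first disk's table in the question.
disk1_rows = """\
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
9 Power_On_Hours 0x0032 017 017 000 Old_age Always - 60780
197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0
199 UDMA_CRC_Error_Count 0x0032 200 199 000 Old_age Always - 6763
"""

print(flag_attributes(disk1_rows))  # {'UDMA_CRC_Error_Count': 6763}
```

Feeding it the second and third tables should return an empty dict, which matches the observation that only the first disk's CRC counter stands out.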
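If you would rather script this for all three disks, something like the following Python sketch may help. It only wraps the `smartctl -t long` command mentioned above plus `smartctl -l selftest` for reading the results afterwards. The device names are hypothetical placeholders, the helper names are made up for this example, and `dry_run` defaults to `True` so nothing is executed unless you opt in (running smartctl normally requires root).

```python
import subprocess


def long_test_commands(device):
    """Build the smartctl invocations: start a long self-test, then read the self-test log."""
    return [
        ["smartctl", "-t", "long", device],      # start the extended self-test
        ["smartctl", "-l", "selftest", device],  # show results once the test has finished
    ]


def start_long_tests(devices, dry_run=True):
    """Start a long self-test on each device; with dry_run=True only return the command strings."""
    started = []
    for dev in devices:
        start_cmd = long_test_commands(dev)[0]
        if not dry_run:
            subprocess.run(start_cmd)  # the test itself runs inside the drive, in the background
        started.append(" ".join(start_cmd))
    return started


# Hypothetical device names for the three RAIDZ members; substitute your own.
print(start_long_tests(["/dev/sda", "/dev/sdb", "/dev/sdc"]))
```

A long test on an old, large disk can take hours, so the log command is kept separate and is meant to be run later.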