Диск из RAID вылетает каждую неделю.

daga
На сайте с 01.06.2004
Offline
161
845

Всем привет!

Пора ли менять диск, если да то какой(sda?) и почему он вылетает?

Синхонизирую диски так: /sbin/mdadm /dev/md6 -r /dev/sdb8 ; /sbin/mdadm /dev/md6 -a /dev/sdb8

Personalities : [raid1] [raid0] [raid6] [raid5] [raid4] [raid10]

md0 : active raid1 sdb1[1] sda1[0]

8387520 blocks [2/2] [UU]

md1 : active raid1 sdb2[1] sda2[0]

1048512 blocks [2/2] [UU]

md3 : active raid1 sdb5[1] sda5[0]

20971392 blocks [2/2] [UU]

md4 : active raid1 sdb6[1] sda6[0]

104857472 blocks [2/2] [UU]

md5 : active raid1 sdb7[1] sda7[0]

52428672 blocks [2/2] [UU]

md6 : active raid1 sdb8[2] sda8[0]

534392704 blocks [2/1] [U_]

[=>...................] recovery = 5.0% (26853952/534392704) finish=151.0min speed=56011K/sec

md2 : active raid1 sdb3[1] sda3[0]

10485696 blocks [2/2] [UU]

============================

Информация по дискам sda и sdb соответственно, не SSD.

============================

ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE

1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 3

3 Spin_Up_Time 0x0027 182 178 021 Pre-fail Always - 3891

4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 19

5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0

7 Seek_Error_Rate 0x002e 200 200 000 Old_age Always - 0

9 Power_On_Hours 0x0032 021 021 000 Old_age Always - 58160

10 Spin_Retry_Count 0x0032 100 253 000 Old_age Always - 0

11 Calibration_Retry_Count 0x0032 100 253 000 Old_age Always - 0

12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 17

192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 16

193 Load_Cycle_Count 0x0032 200 200 000 Old_age Always - 2

194 Temperature_Celsius 0x0022 096 088 000 Old_age Always - 51

196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0

197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0

198 Offline_Uncorrectable 0x0030 200 200 000 Old_age Offline - 0

199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0

200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age Offline - 0

ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE

1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 4

3 Spin_Up_Time 0x0027 183 180 021 Pre-fail Always - 3825

4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 21

5 Reallocated_Sector_Ct 0x0033 198 198 140 Pre-fail Always - 13

7 Seek_Error_Rate 0x002e 200 200 000 Old_age Always - 0

9 Power_On_Hours 0x0032 021 021 000 Old_age Always - 58151

10 Spin_Retry_Count 0x0032 100 253 000 Old_age Always - 0

11 Calibration_Retry_Count 0x0032 100 253 000 Old_age Always - 0

12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 19

192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 16

193 Load_Cycle_Count 0x0032 200 200 000 Old_age Always - 4

194 Temperature_Celsius 0x0022 096 088 000 Old_age Always - 51

196 Reallocated_Event_Count 0x0032 187 187 000 Old_age Always - 13

197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 1

198 Offline_Uncorrectable 0x0030 200 200 000 Old_age Offline - 1

199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0

200 Multi_Zone_Error_Rate 0x0008 199 199 000 Old_age Offline - 236

Облачный хостинг, официальный регистратор доменов в Украине. За прогон сайта, проведу видео-сессию, трансформирующую сознание:)
L
На сайте с 13.01.2011
Offline
125
#1

дискам около 7 лет - пора с них сваливать. по моему диски живут в среднем 5-6-7-8 лет +-

Контакты-icq 535609 ()
S
На сайте с 08.06.2018
Offline
84
#2

Вы бы смарты показали бы лучше

daga
На сайте с 01.06.2004
Offline
161
#3

Что именно показать?

И как Вы увидели что 7 лет...

Вот еще информация, на что обратить внимание.

=============SDA==========================

SMART overall-health self-assessment test result: PASSED

General SMART Values:

Offline data collection status: (0x84) Offline data collection activity

was suspended by an interrupting command from host.

Auto Offline Data Collection: Enabled.

Self-test execution status: ( 0) The previous self-test routine completed

without error or no self-test has ever

been run.

Total time to complete Offline

data collection: (12360) seconds.

Offline data collection

capabilities: (0x7b) SMART execute Offline immediate.

Auto Offline data collection on/off support.

Suspend Offline collection upon new

command.

Offline surface scan supported.

Self-test supported.

Conveyance Self-test supported.

Selective Self-test supported.

SMART capabilities: (0x0003) Saves SMART data before entering

power-saving mode.

Supports SMART auto save timer.

Error logging capability: (0x01) Error logging supported.

General Purpose Logging supported.

Short self-test routine

recommended polling time: ( 2) minutes.

Extended self-test routine

recommended polling time: ( 145) minutes.

Conveyance self-test routine

recommended polling time: ( 5) minutes.

SCT capabilities: (0x3037) SCT Status supported.

SCT Feature Control supported.

SCT Data Table supported.

SMART Attributes Data Structure revision number: 16

..................................

SMART Error Log Version: 1

No Errors Logged

SMART Self-test log structure revision number 1

No self-tests have been logged. [To run self-tests, use: smartctl -t]

SMART Selective self-test log data structure revision number 1

SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS

1 0 0 Not_testing

2 0 0 Not_testing

3 0 0 Not_testing

4 0 0 Not_testing

5 0 0 Not_testing

Selective self-test flags (0x0):

==============SDB=====================

SMART overall-health self-assessment test result: PASSED

General SMART Values:

Offline data collection status: (0x84) Offline data collection activity

was suspended by an interrupting command from host.

Auto Offline Data Collection: Enabled.

Self-test execution status: ( 0) The previous self-test routine completed

without error or no self-test has ever

been run.

Total time to complete Offline

data collection: (12480) seconds.

Offline data collection

capabilities: (0x7b) SMART execute Offline immediate.

Auto Offline data collection on/off support.

Suspend Offline collection upon new

command.

Offline surface scan supported.

Self-test supported.

Conveyance Self-test supported.

Selective Self-test supported.

SMART capabilities: (0x0003) Saves SMART data before entering

power-saving mode.

Supports SMART auto save timer.

Error logging capability: (0x01) Error logging supported.

General Purpose Logging supported.

Short self-test routine

recommended polling time: ( 2) minutes.

Extended self-test routine

recommended polling time: ( 146) minutes.

Conveyance self-test routine

recommended polling time: ( 5) minutes.

SCT capabilities: (0x3037) SCT Status supported.

SCT Feature Control supported.

SCT Data Table supported.

SMART Attributes Data Structure revision number: 16

..........................................

SMART Error Log Version: 1

No Errors Logged

SMART Self-test log structure revision number 1

Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error

# 1 Extended offline Completed without error 00% 14 -

# 2 Short offline Completed without error 00% 5 -

SMART Selective self-test log data structure revision number 1

SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS

1 0 0 Not_testing

2 0 0 Not_testing

3 0 0 Not_testing

4 0 0 Not_testing

5 0 0 Not_testing

Selective self-test flags (0x0):

After scanning selected spans, do NOT read-scan remainder of disk.

If Selective self-test is pending on power-up, resume after 0 minute delay.

S
На сайте с 08.06.2018
Offline
84
#4

ну не хотите скидывать, сами хотя бы посмотрите смарты, вот тут все описано, посмотрите утилиту Hard Disk Sentinel и smartmontools

LEOnidUKG
На сайте с 25.11.2006
Offline
1723
#5
И как Вы увидели что 7 лет...

9 Power_On_Hours 0x0032 021 021 000 Old_age Always - 58160

Где-то 6,6 лет им.

✅ Мой Телеграм канал по SEO, оптимизации сайтов и серверов: https://t.me/leonidukgLIVE ✅ Качественное и рабочее размещение SEO статей СНГ и Бурж: https://getmanylinks.ru/
daga
На сайте с 01.06.2004
Offline
161
#6

В первом посте выложил смарт параметры командой /usr/sbin/smartctl -a /dev/sda

Какие смарт параметры хотите увидеть, какой командой?🍿

L
На сайте с 13.01.2011
Offline
125
#7

ясно что сыпется sdb

5 Reallocated_Sector_Ct 0x0033 198 198 140 Pre-fail Always - 13

его и замените

Авторизуйтесь или зарегистрируйтесь, чтобы оставить комментарий