Who uses hetzner.de? - part 5

L
Member since 23.09.2012
Offline
13
#331
1992bavard1992:
What didn't you like about fastvps, Lopas? I'm asking because I don't know; as far as I can tell everything is fine with them.

Read the forums; Yandex is your friend.

M
Member since 24.10.2011
Offline
173
#332

Got a server with a 240 GB SSD, an Intel 520.

P.S. Does anyone need an EX4 with the Flexi-Pack and an SSD? It's paid through 29.10. Both disks need replacing, and I can't be bothered to deal with it.

P
Member since 16.03.2009
Offline
144
#333

Hard drives can be a lot of "fun" from time to time. Yesterday I found this in the log:

Oct 15 17:43:37 host3 kernel: [6305083.627438] ata1.00: exception Emask 0x0 SAct 0x7f SErr 0x0 action 0x0
Oct 15 17:43:37 host3 kernel: [6305083.628419] ata1.00: irq_stat 0x40000008
Oct 15 17:43:37 host3 kernel: [6305083.629392] ata1.00: failed command: READ FPDMA QUEUED
Oct 15 17:43:37 host3 kernel: [6305083.630370] ata1.00: cmd 60/10:00:98:17:81/00:00:d7:00:00/40 tag 0 ncq 8192 in
Oct 15 17:43:37 host3 kernel: [6305083.630372] res 41/40:10:98:17:81/00:00:d7:00:00/00 Emask 0x409 (media error) <F>
Oct 15 17:43:37 host3 kernel: [6305083.632130] ata1.00: status: { DRDY ERR }
Oct 15 17:43:37 host3 kernel: [6305083.632971] ata1.00: error: { UNC }
Oct 15 17:43:37 host3 kernel: [6305083.646971] ata1.00: configured for UDMA/133
Oct 15 17:43:37 host3 kernel: [6305083.647819] ata1: EH complete
Oct 15 17:43:42 host3 kernel: [6305087.783449] ata1.00: exception Emask 0x0 SAct 0x58 SErr 0x0 action 0x0
Oct 15 17:43:42 host3 kernel: [6305087.784388] ata1.00: irq_stat 0x40000008
Oct 15 17:43:42 host3 kernel: [6305087.785311] ata1.00: failed command: READ FPDMA QUEUED
Oct 15 17:43:42 host3 kernel: [6305087.786574] ata1.00: cmd 60/10:30:98:17:81/00:00:d7:00:00/40 tag 6 ncq 8192 in
Oct 15 17:43:42 host3 kernel: [6305087.786576] res 41/40:10:a0:17:81/00:00:d7:00:00/00 Emask 0x409 (media error) <F>
Oct 15 17:43:42 host3 kernel: [6305087.788357] ata1.00: status: { DRDY ERR }
Oct 15 17:43:42 host3 kernel: [6305087.789278] ata1.00: error: { UNC }
Oct 15 17:43:42 host3 kernel: [6305087.852357] ata1.00: configured for UDMA/133
Oct 15 17:43:42 host3 kernel: [6305087.853145] ata1: EH complete
Oct 15 17:43:45 host3 kernel: [6305090.713954] ata1.00: exception Emask 0x0 SAct 0x7 SErr 0x0 action 0x0
Oct 15 17:43:45 host3 kernel: [6305090.715023] ata1.00: irq_stat 0x40000008
Oct 15 17:43:45 host3 kernel: [6305090.716063] ata1.00: failed command: READ FPDMA QUEUED
Oct 15 17:43:45 host3 kernel: [6305090.717111] ata1.00: cmd 60/10:00:98:17:81/00:00:d7:00:00/40 tag 0 ncq 8192 in
Oct 15 17:43:45 host3 kernel: [6305090.717112] res 41/40:10:a0:17:81/00:00:d7:00:00/00 Emask 0x409 (media error) <F>
Oct 15 17:43:45 host3 kernel: [6305090.719196] ata1.00: status: { DRDY ERR }
Oct 15 17:43:45 host3 kernel: [6305090.720003] ata1.00: error: { UNC }
Oct 15 17:43:45 host3 kernel: [6305090.778590] ata1.00: configured for UDMA/133
Oct 15 17:43:45 host3 kernel: [6305090.779311] ata1: EH complete

They replaced the SATA cable. OK, I thought, now everything will be fine.

Then in the afternoon, out of the blue:

Oct 16 14:01:24 host3 shutdown [248151]: shutting down for system reboot
Oct 16 14:05:49 host3 shutdown [19907]: shutting down for system reboot

And then in the evening:

Oct 16 23:27:19 host3 kernel: [32397.984036] ata1.00: exception Emask 0x0 SAct 0x7fffff SErr 0x0 action 0x0
Oct 16 23:27:19 host3 kernel: [32397.984636] ata1.00: irq_stat 0x40000008
Oct 16 23:27:19 host3 kernel: [32397.985226] ata1.00: failed command: READ FPDMA QUEUED
Oct 16 23:27:19 host3 kernel: [32397.985802] ata1.00: cmd 60/a0:18:b0:23:81/00:00:d7:00:00/40 tag 3 ncq 81920 in
Oct 16 23:27:19 host3 kernel: [32397.985804] res 41/40:a0:b0:23:81/00:00:d7:00:00/00 Emask 0x409 (media error) <F>
Oct 16 23:27:19 host3 kernel: [32397.986965] ata1.00: status: { DRDY ERR }
Oct 16 23:27:19 host3 kernel: [32397.987586] ata1.00: error: { UNC }
Oct 16 23:27:19 host3 kernel: [32398.052825] ata1.00: configured for UDMA/133
Oct 16 23:27:19 host3 kernel: [32398.053504] ata1: EH complete
Oct 16 23:27:24 host3 kernel: [32402.771121] ata1.00: exception Emask 0x0 SAct 0x1bffee SErr 0x0 action 0x0
Oct 16 23:27:24 host3 kernel: [32402.771736] ata1.00: irq_stat 0x40000008
Oct 16 23:27:24 host3 kernel: [32402.772349] ata1.00: failed command: READ FPDMA QUEUED
Oct 16 23:27:24 host3 kernel: [32402.772935] ata1.00: cmd 60/a0:98:b0:23:81/00:00:d7:00:00/40 tag 19 ncq 81920 in
Oct 16 23:27:24 host3 kernel: [32402.772936] res 41/40:a0:c0:23:81/00:00:d7:00:00/00 Emask 0x409 (media error) <F>
Oct 16 23:27:24 host3 kernel: [32402.774096] ata1.00: status: { DRDY ERR }
Oct 16 23:27:24 host3 kernel: [32402.774675] ata1.00: error: { UNC }
Oct 16 23:27:24 host3 kernel: [32402.909861] ata1.00: configured for UDMA/133
Oct 16 23:27:24 host3 kernel: [32402.910479] ata1: EH complete
Oct 16 23:27:27 host3 kernel: [32405.879173] ata1.00: exception Emask 0x0 SAct 0x1ffff SErr 0x0 action 0x0
Oct 16 23:27:27 host3 kernel: [32405.879888] ata1.00: irq_stat 0x40000008
Oct 16 23:27:27 host3 kernel: [32405.880610] ata1.00: failed command: READ FPDMA QUEUED
Oct 16 23:27:27 host3 kernel: [32405.881394] ata1.00: cmd 60/a0:08:b0:23:81/00:00:d7:00:00/40 tag 1 ncq 81920 in
Oct 16 23:27:27 host3 kernel: [32405.881396] res 41/40:a0:c0:23:81/00:00:d7:00:00/00 Emask 0x409 (media error) <F>
Oct 16 23:27:27 host3 kernel: [32405.882848] ata1.00: status: { DRDY ERR }
Oct 16 23:27:27 host3 kernel: [32405.883646] ata1.00: error: { UNC }
Oct 16 23:27:27 host3 kernel: [32405.951255] ata1.00: configured for UDMA/133
Oct 16 23:27:27 host3 kernel: [32405.951932] ata1: EH complete

Now they are offering to run a full check of the whole server (a 10-14 hour test).
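
One way to compare the two incidents is to decode the sector address from the `cmd`/`res` lines. The sketch below assumes the usual libata taskfile layout (command/feature:nsect:lbal:lbam:lbah/hob_feature:hob_nsect:hob_lbal:hob_lbam:hob_lbah/device); the function name `taskfile_lba` is made up for illustration. Under that assumption the Oct 15 command starts at LBA 0xd7811798 and the Oct 16 one at 0xd78123b0, roughly 3,000 sectors apart, which would fit a bad area on the disk surface better than a cable fault. That is a reading of the numbers, not a diagnosis.

```python
import re

# Assumed layout of a libata "cmd"/"res" taskfile dump (not stated in the thread):
# command/feature:nsect:lbal:lbam:lbah/hob_feature:hob_nsect:hob_lbal:hob_lbam:hob_lbah/device
TASKFILE_RE = re.compile(
    r"\b(?:cmd|res) ([0-9a-f]{2})"
    r"/((?:[0-9a-f]{2}:){4}[0-9a-f]{2})"
    r"/((?:[0-9a-f]{2}:){4}[0-9a-f]{2})"
    r"/([0-9a-f]{2})"
)

def taskfile_lba(line):
    """Return the 48-bit LBA encoded in a libata cmd/res log line, or None."""
    m = TASKFILE_RE.search(line)
    if m is None:
        return None
    low = [int(b, 16) for b in m.group(2).split(":")]   # feature, nsect, lbal, lbam, lbah
    high = [int(b, 16) for b in m.group(3).split(":")]  # hob_feature, hob_nsect, hob_lbal, hob_lbam, hob_lbah
    lbal, lbam, lbah = low[2:5]
    hob_lbal, hob_lbam, hob_lbah = high[2:5]
    return ((hob_lbah << 40) | (hob_lbam << 32) | (hob_lbal << 24)
            | (lbah << 16) | (lbam << 8) | lbal)

if __name__ == "__main__":
    first = taskfile_lba("ata1.00: cmd 60/10:00:98:17:81/00:00:d7:00:00/40 tag 0 ncq 8192 in")
    later = taskfile_lba("ata1.00: cmd 60/a0:18:b0:23:81/00:00:d7:00:00/40 tag 3 ncq 81920 in")
    print(hex(first), hex(later), later - first)
```

Lines without a taskfile dump (for example `ata1: EH complete`) simply return None, so the function can be mapped over a whole log.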

Pavel.Odintsov
Member since 13.05.2009
Offline
169
#334
Oct 16 14:01:24 host3 shutdown [248151]: shutting down for system reboot
Oct 16 14:05:49 host3 shutdown [19907]: shutting down for system reboot

That isn't caused by hardware problems; someone rebooted the machine.

Please show the output of smartctl --all /dev/sda and smartctl --all /dev/sdb, as well as cat /proc/mdstat.

P
Member since 16.03.2009
Offline
144
#335
Pavel.Odintsov:
That isn't caused by hardware problems; someone rebooted the machine.

Please show the output of smartctl --all /dev/sda and smartctl --all /dev/sdb, as well as cat /proc/mdstat.

That's just it: nobody rebooted it. I was logged in over ssh at that moment. The SMART data looks normal, and so does mdstat.

Here is another piece of the log.

Oct 17 01:06:42 host3 kernel: [38358.420940] ata2.00: exception Emask 0x10 SAct 0x1f002 SErr 0x400100 action 0x6 frozen
Oct 17 01:06:42 host3 kernel: [38358.421591] ata2.00: irq_stat 0x08000000, interface fatal error
Oct 17 01:06:42 host3 kernel: [38358.422273] ata2: SError: { UnrecovData Handshk }
Oct 17 01:06:42 host3 kernel: [38358.422938] ata2.00: failed command: READ FPDMA QUEUED
Oct 17 01:06:42 host3 kernel: [38358.423593] ata2.00: cmd 60/10:08:90:ed:b0/00:00:e1:00:00/40 tag 1 ncq 8192 in
Oct 17 01:06:42 host3 kernel: [38358.423594] res c0/00:08:48:3a:71/00:00:e1:00:00/40 Emask 0x12 (ATA bus error)
Oct 17 01:06:42 host3 kernel: [38358.424930] ata2.00: status: { Busy }
Oct 17 01:06:42 host3 kernel: [38358.425611] ata2.00: failed command: WRITE FPDMA QUEUED
Oct 17 01:06:42 host3 kernel: [38358.426319] ata2.00: cmd 61/08:60:20:6f:ba/00:00:ca:00:00/40 tag 12 ncq 4096 out
Oct 17 01:06:42 host3 kernel: [38358.426320] res c0/00:08:48:3a:71/00:00:e1:00:00/40 Emask 0x12 (ATA bus error)
Oct 17 01:06:42 host3 kernel: [38358.427703] ata2.00: status: { Busy }
Oct 17 01:06:42 host3 kernel: [38358.428417] ata2.00: failed command: WRITE FPDMA QUEUED
Oct 17 01:06:42 host3 kernel: [38358.429097] ata2.00: cmd 61/08:68:28:6f:ba/00:00:ca:00:00/40 tag 13 ncq 4096 out
Oct 17 01:06:42 host3 kernel: [38358.429099] res c0/00:08:48:3a:71/00:00:e1:00:00/40 Emask 0x12 (ATA bus error)
Oct 17 01:06:42 host3 kernel: [38358.430419] ata2.00: status: { Busy }
Oct 17 01:06:42 host3 kernel: [38358.431057] ata2.00: failed command: WRITE FPDMA QUEUED
Oct 17 01:06:42 host3 kernel: [38358.431700] ata2.00: cmd 61/08:70:30:6f:ba/00:00:ca:00:00/40 tag 14 ncq 4096 out
Oct 17 01:06:42 host3 kernel: [38358.431702] res c0/00:08:48:3a:71/00:00:e1:00:00/40 Emask 0x12 (ATA bus error)
Oct 17 01:06:42 host3 kernel: [38358.432963] ata2.00: status: { Busy }
Oct 17 01:06:42 host3 kernel: [38358.433627] ata2.00: failed command: WRITE FPDMA QUEUED
Oct 17 01:06:42 host3 kernel: [38358.434295] ata2.00: cmd 61/08:78:38:6f:ba/00:00:ca:00:00/40 tag 15 ncq 4096 out
Oct 17 01:06:42 host3 kernel: [38358.434296] res c0/00:08:48:3a:71/00:00:e1:00:00/40 Emask 0x12 (ATA bus error)
Oct 17 01:06:42 host3 kernel: [38358.435591] ata2.00: status: { Busy }
Oct 17 01:06:42 host3 kernel: [38358.436260] ata2.00: failed command: WRITE FPDMA QUEUED
Oct 17 01:06:42 host3 kernel: [38358.436918] ata2.00: cmd 61/08:80:40:6f:ba/00:00:ca:00:00/40 tag 16 ncq 4096 out
Oct 17 01:06:42 host3 kernel: [38358.436920] res c0/00:08:48:3a:71/00:00:e1:00:00/40 Emask 0x12 (ATA bus error)
Oct 17 01:06:42 host3 kernel: [38358.438231] ata2.00: status: { Busy }
Oct 17 01:06:42 host3 kernel: [38358.438874] ata2: hard resetting link
Oct 17 01:06:43 host3 kernel: [38358.744635] ata2: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
Oct 17 01:06:43 host3 kernel: [38358.813032] ata2.00: configured for UDMA/133
Oct 17 01:06:43 host3 kernel: [38358.813690] ata2: EH complete
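
The errors so far fall into two classes: `media error` (UNC) on ata1 and `ATA bus error` on ata2. A small tally per port makes that split easier to see in a long log. This is a rough sketch: the function name `summarize` is hypothetical, and because the `res` lines carry no port prefix, it attributes each one to the most recently seen `ataN` port, which is an assumption about line ordering.

```python
import re
from collections import Counter

PORT_RE = re.compile(r"\b(ata\d+)(?:\.\d+)?: ")
# Matches only the classified form "Emask 0x409 (media error)",
# not the bare "exception Emask 0x0 SAct ..." header line.
EMASK_RE = re.compile(r"Emask 0x[0-9a-f]+ \(([^)]+)\)")

def summarize(lines):
    """Count (port, error class) pairs in libata kernel log lines."""
    counts = Counter()
    port = None
    for line in lines:
        p = PORT_RE.search(line)
        if p:
            port = p.group(1)
        e = EMASK_RE.search(line)
        if e and port is not None:
            counts[(port, e.group(1))] += 1
    return counts

if __name__ == "__main__":
    import sys
    # Usage (hypothetical log path): python summarize.py < /var/log/kern.log
    for (port, kind), n in sorted(summarize(sys.stdin).items()):
        print(port, kind, n)
```

As a rule of thumb, bus errors tend to point at the link (cable, connector, power), while UNC points at the media itself, but neither is conclusive on its own.
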
smartctl -A /dev/sda
smartctl 5.40 2010-07-12 r3124 [x86_64-unknown-linux-gnu] (local build)
Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net

=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x000f 113 099 006 Pre-fail Always - 54365432
3 Spin_Up_Time 0x0003 092 092 000 Pre-fail Always - 0
4 Start_Stop_Count 0x0032 100 100 020 Old_age Always - 18
5 Reallocated_Sector_Ct 0x0033 100 100 036 Pre-fail Always - 0
7 Seek_Error_Rate 0x000f 081 060 030 Pre-fail Always - 4453205658
9 Power_On_Hours 0x0032 095 095 000 Old_age Always - 4693
10 Spin_Retry_Count 0x0013 100 100 097 Pre-fail Always - 0
12 Power_Cycle_Count 0x0032 100 100 020 Old_age Always - 18
183 Runtime_Bad_Block 0x0032 100 100 000 Old_age Always - 0
184 End-to-End_Error 0x0032 100 100 099 Old_age Always - 0
187 Reported_Uncorrect 0x0032 068 068 000 Old_age Always - 32
188 Command_Timeout 0x0032 100 100 000 Old_age Always - 0
189 High_Fly_Writes 0x003a 100 100 000 Old_age Always - 0
190 Airflow_Temperature_Cel 0x0022 061 029 045 Old_age Always In_the_past 39 (0 10 44 36)
191 G-Sense_Error_Rate 0x0032 100 100 000 Old_age Always - 0
192 Power-Off_Retract_Count 0x0032 100 100 000 Old_age Always - 14
193 Load_Cycle_Count 0x0032 097 097 000 Old_age Always - 6143
194 Temperature_Celsius 0x0022 039 071 000 Old_age Always - 39 (0 20 0 0)
197 Current_Pending_Sector 0x0012 100 001 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0010 100 001 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x003e 200 200 000 Old_age Always - 0
240 Head_Flying_Hours 0x0000 100 253 000 Old_age Offline - 24962349929011
241 Total_LBAs_Written 0x0000 100 253 000 Old_age Offline - 88987481756827
242 Total_LBAs_Read 0x0000 100 253 000 Old_age Offline - 89486928888334
smartctl -A /dev/sdb
smartctl 5.40 2010-07-12 r3124 [x86_64-unknown-linux-gnu] (local build)
Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net

=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x000f 114 099 006 Pre-fail Always - 159783104
3 Spin_Up_Time 0x0003 093 093 000 Pre-fail Always - 0
4 Start_Stop_Count 0x0032 100 100 020 Old_age Always - 10
5 Reallocated_Sector_Ct 0x0033 100 100 036 Pre-fail Always - 0
7 Seek_Error_Rate 0x000f 084 060 030 Pre-fail Always - 325119407
9 Power_On_Hours 0x0032 094 094 000 Old_age Always - 5643
10 Spin_Retry_Count 0x0013 100 100 097 Pre-fail Always - 0
12 Power_Cycle_Count 0x0032 100 100 020 Old_age Always - 10
183 Runtime_Bad_Block 0x0032 100 100 000 Old_age Always - 0
184 End-to-End_Error 0x0032 100 100 099 Old_age Always - 0
187 Reported_Uncorrect 0x0032 100 100 000 Old_age Always - 0
188 Command_Timeout 0x0032 100 100 000 Old_age Always - 0
189 High_Fly_Writes 0x003a 088 088 000 Old_age Always - 12
190 Airflow_Temperature_Cel 0x0022 065 033 045 Old_age Always In_the_past 35 (0 7 41 33)
191 G-Sense_Error_Rate 0x0032 100 100 000 Old_age Always - 0
192 Power-Off_Retract_Count 0x0032 100 100 000 Old_age Always - 6
193 Load_Cycle_Count 0x0032 083 083 000 Old_age Always - 34264
194 Temperature_Celsius 0x0022 035 067 000 Old_age Always - 35 (0 19 0 0)
197 Current_Pending_Sector 0x0012 100 100 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0010 100 100 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x003e 200 200 000 Old_age Always - 2
240 Head_Flying_Hours 0x0000 100 253 000 Old_age Offline - 4733053965599
241 Total_LBAs_Written 0x0000 100 253 000 Old_age Offline - 273127036603939
242 Total_LBAs_Read 0x0000 100 253 000 Old_age Offline - 157156093870460
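
The attribute tables above are long, so here is a hedged sketch that pulls out only the non-zero raw values of a few error counters. The `WATCHED` set is my own selection, not something from the thread, and `nonzero_watched` is a made-up name. Run against the two tables above it would report `Reported_Uncorrect` = 32 for sda and `UDMA_CRC_Error_Count` = 2 for sdb. If ata1 is sda and ata2 is sdb (the thread does not confirm that mapping), those counters would line up with the UNC errors on ata1 and the bus errors on ata2.

```python
# Attributes worth a second look when their raw value is non-zero.
# This selection is an assumption for illustration, not a vendor recommendation.
WATCHED = {
    "Reallocated_Sector_Ct",
    "Reported_Uncorrect",
    "Current_Pending_Sector",
    "Offline_Uncorrectable",
    "UDMA_CRC_Error_Count",
}

def nonzero_watched(smart_text, watched=WATCHED):
    """Return {attribute: raw_value} for watched attributes with a non-zero raw value."""
    found = {}
    for line in smart_text.splitlines():
        fields = line.split()
        # Attribute rows look like:
        # ID NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
        if len(fields) >= 10 and fields[0].isdigit() and fields[1] in watched:
            raw = fields[9]
            if raw.isdigit() and int(raw) != 0:
                found[fields[1]] = int(raw)
    return found

if __name__ == "__main__":
    import sys
    # Usage: smartctl -A /dev/sda | python smart_check.py
    print(nonzero_watched(sys.stdin.read()))
```
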
cat /proc/mdstat
Personalities : [raid0] [raid1] [raid6] [raid5] [raid4] [raid10]
md1 : active raid1 sda2[2] sdb2[1]
2929739071 blocks super 1.2 [2/2] [UU]

md0 : active raid1 sda1[2] sdb1[1]
524276 blocks super 1.2 [2/2] [UU]

unused devices: <none>
P
Member since 16.03.2009
Offline
144
#336

And one more piece of the log:

Oct 17 03:58:08 host3 kernel: [48638.176273] ------------[ cut here ]------------
Oct 17 03:58:08 host3 kernel: [48638.176917] WARNING: at drivers/gpu/drm/i915/i915_irq.c:652 ironlake_irq_handler+0x4ea/0x500 [i915]() (Not tainted)
Oct 17 03:58:08 host3 kernel: [48638.177580] Hardware name: System Product Name
Oct 17 03:58:08 host3 kernel: [48638.178216] Missed a PM interrupt
Oct 17 03:58:08 host3 kernel: [48638.178844] Modules linked in: cls_u32 sch_sfq sch_htb coretemp vzethdev vznetdev pio_nfs pio_direct pfmt_raw pfmt_ploop1 ploop simfs vzrst nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 vzcpt nfs lockd fscache nfs_acl auth_rpcgss sunrpc nf_conntrack vzdquota vzmon vzdev xt_length xt_hl xt_tcpmss xt_TCPMSS iptable_mangle iptable_filter xt_multiport xt_limit xt_dscp ipt_REJECT ip_tables acpi_cpufreq cpufreq_powersave cpufreq_stats cpufreq_ondemand cpufreq_conservative vzevent ipv6 ext2 snd_pcsp i915 drm_kms_helper drm i2c_algo_bit snd_pcm i2c_i801 video snd_timer tpm_tis tpm snd i2c_core tpm_bios xhci_hcd output soundcore snd_page_alloc ext4 mbcache jbd2 dm_mod freq_table mperf aacraid 3w_9xxx 3w_xxxx raid10 raid456 async_raid6_recov async_pq raid6_pq async_xor xor async_memcpy async_tx raid1 raid0 sata_nv sata_sil sata_via sd_mod crc_t10dif ahci r8169 mii [last unloaded: scsi_wait_scan]
Oct 17 03:58:08 host3 kernel: [48638.183870] Pid: 0, comm: swapper veid: 0 Not tainted 2.6.32-042stab059.7 #1
Oct 17 03:58:08 host3 kernel: [48638.184584] Call Trace:
Oct 17 03:58:08 host3 kernel: [48638.185298] <IRQ> [<ffffffff8106d7d7>] ? warn_slowpath_common+0x87/0xc0
Oct 17 03:58:08 host3 kernel: [48638.186052] [<ffffffff8106d8c6>] ? warn_slowpath_fmt+0x46/0x50
Oct 17 03:58:08 host3 kernel: [48638.186804] [<ffffffffa02cf51a>] ? ironlake_irq_handler+0x4ea/0x500 [i915]
Oct 17 03:58:08 host3 kernel: [48638.187560] [<ffffffff810ebce0>] ? handle_IRQ_event+0x60/0x170
Oct 17 03:58:08 host3 kernel: [48638.188293] [<ffffffff810ee46e>] ? handle_edge_irq+0xde/0x180
Oct 17 03:58:08 host3 kernel: [48638.189026] [<ffffffff8100dfc9>] ? handle_irq+0x49/0xa0
Oct 17 03:58:08 host3 kernel: [48638.189759] [<ffffffff815018ec>] ? do_IRQ+0x6c/0xf0
Oct 17 03:58:08 host3 kernel: [48638.190467] [<ffffffff8100bb13>] ? ret_from_intr+0x0/0x11
Oct 17 03:58:08 host3 kernel: [48638.191218] <EOI> [<ffffffff812ced7e>] ? intel_idle+0xde/0x170
Oct 17 03:58:08 host3 kernel: [48638.191966] [<ffffffff812ced61>] ? intel_idle+0xc1/0x170
Oct 17 03:58:08 host3 kernel: [48638.192707] [<ffffffff81409547>] ? cpuidle_idle_call+0xa7/0x140
Oct 17 03:58:08 host3 kernel: [48638.193422] [<ffffffff81009e66>] ? cpu_idle+0xb6/0x110
Oct 17 03:58:08 host3 kernel: [48638.194125] [<ffffffff814e0835>] ? rest_init+0x85/0x90
Oct 17 03:58:08 host3 kernel: [48638.194821] [<ffffffff81c2bf6e>] ? start_kernel+0x412/0x41e
Oct 17 03:58:08 host3 kernel: [48638.195507] [<ffffffff81c2b33a>] ? x86_64_start_reservations+0x125/0x129
Oct 17 03:58:08 host3 kernel: [48638.196196] [<ffffffff81c2b438>] ? x86_64_start_kernel+0xfa/0x109
Oct 17 03:58:08 host3 kernel: [48638.196883] ---[ end trace a94e657250f4e08d ]---
Pavel.Odintsov
Member since 13.05.2009
Offline
169
#337

The stack trace is just an OpenVZ quirk, but as for the disks, it does look like the cable after all...

P
Member since 16.03.2009
Offline
144
#338
Pavel.Odintsov:
The stack trace is just an OpenVZ quirk, but as for the disks, it does look like the cable after all...

I'll file it in the bug tracker. The cable was replaced a day or two ago.

M
Member since 24.10.2011
Offline
173
#339

Speaking of disks :)

Dear Michael,

we have replaced both drives again. The server is in the rescue system.
WEB-мастер
Member since 23.07.2009
Offline
174
#340

Am I right that I'm the only one whose server isn't responding to ping?

I sent it for a reboot through the panel, and there is still no ping o_O
