LINUX.ORG.RU
решено ФорумAdmin

split brain drbd диска на proxmox 3.1 HELP


0

1

после обновления proxmox с 2.3 до 3.1 drbd диск развалился, скорее всего из за необходимой перезагрузки, т.к. обновлений в части drbd не было

виртуалки на drbd диске запущены

proxmox1

root@proxmox1:/var/log# cat /proc/drbd
version: 8.3.13 (api:88/proto:86-96)
GIT-hash: 83ca112086600faacab2f157bc5a9324f7bd7f77 build by root@sighted, 2012-10-09 12:47:51
 0: cs:StandAlone ro:Primary/Unknown ds:UpToDate/DUnknown   r-----
    ns:0 nr:0 dw:27570495 dr:824728362 al:73564 bm:6743 lo:0 pe:0 ua:0 ap:0 ep:1 wo:b oos:1632860
proxmox2

root@proxmox2:/etc/drbd.d# cat /proc/drbd
version: 8.3.13 (api:88/proto:86-96)
GIT-hash: 83ca112086600faacab2f157bc5a9324f7bd7f77 build by root@sighted, 2012-10-09 12:47:51
 0: cs:StandAlone ro:Secondary/Unknown ds:UpToDate/DUnknown   r-----
    ns:0 nr:0 dw:0 dr:828 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:b oos:0

конфиг

# more rs0.res
resource r0 {
        protocol C;
        startup {
                wfc-timeout  180;     
                degr-wfc-timeout 60;
                become-primary-on both;
        }
        net {
                cram-hmac-alg sha1;
                shared-secret "my-secret";
                allow-two-primaries;
                after-sb-0pri discard-zero-changes;
                after-sb-1pri discard-secondary;
                after-sb-2pri disconnect;
                #data-integrity-alg crc32c;     
        }
        on proxmox1 {
                device /dev/drbd0;
                disk /dev/sdb1;
                address 192.168.100.1:7788;
                meta-disk internal;
        }
        on proxmox2 {
                device /dev/drbd0;
                disk /dev/sdb1;
                address 192.168.100.2:7788;
                meta-disk internal;
        }
}

восстанавливаю так

на proxmox2
drbdadm disconnect r0
drbdadm secondary r0
drbdadm -- --discard-my-data connect r0

на proxmox1
drbdadm connect r0

в логах нод

Aug 24 11:46:43 proxmox1 kernel: block drbd0: conn( StandAlone -> Unconnected )
Aug 24 11:46:43 proxmox1 kernel: block drbd0: Starting receiver thread (from drbd0_worker [2685])
Aug 24 11:46:43 proxmox1 kernel: block drbd0: receiver (re)started
Aug 24 11:46:43 proxmox1 kernel: block drbd0: conn( Unconnected -> WFConnection )
Aug 24 11:46:43 proxmox1 kernel: block drbd0: Handshake successful: Agreed network protocol version 96
Aug 24 11:46:43 proxmox1 kernel: block drbd0: Peer authenticated using 20 bytes of 'sha1' HMAC
Aug 24 11:46:43 proxmox1 kernel: block drbd0: conn( WFConnection -> WFReportParams )
Aug 24 11:46:43 proxmox1 kernel: block drbd0: Starting asender thread (from drbd0_receiver [72831])
Aug 24 11:46:43 proxmox1 kernel: block drbd0: data-integrity-alg: <not-used>
Aug 24 11:46:43 proxmox1 kernel: block drbd0: conn( WFReportParams -> Disconnecting )
Aug 24 11:46:43 proxmox1 kernel: block drbd0: asender terminated
Aug 24 11:46:43 proxmox1 kernel: block drbd0: Terminating asender thread
Aug 24 11:46:43 proxmox1 kernel: block drbd0: Connection closed
Aug 24 11:46:43 proxmox1 kernel: block drbd0: conn( Disconnecting -> StandAlone )
Aug 24 11:46:43 proxmox1 kernel: block drbd0: receiver terminated
Aug 24 11:46:43 proxmox1 kernel: block drbd0: Terminating receiver thread



Aug 24 10:44:49 proxmox2 kernel: block drbd0: conn( StandAlone -> Unconnected )
Aug 24 10:44:49 proxmox2 kernel: block drbd0: Starting receiver thread (from drbd0_worker [2827])
Aug 24 10:44:49 proxmox2 kernel: block drbd0: receiver (re)started
Aug 24 10:44:49 proxmox2 kernel: block drbd0: conn( Unconnected -> WFConnection )
Aug 24 11:45:34 proxmox2 kernel: block drbd0: conn( WFConnection -> Disconnecting )
Aug 24 11:45:34 proxmox2 kernel: block drbd0: Discarding network configuration.
Aug 24 11:45:34 proxmox2 kernel: block drbd0: Connection closed
Aug 24 11:45:34 proxmox2 kernel: block drbd0: conn( Disconnecting -> StandAlone )
Aug 24 11:45:34 proxmox2 kernel: block drbd0: receiver terminated
Aug 24 11:45:34 proxmox2 kernel: block drbd0: Terminating receiver thread
Aug 24 11:46:41 proxmox2 kernel: block drbd0: conn( StandAlone -> Unconnected )
Aug 24 11:46:41 proxmox2 kernel: block drbd0: Starting receiver thread (from drbd0_worker [2827])
Aug 24 11:46:41 proxmox2 kernel: block drbd0: receiver (re)started
Aug 24 11:46:41 proxmox2 kernel: block drbd0: conn( Unconnected -> WFConnection )
Aug 24 11:46:43 proxmox2 kernel: block drbd0: Handshake successful: Agreed network protocol version 96
Aug 24 11:46:43 proxmox2 kernel: block drbd0: Peer authenticated using 20 bytes of 'sha1' HMAC
Aug 24 11:46:43 proxmox2 kernel: block drbd0: conn( WFConnection -> WFReportParams )
Aug 24 11:46:43 proxmox2 kernel: block drbd0: Starting asender thread (from drbd0_receiver [39274])
Aug 24 11:46:43 proxmox2 kernel: block drbd0: data-integrity-alg: <not-used>
Aug 24 11:46:43 proxmox2 kernel: block drbd0: conn( WFReportParams -> Disconnecting )
Aug 24 11:46:43 proxmox2 kernel: block drbd0: asender terminated
Aug 24 11:46:43 proxmox2 kernel: block drbd0: Terminating asender thread
Aug 24 11:46:43 proxmox2 kernel: block drbd0: Connection closed
Aug 24 11:46:43 proxmox2 kernel: block drbd0: conn( Disconnecting -> StandAlone )
Aug 24 11:46:43 proxmox2 kernel: block drbd0: receiver terminated
Aug 24 11:46:43 proxmox2 kernel: block drbd0: Terminating receiver thread

не пойму почему не запускается

★★★★

Последнее исправление: Vlad-76 (всего исправлений: 1)

оказалось что конфиги drbd в /etc/drbd.d/ не были идентичными на двух нодах как так получилось - загадка. ответ нашелся по команде dmesg.
вот так обновил proxmox :)

Vlad-76 ★★★★
() автор топика
Вы не можете добавлять комментарии в эту тему. Тема перемещена в архив.