DRBD always causing splt brain after restart

drbdpacemaker

I'm absolutely new to the whole cluster topic (pacemaker with DRBD, etc) and in general new to linux (using it on a daily basis for about two months now). Actually I'm playing around with a simple Apache cluster, accessing a shared DRBD mounting point, just for fun. Everything works fine now, despite the fact, that every time one of the two nodes reboots I end up with a split brain situation. My guess is, that the network connection is cut off before the primary node can declare itself as secondary (does it in fact work that way?). So my question is how to investigate what's the problem here. Any advice would be appreciated.

Thanks!

Best Answer

Looking at the syslog on both nodes will likely help you determine where the split-brain is happening.

Understanding how DRBD uses generation IDs to decide that a split-brain has occurred may help as well. Whenever the generation IDs are updated or compared, there will be an entry placed in syslog.

Reference: https://drbd.linbit.com/users-guide/s-gi.html

EDIT: I just noticed the Pacemaker tag on this post. Do you have DRBD/Apache set to start at boot? When using a cluster manager like Pacemaker, you will want to leave all the starting and stopping of services up to Pacemaker.