Yeah, we are looking into what’s causing the zero leader to go down.
But what we are also seeing is that even after zero goes down, it takes over 20 seconds before alpha starts responding to queries. I have attached the logs from both alpha and zero where you can see the zero-4 (leader) goes down at 20:39:19 and a new leader is elected by 20:39:20.
But from alpha logs, it keeps trying to connect to zero-4 when zero-2 was elected leader.
alpha-logs.txt (35.9 KB)
zero-logs.txt (52.0 KB)