[Tarantool-patches] [PATCH] raft: make sure the leader stays ro till it clears the limbo
Serge Petrenko
sergepetrenko at tarantool.org
Fri Nov 27 16:00:07 MSK 2020
27.11.2020 00:01, Vladislav Shpilevoy пишет:
> Hi! Thanks for the patch!
>
> On 24.11.2020 14:18, Serge Petrenko wrote:
>> When running a cluster with leader election, its useful to wait till the
>> instance is writeable to determine that it has become a leader. However,
>> sometimes the instance fails to write data right after transitioning to
>> leader because its limbo still contains pending transactions from the
>> old leader. Make sure the instance deals with pending transactions first
>> and becomes writeable only once the limbo is empty.
> I just realized one thing. We can add a function txn_limbo_is_ro(),
> like we did with raft_is_ro(), account it in box_update_ro_summary(),
> and call box_update_ro_summary() when we see that the limbo is emptied,
> or when its ownership changes to a different instance.
>
> Probably would be simpler, and also we could make it work with manual
> election! So users could call box.ctl.wait_rw() even without using raft!
>
> To show concrete error if somebody still tries to write, we could
> patch box_check_writable() to show the reason why the instance is not
> writable. We will do it anyway for raft, to tell the users the real
> leader in case they are trying to write on a replica. In scope of
> https://github.com/tarantool/tarantool/issues/5568.
>
> Your version of the patch also looks good.
>
> What do you think?
Thanks for your answer!
Your proposal looks good. One question though. What about multimaster
synchro? Are we planning to support it one day? If yes, then limbo
emptiness will mean nothing.
So, there're two options:
1) we may leave this patch as is. Then one won't be
able to call wait_rw() with manual election. That's a pity, since
your proposal looks quite logical, especially from the user's point
of view. Having a single error for all these cases would be good.
2) Make limbo affect is_ro. Then everything's good for now, but we'll
have to rewrite it back once (and if) we decide to implement
multimaster synchro. Then the patch will look exactly like it does
now.
I'm not sure whether we're planning to make multimaster synchro work,
so I can't choose between these options and leave you to decide.
Besides, looks like if we take option 2 the patch will differ in a single
line: `box_update_ro_summary()` after `box_clear_synchro_queue`
--
Serge Petrenko
More information about the Tarantool-patches
mailing list