From: Serge Petrenko <sergepetrenko@tarantool.org> To: Vladislav Shpilevoy <v.shpilevoy@tarantool.org>, tarantool-patches@dev.tarantool.org Subject: Re: [Tarantool-patches] [PATCH 1/5] [tosquash] replication: fix multiple rollbacks Date: Sun, 5 Jul 2020 12:34:17 +0300 [thread overview] Message-ID: <31755095-f5c6-cc73-f3d3-6bfe233b78c1@tarantool.org> (raw) In-Reply-To: <f642fd0e2aac5f6785741dcfc4c03c312cb29558.1593733161.git.v.shpilevoy@tarantool.org> 03.07.2020 02:40, Vladislav Shpilevoy пишет: > The problem was that if several transactions time out in one > event loop iteration, the all will write rollback. Moreover, they > will do that in a weird order, starting from the oldest, not in > a reversed order. > > This patch makes limbo write only one rollback at once. > --- > src/box/txn_limbo.c | 25 +++++++++++++++++++++++++ > test/replication/qsync_basic.result | 2 +- > 2 files changed, 26 insertions(+), 1 deletion(-) > > diff --git a/src/box/txn_limbo.c b/src/box/txn_limbo.c > index 0402664cb..2cb687f4d 100644 > --- a/src/box/txn_limbo.c > +++ b/src/box/txn_limbo.c > @@ -44,6 +44,13 @@ txn_limbo_create(struct txn_limbo *limbo) > limbo->got_rollback = false; > } > > +static inline struct txn_limbo_entry * > +txn_limbo_first_entry(struct txn_limbo *limbo) > +{ > + return rlist_first_entry(&limbo->queue, struct txn_limbo_entry, > + in_queue); > +} > + > struct txn_limbo_entry * > txn_limbo_append(struct txn_limbo *limbo, uint32_t id, struct txn *txn) > { > @@ -150,6 +157,24 @@ txn_limbo_wait_complete(struct txn_limbo *limbo, struct txn_limbo_entry *entry) > bool timed_out = fiber_yield_timeout(txn_limbo_confirm_timeout(limbo)); > fiber_set_cancellable(cancellable); > if (timed_out) { > + assert(!txn_limbo_is_empty(limbo)); > + if (txn_limbo_first_entry(limbo) != entry) { > + /* > + * If this is not a first entry in the > + * limbo, it is definitely not a first > + * timed out entry. And since it managed > + * to time out too, it means there is > + * currently another fiber writing > + * rollback. Wait when it will finish and > + * wake us up. > + */ Why isn't it the first timed out? Is it because once previous entry was confirmed, it is removed from the queue immediately? Looks fragile. > + bool cancellable = fiber_set_cancellable(false); > + fiber_yield(); > + fiber_set_cancellable(cancellable); > + assert(txn_limbo_entry_is_complete(entry)); > + goto complete; > + } > + > txn_limbo_write_rollback(limbo, entry); > struct txn_limbo_entry *e, *tmp; > rlist_foreach_entry_safe_reverse(e, &limbo->queue, > diff --git a/test/replication/qsync_basic.result b/test/replication/qsync_basic.result > index cdecf00e8..32deb2ac3 100644 > --- a/test/replication/qsync_basic.result > +++ b/test/replication/qsync_basic.result > @@ -272,7 +272,7 @@ box.cfg{replication_synchro_timeout = 0.001, replication_synchro_quorum = 3} > | ... > f = fiber.create(box.space.sync.replace, box.space.sync, {6}) s:replace{6} > | --- > - | - error: Quorum collection for a synchronous transaction is timed out > + | - error: A rollback for a synchronous transaction is received > | ... > f:status() > | --- -- Serge Petrenko
next prev parent reply other threads:[~2020-07-05 9:34 UTC|newest] Thread overview: 14+ messages / expand[flat|nested] mbox.gz Atom feed top 2020-07-02 23:40 [Tarantool-patches] [PATCH 0/5] Qsync: local rows fixes Vladislav Shpilevoy 2020-07-02 23:40 ` [Tarantool-patches] [PATCH 1/5] [tosquash] replication: fix multiple rollbacks Vladislav Shpilevoy 2020-07-05 9:34 ` Serge Petrenko [this message] 2020-07-05 15:13 ` Vladislav Shpilevoy 2020-07-02 23:40 ` [Tarantool-patches] [PATCH 2/5] [tosquash] applier: remove unnecessary fiber name check Vladislav Shpilevoy 2020-07-05 8:40 ` Serge Petrenko 2020-07-02 23:40 ` [Tarantool-patches] [PATCH 3/5] [tosquash] txn_limbo: fix release build Vladislav Shpilevoy 2020-07-05 8:41 ` Serge Petrenko 2020-07-02 23:40 ` [Tarantool-patches] [PATCH 4/5] [tosquash] replication: rework how local transactions wait sync Vladislav Shpilevoy 2020-07-05 9:04 ` Serge Petrenko 2020-07-05 15:13 ` Vladislav Shpilevoy 2020-07-02 23:40 ` [Tarantool-patches] [PATCH 5/5] [tosquash] replication: add test on local row in the end of sync txn Vladislav Shpilevoy 2020-07-05 9:11 ` Serge Petrenko 2020-07-05 15:13 ` Vladislav Shpilevoy
Reply instructions: You may reply publicly to this message via plain-text email using any one of the following methods: * Save the following mbox file, import it into your mail client, and reply-to-all from there: mbox Avoid top-posting and favor interleaved quoting: https://en.wikipedia.org/wiki/Posting_style#Interleaved_style * Reply using the --to, --cc, and --in-reply-to switches of git-send-email(1): git send-email \ --in-reply-to=31755095-f5c6-cc73-f3d3-6bfe233b78c1@tarantool.org \ --to=sergepetrenko@tarantool.org \ --cc=tarantool-patches@dev.tarantool.org \ --cc=v.shpilevoy@tarantool.org \ --subject='Re: [Tarantool-patches] [PATCH 1/5] [tosquash] replication: fix multiple rollbacks' \ /path/to/YOUR_REPLY https://kernel.org/pub/software/scm/git/docs/git-send-email.html * If your mail client supports setting the In-Reply-To header via mailto: links, try the mailto: link
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox