[Tarantool-patches] [PATCH 2/2] relay: fix segfault on replica transition from anonymous
Serge Petrenko
sergepetrenko at tarantool.org
Sat Apr 11 15:33:45 MSK 2020
relay_subscribe_f sets a recovery trigger notifying tx when a full log
is read and gc consumer corresponding to the replica may be advanced.
Since anonymous replicas do not have gc consumers, the trigger isn't
added for them. However, on relay exit, the trigger deletion depends
on replica->anon flag. This is buggy in case relay stalls on exit due to
replica disconnect. Replica has time to reconnect and register as a
normal instance, hence its replica->anon flag will be false by the time
we check whether to clear triggers or not, effectively makinig us to
clear unset triggers and segfault.
Fix this by actually checking whether the trigger is set instead of
relying on replica->anon flag.
Closes #4731
---
src/box/relay.cc | 16 +++++++++++-----
1 file changed, 11 insertions(+), 5 deletions(-)
diff --git a/src/box/relay.cc b/src/box/relay.cc
index c634348a4..5f6edcd78 100644
--- a/src/box/relay.cc
+++ b/src/box/relay.cc
@@ -580,9 +580,8 @@ relay_subscribe_f(va_list ap)
* Not needed for anonymous replicas, since they
* aren't registered with gc at all.
*/
- struct trigger on_close_log = {
- RLIST_LINK_INITIALIZER, relay_on_close_log_f, relay, NULL
- };
+ struct trigger on_close_log;
+ trigger_create(&on_close_log, relay_on_close_log_f, relay, NULL);
if (!relay->replica->anon)
trigger_add(&r->on_close_log, &on_close_log);
@@ -662,8 +661,15 @@ relay_subscribe_f(va_list ap)
diag_log();
say_crit("exiting the relay loop");
- /* Clear garbage collector trigger and WAL watcher. */
- if (!relay->replica->anon)
+ /*
+ * Clear garbage collector trigger and WAL watcher.
+ * Note, even though we set the trigger only when the
+ * replica is anonymous, we cannot check replica->anon
+ * here to determine whether the trigger is set. The
+ * replica may have registered and become non-anonymous
+ * while this relay thread was still exiting.
+ */
+ if (trigger_is_set(&on_close_log))
trigger_clear(&on_close_log);
wal_clear_watcher(&relay->wal_watcher, cbus_process);
--
2.21.1 (Apple Git-122.3)
More information about the Tarantool-patches
mailing list