From: Vladimir Davydov <vdavydov.dev@gmail.com> To: tarantool-patches@freelists.org Subject: [PATCH 06/10] vinyl: lock out compaction while checkpointing is in progress Date: Fri, 17 May 2019 17:52:40 +0300 [thread overview] Message-ID: <d606800d53f6476cb4c1fcaec6ea002afeadd83d.1558103547.git.vdavydov.dev@gmail.com> (raw) In-Reply-To: <cover.1558103547.git.vdavydov.dev@gmail.com> In-Reply-To: <cover.1558103547.git.vdavydov.dev@gmail.com> Upon compaction completion we delete run files that are not needed for recovery from the last checkpoint. If we start compaction while checkpointing is in progress, it might compact and delete a run file dumped during checkpointing, which would make it impossible to backup files corresponding to the last checkpoint. Commit b25e31685096 ("vinyl: fix compaction vs checkpoint race resulting in invalid gc") claimed to eliminate the races, but in fact it just minimized probability of it: no matter whether we use the last checkpoint signature or vylog signature when checking if a run file doesn't belong to the last checkpoint, there's still a race window, which opens wider in case the more indexes we have to dump. Let's fix it once and for all by locking out compaction until checkpointing is complete. --- src/box/vy_scheduler.c | 17 +++++++++++++++++ test/vinyl/errinj.result | 16 +++++++++++++--- test/vinyl/errinj.test.lua | 9 ++++++--- 3 files changed, 36 insertions(+), 6 deletions(-) diff --git a/src/box/vy_scheduler.c b/src/box/vy_scheduler.c index 0180331e..2d1abc2e 100644 --- a/src/box/vy_scheduler.c +++ b/src/box/vy_scheduler.c @@ -741,6 +741,13 @@ vy_scheduler_end_checkpoint(struct vy_scheduler *scheduler) */ vy_scheduler_trigger_dump(scheduler); } + + /* + * Checkpointing temporarily blocks compaction. + * Wake up the scheduler to check if there are + * pending compaction tasks. + */ + fiber_cond_signal(&scheduler->scheduler_cond); } /** @@ -1896,6 +1903,16 @@ vy_scheduler_peek_compaction(struct vy_scheduler *scheduler, struct vy_worker *worker = NULL; retry: *ptask = NULL; + /* + * Upon completion a compaction task removes compacted run + * files unless they are needed for recovery from the last + * checkpoint. If we start compaction while checkpointing + * is in progress we might compact a run that belongs to + * the new, to be created, checkpoint. To avoid that we + * lock out compaction while checkpointing is in progress. + */ + if (scheduler->checkpoint_in_progress) + goto no_task; struct vy_lsm *lsm = vy_compaction_heap_top(&scheduler->compaction_heap); if (lsm == NULL) goto no_task; /* nothing to do */ diff --git a/test/vinyl/errinj.result b/test/vinyl/errinj.result index e8795143..e05a616a 100644 --- a/test/vinyl/errinj.result +++ b/test/vinyl/errinj.result @@ -966,8 +966,9 @@ box.snapshot() - ok ... -- Create another run file. This will trigger compaction --- as run_count_per_level is set to 1. Due to the error --- injection compaction will finish before snapshot. +-- as run_count_per_level is set to 1. Delay checkpointing +-- completion and check that compaction doesn't remove +-- files that are still needed for backup. _ = s:replace{2} --- ... @@ -981,8 +982,13 @@ c = fiber.channel(1) _ = fiber.create(function() box.snapshot() c:put(true) end) --- ... -while s.index.pk:stat().disk.compaction.count == 0 do fiber.sleep(0.001) end +test_run:wait_cond(function() return s.index.pk:stat().disk.compaction.queue.bytes > 0 end) --- +- true +... +test_run:wait_cond(function() return box.stat.vinyl().scheduler.tasks_inprogress == 0 end) +--- +- true ... errinj.set('ERRINJ_SNAP_COMMIT_DELAY', false) --- @@ -992,6 +998,10 @@ c:get() --- - true ... +test_run:wait_cond(function() return s.index.pk:stat().disk.compaction.count > 0 end) +--- +- true +... -- Check that all files corresponding to the last checkpoint -- are present. files = box.backup.start() diff --git a/test/vinyl/errinj.test.lua b/test/vinyl/errinj.test.lua index 034ed34c..4317ccb8 100644 --- a/test/vinyl/errinj.test.lua +++ b/test/vinyl/errinj.test.lua @@ -346,15 +346,18 @@ _ = s:create_index('pk', {run_count_per_level = 1}) _ = s:replace{1} box.snapshot() -- Create another run file. This will trigger compaction --- as run_count_per_level is set to 1. Due to the error --- injection compaction will finish before snapshot. +-- as run_count_per_level is set to 1. Delay checkpointing +-- completion and check that compaction doesn't remove +-- files that are still needed for backup. _ = s:replace{2} errinj.set('ERRINJ_SNAP_COMMIT_DELAY', true) c = fiber.channel(1) _ = fiber.create(function() box.snapshot() c:put(true) end) -while s.index.pk:stat().disk.compaction.count == 0 do fiber.sleep(0.001) end +test_run:wait_cond(function() return s.index.pk:stat().disk.compaction.queue.bytes > 0 end) +test_run:wait_cond(function() return box.stat.vinyl().scheduler.tasks_inprogress == 0 end) errinj.set('ERRINJ_SNAP_COMMIT_DELAY', false) c:get() +test_run:wait_cond(function() return s.index.pk:stat().disk.compaction.count > 0 end) -- Check that all files corresponding to the last checkpoint -- are present. files = box.backup.start() -- 2.11.0
next prev parent reply other threads:[~2019-05-17 14:52 UTC|newest] Thread overview: 40+ messages / expand[flat|nested] mbox.gz Atom feed top 2019-05-17 14:52 [PATCH 00/10] vinyl: don't yield in DDL on_commit triggers Vladimir Davydov 2019-05-17 14:52 ` [PATCH 01/10] box: zap atfork callback Vladimir Davydov 2019-05-18 18:37 ` [tarantool-patches] " Konstantin Osipov 2019-05-20 8:13 ` Vladimir Davydov 2019-06-01 8:16 ` Konstantin Osipov 2019-06-06 10:04 ` Vladimir Davydov 2019-05-17 14:52 ` [PATCH 02/10] vinyl: add a separate thread for vylog Vladimir Davydov 2019-05-18 18:39 ` [tarantool-patches] " Konstantin Osipov 2019-05-20 8:17 ` Vladimir Davydov 2019-06-01 8:26 ` Konstantin Osipov 2019-06-06 10:20 ` Vladimir Davydov 2019-05-17 14:52 ` [PATCH 03/10] vinyl: move vylog recovery to vylog thread Vladimir Davydov 2019-06-01 8:36 ` [tarantool-patches] " Konstantin Osipov 2019-06-06 10:23 ` Vladimir Davydov 2019-06-07 13:39 ` Konstantin Osipov 2019-06-10 15:24 ` Vladimir Davydov 2019-06-07 13:40 ` Konstantin Osipov 2019-05-17 14:52 ` [PATCH 04/10] vinyl: rework vylog transaction backlog implementation Vladimir Davydov 2019-06-01 8:38 ` [tarantool-patches] " Konstantin Osipov 2019-06-06 11:58 ` Vladimir Davydov 2019-05-17 14:52 ` [PATCH 05/10] vinyl: don't purge deleted runs from vylog on compaction Vladimir Davydov 2019-05-18 18:47 ` [tarantool-patches] " Konstantin Osipov 2019-05-20 8:27 ` Vladimir Davydov 2019-06-01 8:39 ` Konstantin Osipov 2019-06-06 12:40 ` Vladimir Davydov 2019-05-17 14:52 ` Vladimir Davydov [this message] 2019-05-17 14:52 ` [PATCH 07/10] vinyl: don't access last vylog signature outside vylog implementation Vladimir Davydov 2019-05-17 14:52 ` [PATCH 08/10] vinyl: zap ERRINJ_VY_LOG_FLUSH_DELAY Vladimir Davydov 2019-05-17 14:52 ` [PATCH 09/10] key_def: pass alloc callback to key_def_dump_parts Vladimir Davydov 2019-05-18 18:52 ` [tarantool-patches] " Konstantin Osipov 2019-05-20 8:34 ` Vladimir Davydov 2019-06-01 8:41 ` Konstantin Osipov 2019-06-10 15:28 ` Vladimir Davydov 2019-06-16 14:57 ` Konstantin Osipov 2019-05-17 14:52 ` [PATCH 10/10] vinyl: get rid of the latch protecting vylog buffer Vladimir Davydov 2019-06-01 8:44 ` [tarantool-patches] " Konstantin Osipov 2019-06-06 13:15 ` Vladimir Davydov 2019-05-18 18:35 ` [tarantool-patches] Re: [PATCH 00/10] vinyl: don't yield in DDL on_commit triggers Konstantin Osipov 2019-05-20 8:09 ` Vladimir Davydov 2019-06-01 8:09 ` Konstantin Osipov
Reply instructions: You may reply publicly to this message via plain-text email using any one of the following methods: * Save the following mbox file, import it into your mail client, and reply-to-all from there: mbox Avoid top-posting and favor interleaved quoting: https://en.wikipedia.org/wiki/Posting_style#Interleaved_style * Reply using the --to, --cc, and --in-reply-to switches of git-send-email(1): git send-email \ --in-reply-to=d606800d53f6476cb4c1fcaec6ea002afeadd83d.1558103547.git.vdavydov.dev@gmail.com \ --to=vdavydov.dev@gmail.com \ --cc=tarantool-patches@freelists.org \ --subject='Re: [PATCH 06/10] vinyl: lock out compaction while checkpointing is in progress' \ /path/to/YOUR_REPLY https://kernel.org/pub/software/scm/git/docs/git-send-email.html * If your mail client supports setting the In-Reply-To header via mailto: links, try the mailto: link
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox