From: Vladimir Davydov <vdavydov.dev@gmail.com> To: tarantool-patches@freelists.org Subject: Re: [PATCH 3/9] vinyl: use uncompressed run size for range split/coalesce/compaction Date: Mon, 21 Jan 2019 12:42:13 +0300 [thread overview] Message-ID: <20190121094213.yzfawcs7mgkqxi3e@esperanza> (raw) In-Reply-To: <fff9b3eddaeeb73ae19c2ff277b1ec49052bc030.1548017258.git.vdavydov.dev@gmail.com> On Mon, Jan 21, 2019 at 12:17:02AM +0300, Vladimir Davydov wrote: > Historically, when considering splitting or coalescing a range or > updating compaction priority, we use sizes of compressed runs (see > bytes_compressed). This makes the algorithms dependent on whether > compression is used or not and how effective it is, which is weird, > because compression is a way of storing data on disk - it shouldn't > affect the way data is partitioned. E.g. if we turned off compression > at the first LSM tree level, which would make sense, because it's > relatively small, we would affect the compaction algorithm because > of this. > > That said, let's use uncompressed run sizes when considering range > tree transformations. This results in occasional failures of vinyl/deferred_delete.test.lua. I amended the patch on the branch to fix this. Here's the diff: diff --git a/test/vinyl/deferred_delete.result b/test/vinyl/deferred_delete.result index 29945f8d..61f81ce2 100644 --- a/test/vinyl/deferred_delete.result +++ b/test/vinyl/deferred_delete.result @@ -668,16 +668,13 @@ test_run:cmd("switch test") fiber = require('fiber') --- ... -digest = require('digest') ---- -... s = box.schema.space.create('test', {engine = 'vinyl'}) --- ... -pk = s:create_index('pk', {run_count_per_level = 10}) +pk = s:create_index('pk', {run_count_per_level = 10, run_size_ratio = 2}) --- ... -sk = s:create_index('sk', {run_count_per_level = 10, parts = {2, 'unsigned', 3, 'string'}, unique = false}) +sk = s:create_index('sk', {run_count_per_level = 10, run_size_ratio = 2, parts = {2, 'unsigned', 3, 'string'}, unique = false}) --- ... -- Write a run big enough to prevent major compaction from kicking in @@ -685,13 +682,25 @@ sk = s:create_index('sk', {run_count_per_level = 10, parts = {2, 'unsigned', 3, dummy_rows = 100 --- ... -for i = 1, dummy_rows do s:replace{i + 1000, i + 1000, digest.urandom(100)} end +pad = string.rep('z', 50 * 1024) +--- +... +for i = 1, dummy_rows do s:replace{i + 1000, i + 1000, pad} end --- ... box.snapshot() --- - ok ... +pk:compact() +--- +... +sk:compact() +--- +... +while box.stat.vinyl().scheduler.compaction_queue > 0 do fiber.sleep(0.001) end +--- +... pad = string.rep('x', 10 * 1024) --- ... diff --git a/test/vinyl/deferred_delete.test.lua b/test/vinyl/deferred_delete.test.lua index d38802da..93b5b358 100644 --- a/test/vinyl/deferred_delete.test.lua +++ b/test/vinyl/deferred_delete.test.lua @@ -252,17 +252,20 @@ test_run:cmd("start server test with args='1048576'") test_run:cmd("switch test") fiber = require('fiber') -digest = require('digest') s = box.schema.space.create('test', {engine = 'vinyl'}) -pk = s:create_index('pk', {run_count_per_level = 10}) -sk = s:create_index('sk', {run_count_per_level = 10, parts = {2, 'unsigned', 3, 'string'}, unique = false}) +pk = s:create_index('pk', {run_count_per_level = 10, run_size_ratio = 2}) +sk = s:create_index('sk', {run_count_per_level = 10, run_size_ratio = 2, parts = {2, 'unsigned', 3, 'string'}, unique = false}) -- Write a run big enough to prevent major compaction from kicking in -- (run_count_per_level is ignored on the last level - see gh-3657). dummy_rows = 100 -for i = 1, dummy_rows do s:replace{i + 1000, i + 1000, digest.urandom(100)} end +pad = string.rep('z', 50 * 1024) +for i = 1, dummy_rows do s:replace{i + 1000, i + 1000, pad} end box.snapshot() +pk:compact() +sk:compact() +while box.stat.vinyl().scheduler.compaction_queue > 0 do fiber.sleep(0.001) end pad = string.rep('x', 10 * 1024) for i = 1, 120 do s:replace{i, i, pad} end
next prev parent reply other threads:[~2019-01-21 9:42 UTC|newest] Thread overview: 39+ messages / expand[flat|nested] mbox.gz Atom feed top 2019-01-20 21:16 [PATCH 0/9] vinyl: compaction randomization and throttling Vladimir Davydov 2019-01-20 21:17 ` [PATCH 1/9] vinyl: update lsm->range_heap in one go on dump completion Vladimir Davydov 2019-01-24 16:55 ` Vladimir Davydov 2019-02-05 16:37 ` [tarantool-patches] " Konstantin Osipov 2019-01-20 21:17 ` [PATCH 2/9] vinyl: ignore unknown .run, .index and .vylog keys Vladimir Davydov 2019-01-24 16:56 ` Vladimir Davydov 2019-01-20 21:17 ` [PATCH 3/9] vinyl: use uncompressed run size for range split/coalesce/compaction Vladimir Davydov 2019-01-21 9:42 ` Vladimir Davydov [this message] 2019-02-05 16:49 ` [tarantool-patches] " Konstantin Osipov 2019-02-06 8:55 ` Vladimir Davydov 2019-02-06 10:46 ` Konstantin Osipov 2019-02-06 10:55 ` Vladimir Davydov 2019-02-05 16:43 ` Konstantin Osipov 2019-02-06 16:48 ` Vladimir Davydov 2019-01-20 21:17 ` [PATCH 4/9] vinyl: rename lsm->range_heap to max_compaction_priority Vladimir Davydov 2019-01-20 21:17 ` [PATCH 5/9] vinyl: keep track of dumps per compaction for each LSM tree Vladimir Davydov 2019-02-05 16:58 ` [tarantool-patches] " Konstantin Osipov 2019-02-06 9:20 ` Vladimir Davydov 2019-02-06 16:54 ` Vladimir Davydov 2019-01-20 21:17 ` [PATCH 6/9] vinyl: set range size automatically Vladimir Davydov 2019-01-22 9:17 ` Vladimir Davydov 2019-02-05 17:09 ` [tarantool-patches] " Konstantin Osipov 2019-02-06 9:23 ` Vladimir Davydov 2019-02-06 17:04 ` Vladimir Davydov 2019-01-20 21:17 ` [PATCH 7/9] vinyl: randomize range compaction to avoid IO load spikes Vladimir Davydov 2019-01-22 12:54 ` Vladimir Davydov 2019-02-05 17:39 ` [tarantool-patches] " Konstantin Osipov 2019-02-06 8:53 ` Vladimir Davydov 2019-02-06 10:44 ` Konstantin Osipov 2019-02-06 10:52 ` Vladimir Davydov 2019-02-06 11:06 ` Konstantin Osipov 2019-02-06 11:49 ` Vladimir Davydov 2019-02-06 13:43 ` Konstantin Osipov 2019-02-06 14:00 ` Vladimir Davydov 2019-02-05 17:14 ` Konstantin Osipov 2019-01-20 21:17 ` [PATCH 8/9] vinyl: introduce quota consumer types Vladimir Davydov 2019-01-20 21:17 ` [PATCH 9/9] vinyl: throttle tx to ensure compaction keeps up with dumps Vladimir Davydov 2019-01-21 14:14 ` Vladimir Davydov 2019-01-22 9:09 ` Vladimir Davydov
Reply instructions: You may reply publicly to this message via plain-text email using any one of the following methods: * Save the following mbox file, import it into your mail client, and reply-to-all from there: mbox Avoid top-posting and favor interleaved quoting: https://en.wikipedia.org/wiki/Posting_style#Interleaved_style * Reply using the --to, --cc, and --in-reply-to switches of git-send-email(1): git send-email \ --in-reply-to=20190121094213.yzfawcs7mgkqxi3e@esperanza \ --to=vdavydov.dev@gmail.com \ --cc=tarantool-patches@freelists.org \ --subject='Re: [PATCH 3/9] vinyl: use uncompressed run size for range split/coalesce/compaction' \ /path/to/YOUR_REPLY https://kernel.org/pub/software/scm/git/docs/git-send-email.html * If your mail client supports setting the In-Reply-To header via mailto: links, try the mailto: link
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox