Tarantool development patches archive
From: Nikita Pettik <korablev@tarantool.org>
To: Leonid Vasiliev <lvasiliev@tarantool.org>
Cc: tarantool-patches@dev.tarantool.org, v.shpilevoy@tarantool.org
Subject: Re: [Tarantool-patches] [PATCH] sql: increase default cache size
Date: Mon, 28 Dec 2020 12:13:06 +0000
Message-ID: <20201228121306.GA9543@tarantool.org>
In-Reply-To: <e66bd011-5c27-2895-acb5-e6f88593d1ab@tarantool.org>

On 28 Dec 11:54, Leonid Vasiliev wrote:
> Hi!
> 
> On 24.12.2020 19:57, Nikita Pettik wrote:
> > On 11 Dec 18:29, Leonid Vasiliev wrote:
> > 
> > IMHO before pushing performance aimed patches, we should carefully
> > review them. Could you run some SQL benchmarks to verify there's
> > no degradation on any case?
> 
> Can you tell me where we have benchmarks for SQL and how to run them?

As far as I know, we have at least the TPC-H and TPC-C benchmarks:
https://github.com/tarantool/tpcc
https://github.com/tarantool/tpch

Please contact the QA team for instructions on how to launch them.
 
> > Also, does this change affect non-SQL
> > users? I mean, is the SQL tmp cache lazily initialized, or does it allocate
> > all 20 MB right at start? For sure, 18 MB is not a lot of memory in 2020,
> > but it may still come as a surprise to some users.
> 
> The memory allocation is done on a per-request basis:
> "The default page cache implementation does not allocate the full amount
> of cache memory all at once. Cache memory is allocated in smaller chunks
> on an as-needed basis."

OK, fine, at least it won't affect users who don't use the SQL subsystem.
So we only need to make sure that the cache increase does not degrade
SQL performance on the standard benchmarks.

> > > On 11.12.2020 15:09, Nikita Pettik wrote:
> > > > On 11 Dec 01:37, Leonid Vasiliev wrote:
> > > > > Increase the maximum number of in-memory pages to use
> > > > > for temporary tables.
> > > > > (https://www.sqlite.org/compile.html#default_cache_size)
> > > > > 
> > > > > Part of #5609
> > > > > ---
> > > > > 
> > > > > > Hi SQL team. This is the simplest part of #5609.
> > > > > This patch increases the default SQL cache size by 10 times to 20 MB.
> > > > > A similar experiment shows a 10% performance increase for some datasets.
> > > > > (https://github.com/tarantool/tarantool/issues/5593#issuecomment-740044007).
> > > > > Mons approved these changes. I haven't done any research to determine the
> > > > > optimal value. So feel free to throw this patch away.
> > > > 
> > > > Hi, why not introduce a separate handler for the user to change this setting?
> > > > Since SQLite is an embedded database, almost all settings are set at
> > > > compile time. In contrast, we can move it to the box.
> > > > 
> > > 
> > > If I understood you correctly, the settings should be implemented when
> > > working on #5609 (this patch doesn't close the task). This is the
> > > simplest improvement: updating the default.
> > > 
> > > > > https://github.com/tarantool/tarantool/issues/5609
> > > > > https://github.com/tarantool/tarantool/tree/lvasiliev/gh-5609-increase-default-sql-sort-cache-size
> > > > > 
> > > > >    src/box/CMakeLists.txt | 4 ++++
> > > > >    1 file changed, 4 insertions(+)
> > > > > 
> > > > > diff --git a/src/box/CMakeLists.txt b/src/box/CMakeLists.txt
> > > > > index 19203f7..7372179 100644
> > > > > --- a/src/box/CMakeLists.txt
> > > > > +++ b/src/box/CMakeLists.txt
> > > > > @@ -218,6 +218,10 @@ if(CMAKE_BUILD_TYPE STREQUAL "Debug")
> > > > >      add_definitions(-DSQL_DEBUG=1)
> > > > >    endif()
> > > > >    add_definitions(-DSQL_TEST=1)
> > > > > +# Set the maximum number of in-memory pages to use for temporary tables.
> > > > > +# 20000 * 1024 = 20480000 bytes.
> > > > > +# (https://www.sqlite.org/compile.html#default_cache_size)
> > > > > +add_definitions(-DSQL_DEFAULT_CACHE_SIZE=-20000)
> > > > >    set(EXT_SRC_DIR ${CMAKE_SOURCE_DIR}/extra)
> > > > >    set(EXT_BIN_DIR ${CMAKE_BINARY_DIR}/extra)
> > > > > -- 
> > > > > 2.7.4
> > > > > 

Thread overview: 9+ messages
2020-12-10 22:37 Leonid Vasiliev
2020-12-11 12:09 ` Nikita Pettik
2020-12-11 15:29   ` Leonid Vasiliev
2020-12-24 16:57     ` Nikita Pettik
2020-12-28  8:54       ` Leonid Vasiliev
2020-12-28 12:13         ` Nikita Pettik [this message]
2020-12-30 13:02           ` Leonid Vasiliev
2020-12-11 14:12 ` Sergey Ostanevich
2020-12-11 15:24   ` Leonid Vasiliev