From: Serge Petrenko via Tarantool-patches <tarantool-patches@dev.tarantool.org>
To: Cyrill Gorcunov <gorcunov@gmail.com>,
tml <tarantool-patches@dev.tarantool.org>
Cc: Vladislav Shpilevoy <v.shpilevoy@tarantool.org>
Subject: Re: [Tarantool-patches] [PATCH v8 2/2] relay: provide information about downstream lag
Date: Tue, 15 Jun 2021 13:03:59 +0300 [thread overview]
Message-ID: <73be2a27-dacf-c750-7bb6-da777b9c2f5a@tarantool.org> (raw)
In-Reply-To: <20210607155519.109626-3-gorcunov@gmail.com>
07.06.2021 18:55, Cyrill Gorcunov пишет:
> We already have `box.replication.upstream.lag` entry for monitoring
> sake. Same time in synchronous replication timeouts are key properties
> for quorum gathering procedure. Thus we would like to know how long
> it took of a transaction to traverse `initiator WAL -> network ->
> remote applier -> ACK` path.
Hi! Thanks for the patch! Please, find a couple of comments below.
>
> Typical output is
>
> | tarantool> box.info.replication[2].downstream
> | ---
> | - status: follow
> | idle: 0.61753897101153
> | vclock: {1: 147}
> | lag: 0
> | ...
> | tarantool> box.space.sync:insert{69}
> | ---
> | - [69]
> | ...
> |
> | tarantool> box.info.replication[2].downstream
> | ---
> | - status: follow
> | idle: 0.75324084801832
> | vclock: {1: 151}
> | lag: 0.0011014938354492
> | ...
>
> Closes #5447
>
> Signed-off-by: Cyrill Gorcunov <gorcunov@gmail.com>
>
> @TarantoolBot document
> Title: Add `box.info.replication[n].downstream.lag` entry
>
> `replication[n].downstream.lag` is the time difference between
> last transactions been written to the WAL journal of the transaction
> initiator and the transaction written to WAL on the `n` replica.
>
> In other words this is a lag in seconds between the main node writes
> data to own WAL and the replica `n` get this data replicated to own
> WAL journal.
This is not true. You describe `upstream.lag` in this paragraph.
Downstream lag is the time difference between the WAL write on master
side and the receipt of an ack (confirmation of a WAL write on replica)
for this transaction. Also on master side.
>
> In case if a transaction failed to replicate the lag value won't be
> modified because only successfully applied transactions are accounted.
> Same time if the main node or a repllica get restarted the lag value
> will be zero until next success transaction.
> ---
> .../unreleased/gh-5447-downstream-lag.md | 6 ++
> src/box/lua/info.c | 3 +
> src/box/relay.cc | 50 ++++++++++
> src/box/relay.h | 6 ++
> .../replication/gh-5447-downstream-lag.result | 93 +++++++++++++++++++
> .../gh-5447-downstream-lag.test.lua | 41 ++++++++
> 6 files changed, 199 insertions(+)
> create mode 100644 changelogs/unreleased/gh-5447-downstream-lag.md
> create mode 100644 test/replication/gh-5447-downstream-lag.result
> create mode 100644 test/replication/gh-5447-downstream-lag.test.lua
>
> diff --git a/changelogs/unreleased/gh-5447-downstream-lag.md b/changelogs/unreleased/gh-5447-downstream-lag.md
> new file mode 100644
> index 000000000..726175c6c
> --- /dev/null
> +++ b/changelogs/unreleased/gh-5447-downstream-lag.md
> @@ -0,0 +1,6 @@
> +#feature/replication
> +
> + * Introduced `box.info.replication[n].downstream.lag` field to monitor
> + state of replication. This member represents time spent between
> + transaction been written to initiator's WAL file and reached WAL
> + file of a replica (gh-5447).
Same here.
--
Serge Petrenko
next prev parent reply other threads:[~2021-06-15 10:04 UTC|newest]
Thread overview: 17+ messages / expand[flat|nested] mbox.gz Atom feed top
2021-06-07 15:55 [Tarantool-patches] [PATCH v8 0/2] relay: provide downstream lag information Cyrill Gorcunov via Tarantool-patches
2021-06-07 15:55 ` [Tarantool-patches] [PATCH v8 1/2] applier: send transaction's first row WAL time in the applier_writer_f Cyrill Gorcunov via Tarantool-patches
2021-06-07 19:20 ` Vladislav Shpilevoy via Tarantool-patches
2021-06-15 9:36 ` Serge Petrenko via Tarantool-patches
2021-06-16 13:32 ` Cyrill Gorcunov via Tarantool-patches
2021-06-17 9:16 ` Serge Petrenko via Tarantool-patches
2021-06-07 15:55 ` [Tarantool-patches] [PATCH v8 2/2] relay: provide information about downstream lag Cyrill Gorcunov via Tarantool-patches
2021-06-07 19:21 ` Vladislav Shpilevoy via Tarantool-patches
2021-06-08 8:40 ` Cyrill Gorcunov via Tarantool-patches
2021-06-08 9:58 ` Cyrill Gorcunov via Tarantool-patches
2021-06-08 18:15 ` Vladislav Shpilevoy via Tarantool-patches
2021-06-15 10:03 ` Serge Petrenko via Tarantool-patches [this message]
2021-06-17 6:55 ` Cyrill Gorcunov via Tarantool-patches
2021-06-17 9:01 ` Serge Petrenko via Tarantool-patches
2021-06-17 9:58 ` Cyrill Gorcunov via Tarantool-patches
2021-06-07 19:20 ` [Tarantool-patches] [PATCH v8 0/2] relay: provide downstream lag information Vladislav Shpilevoy via Tarantool-patches
2021-06-07 20:00 ` Cyrill Gorcunov via Tarantool-patches
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=73be2a27-dacf-c750-7bb6-da777b9c2f5a@tarantool.org \
--to=tarantool-patches@dev.tarantool.org \
--cc=gorcunov@gmail.com \
--cc=sergepetrenko@tarantool.org \
--cc=v.shpilevoy@tarantool.org \
--subject='Re: [Tarantool-patches] [PATCH v8 2/2] relay: provide information about downstream lag' \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox