From: Sergey Petrenko <sergepetrenko@tarantool.org>
To: Vladimir Davydov <vdavydov.dev@gmail.com>
Cc: Kirill Yukhin <kyukhin@tarantool.org>, tarantool-patches@freelists.org
Subject: Re: [tarantool-patches] [PATCH] replication: fix a failing assert in replica_on_applier_disconnect()
Date: Wed, 8 Aug 2018 13:10:30 +0300 [thread overview]
Message-ID: <51F21380-A795-4F40-B3DE-A9770B190BD6@tarantool.org> (raw)
In-Reply-To: <20180807165031.e2avj7waurok3zpk@esperanza>
> 7 авг. 2018 г., в 19:50, Vladimir Davydov <vdavydov.dev@gmail.com> написал(а):
>
> On Mon, Aug 06, 2018 at 05:14:05PM +0300, Sergey Petrenko wrote:
>> diff --git a/test/replication/misc.test.lua b/test/replication/misc.test.lua
>> index c05e52165..27c1a4821 100644
>> --- a/test/replication/misc.test.lua
>> +++ b/test/replication/misc.test.lua
>> @@ -81,3 +81,15 @@ test_run:cmd("switch default")
>> test_run:drop_cluster(SERVERS)
>>
>> box.schema.user.revoke('guest', 'replication')
>> +
>> +-- gh-3510 assertion failure in replica_on_applier_disconnect()
>> +test_run:cmd('create server er_load1 with script="replication/er_load1.lua"')
>> +test_run:cmd('create server er_load2 with script="replication/er_load2.lua"')
>> +test_run:cmd('start server er_load1 with wait=False, wait_load=False')
>> +test_run:cmd('start server er_load2 with wait=False, wait_load=False')
>> +require('fiber').sleep(0.5)
>> +test_run:cmd('stop server er_load1')
>> +require('fiber').sleep(1)
>
> <kostja-mode>
> These sleep()'s are way too long. They will make our tests run forever.
> Please change to 0.00001 or smaller or rewrite the test so that you do
> not need them.
> </kostja-mode>
Hi! Thank you for review.
Fixed. Here’s the new diff:
src/box/replication.cc | 4 ++++
test/replication/er_load.lua | 25 +++++++++++++++++++++++++
test/replication/er_load1.lua | 1 +
test/replication/er_load2.lua | 1 +
test/replication/misc.result | 32 ++++++++++++++++++++++++++++++++
test/replication/misc.test.lua | 12 ++++++++++++
6 files changed, 75 insertions(+)
create mode 100644 test/replication/er_load.lua
create mode 120000 test/replication/er_load1.lua
create mode 120000 test/replication/er_load2.lua
diff --git a/src/box/replication.cc b/src/box/replication.cc
index 26bbbe32a..0efbd7c0e 100644
--- a/src/box/replication.cc
+++ b/src/box/replication.cc
@@ -350,6 +350,10 @@ replica_on_applier_disconnect(struct replica *replica)
assert(replicaset.applier.connected > 0);
replicaset.applier.connected--;
break;
+ case APPLIER_LOADING:
+ assert(replicaset.applier.loading > 0);
+ replicaset.applier.loading--;
+ break;
case APPLIER_DISCONNECTED:
break;
default:
diff --git a/test/replication/er_load.lua b/test/replication/er_load.lua
new file mode 100644
index 000000000..0515b3cce
--- /dev/null
+++ b/test/replication/er_load.lua
@@ -0,0 +1,25 @@
+#!/usr/bin/env tarantool
+
+-- get instance id from filename (er_load1.lua => 1)
+local INSTANCE_ID = string.match(arg[0], '%d')
+
+local SOCKET_DIR = require('fio').cwd()
+local function instance_uri(instance_id)
+ return SOCKET_DIR..'/er_load'..instance_id..'.sock'
+end
+
+require('console').listen(os.getenv('ADMIN'))
+
+box.cfg{
+ listen = instance_uri(INSTANCE_ID);
+ replication = {
+ instance_uri(INSTANCE_ID),
+ instance_uri(INSTANCE_ID % 2 + 1)
+ },
+ replication_timeout = 0.01,
+ read_only = INSTANCE_ID == '2'
+}
+box.once('bootstrap', function()
+ box.schema.user.grant('guest', 'replication')
+ box.space._cluster:delete(2)
+end)
diff --git a/test/replication/er_load1.lua b/test/replication/er_load1.lua
new file mode 120000
index 000000000..18f7ffa5a
--- /dev/null
+++ b/test/replication/er_load1.lua
@@ -0,0 +1 @@
+er_load.lua
\ No newline at end of file
diff --git a/test/replication/er_load2.lua b/test/replication/er_load2.lua
new file mode 120000
index 000000000..18f7ffa5a
--- /dev/null
+++ b/test/replication/er_load2.lua
@@ -0,0 +1 @@
+er_load.lua
\ No newline at end of file
diff --git a/test/replication/misc.result b/test/replication/misc.result
index ff0dbf549..75316933f 100644
--- a/test/replication/misc.result
+++ b/test/replication/misc.result
@@ -208,3 +208,35 @@ test_run:drop_cluster(SERVERS)
box.schema.user.revoke('guest', 'replication')
---
...
+-- gh-3510 assertion failure in replica_on_applier_disconnect()
+test_run:cmd('create server er_load1 with script="replication/er_load1.lua"')
+---
+- true
+...
+test_run:cmd('create server er_load2 with script="replication/er_load2.lua"')
+---
+- true
+...
+test_run:cmd('start server er_load1 with wait=False, wait_load=False')
+---
+- true
+...
+-- instance er_load2 will fail with error ER_READONLY. this is ok.
+-- We only test here that er_load1 doesn't assert.
+test_run:cmd('start server er_load2 with wait=True, wait_load=True, crash_expected = True')
+---
+- false
+...
+test_run:cmd('stop server er_load1')
+---
+- true
+...
+-- er_load2 exits automatically.
+test_run:cmd('cleanup server er_load1')
+---
+- true
+...
+test_run:cmd('cleanup server er_load2')
+---
+- true
+...
diff --git a/test/replication/misc.test.lua b/test/replication/misc.test.lua
index c05e52165..acdfa737e 100644
--- a/test/replication/misc.test.lua
+++ b/test/replication/misc.test.lua
@@ -81,3 +81,15 @@ test_run:cmd("switch default")
test_run:drop_cluster(SERVERS)
box.schema.user.revoke('guest', 'replication')
+
+-- gh-3510 assertion failure in replica_on_applier_disconnect()
+test_run:cmd('create server er_load1 with script="replication/er_load1.lua"')
+test_run:cmd('create server er_load2 with script="replication/er_load2.lua"')
+test_run:cmd('start server er_load1 with wait=False, wait_load=False')
+-- instance er_load2 will fail with error ER_READONLY. this is ok.
+-- We only test here that er_load1 doesn't assert.
+test_run:cmd('start server er_load2 with wait=True, wait_load=True, crash_expected = True')
+test_run:cmd('stop server er_load1')
+-- er_load2 exits automatically.
+test_run:cmd('cleanup server er_load1')
+test_run:cmd('cleanup server er_load2')
--
2.15.2 (Apple Git-101.1)
next prev parent reply other threads:[~2018-08-08 10:10 UTC|newest]
Thread overview: 8+ messages / expand[flat|nested] mbox.gz Atom feed top
2018-08-03 5:59 Serge Petrenko
2018-08-03 15:43 ` Vladimir Davydov
2018-08-03 15:57 ` [tarantool-patches] " Kirill Yukhin
2018-08-06 14:14 ` Sergey Petrenko
2018-08-07 16:50 ` Vladimir Davydov
2018-08-08 10:10 ` Sergey Petrenko [this message]
2018-08-08 10:58 ` Vladimir Davydov
2018-08-08 15:19 ` Sergey Petrenko
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=51F21380-A795-4F40-B3DE-A9770B190BD6@tarantool.org \
--to=sergepetrenko@tarantool.org \
--cc=kyukhin@tarantool.org \
--cc=tarantool-patches@freelists.org \
--cc=vdavydov.dev@gmail.com \
--subject='Re: [tarantool-patches] [PATCH] replication: fix a failing assert in replica_on_applier_disconnect()' \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox