[tarantool-patches] Re: [PATCH v2 2/2] say: take getaddrinfo() errors into account
Roman Khabibov
roman.habibov at tarantool.org
Tue Sep 10 15:52:00 MSK 2019
> On Sep 6, 2019, at 16:44, Alexander Turenko <alexander.turenko at tarantool.org> wrote:
>
>>>> +--
>>>> +-- gh-4138: check getaddrinfo() error and panic after that.
>>>> +--
>>>> +code=[[
>>>> +local socket = require('socket')
>>>> +local log = require('log')
>>>> +local fio = require('fio')
>>>> +
>>>> +path = fio.pathjoin(fio.cwd(), 'log_unix_socket_test.sock')
>>>> +unix_socket = socket('AF_UNIX', 'SOCK_DGRAM', 0)
>>>> +unix_socket:bind('unix/', path)
>>>> +
>>>> +opt = string.format("syslog:server=non_exists_hostname:%s,identity=tarantool", path)
>>>> +box.cfg{log = opt, log_nonblock=true}
>>>
>>> log_nonblock is not needed here, so it is better to remove it.
>> Removed.
>
> Still no.
>
>>> box.cfg{log = 'syslog:server=non_exists_hostname:3301'} is enough, not need to
>>> form a file path, no need identity, no need requiring socket, log and fio.
>>>
>>> The test passes even before the patch, so what it is intended to test? I
>>> think we should write a test that verifies stderr output to find all log
>>> messages we expect to appear in the case:
>>>
>>> Linux:
>>>
>>> | SystemError getaddrinfo: Temporary failure in name resolution: Input/output error
>>> | SystemError syslog logger: Input/output error: Input/output error
>>> | failed to initialize logging subsystem
>>>
>>> gai_strerror() message corresponds to EAI_AGAIN.
>>>
>>> Mac OS:
>>>
>>> | SystemError getaddrinfo: nodename nor servname provided, or not known: Input/output error
>>> | SystemError syslog logger: Input/output error: Input/output error
>>> | failed to initialize logging subsystem
>>>
>>> gai_strerror() message corresponds to EAI_NONAME.
>>>
>>> I propose to call ffi.C.gai_strerror() right from a test to form two
>>> error messages and verify that the actual input match one of them.
>>>
>>> If it is hard to catch stderr, then let's proceed w/o this test. However
>>> I think it is doable.
>>>
>>> I also propose to test error messages in the similar way (using
>>> ffi.C.gai_strerror(GAI_AGAIN) and ffi.C.gai_strerror(GAI_NONAME)) in
>>> test cases in second patch of the patchset.
>> We discussed that with Vova and we decided, that it is too difficult.
>> It is better, to waste not time for this test. Test for PANIC is enough.
>
> I don't see any reason to add a test case that checks that tarantool
> exits in the case: it was so before the commit and nothing is changed
> except the error message. I propose to delete the test case.
Ok. Removed.
>>>> + /* gh-4138: Check getaddrinfo() error. */
>>>> + isnt(coio_getaddrinfo("non_exists_hostname", port, NULL, &i, 1), 0,
>>>> + "getaddrinfo error");
>>>
>>> I would say 'getaddrinfo retval' instead 'getaddrinfo error'.
>
> I don't insist, but remind about that here if it was missed by a
> mistake.
+ /* gh-4138: Check getaddrinfo() retval. */
+ rc = coio_getaddrinfo("non_exists_hostname", port, NULL, &i, 1);
+ isnt(rc, 0, "getaddrinfo retval");
+ isnt(strstr(diag_get()->last->errmsg, "getaddrinfo"), NULL,
+ "getaddrinfo error message");
+
>>>> + isnt(strstr(diag_get()->last->errmsg, "getaddrinfo"), NULL,
>>>> + "getaddrinfo error message");
>>>> +
>>>
>>> I propose to verify the entire error message using
>>> gai_strerror(GAI_AGAIN) and gai_strerror(GAI_NONAME)—just as proposed
>>> above for a log message.
>> I have not found the way, how to carry this macros/enum from the libc header to Lua.
>
> It is possible to add them into socket.c, but anyway it is not worth to
> do this just for a test. Let's check for certain error messages (not for
> just 'getaddrinfo: ' prefix).
commit 3e973c2203131ab663bdf1154be73ff6592568ea
Author: Roman Khabibov <roman.habibov at tarantool.org>
Date: Tue Jul 30 15:39:21 2019 +0300
coiox/say: fix getaddrinfo error handling on macOS
Before this patch, branch when getaddrinfo() returns error codes
couldn't be reached on macOS, because they are greater than 0 on
macOS (assumption "rc < 0" in commit ea1da04 is incorrect for
macOS).
Note: diag_log() in say.c was added, because otherwise it will be
hid by the following diagnostic and then say that it should
be handler in a better way after #1148. Also, two diag_set() in
syslog_connect_unix() was added to avoid asserts in this
diag_log().
Need for #4138
diff --git a/src/lib/core/coio_task.c b/src/lib/core/coio_task.c
index 908b336ed..83f669d05 100644
--- a/src/lib/core/coio_task.c
+++ b/src/lib/core/coio_task.c
@@ -413,7 +413,7 @@ coio_getaddrinfo(const char *host, const char *port,
return -1; /* timed out or cancelled */
/* Task finished */
- if (task->rc < 0) {
+ if (task->rc != 0) {
/* getaddrinfo() failed */
errno = EIO;
diag_set(SystemError, "getaddrinfo: %s",
diff --git a/src/lib/core/say.c b/src/lib/core/say.c
index 64a637c58..8ad88ad57 100644
--- a/src/lib/core/say.c
+++ b/src/lib/core/say.c
@@ -459,14 +459,17 @@ static inline int
syslog_connect_unix(const char *path)
{
int fd = socket(PF_UNIX, SOCK_DGRAM, 0);
- if (fd < 0)
+ if (fd < 0) {
+ diag_set(SystemError, "socket");
return -1;
+ }
struct sockaddr_un un;
memset(&un, 0, sizeof(un));
snprintf(un.sun_path, sizeof(un.sun_path), "%s", path);
un.sun_family = AF_UNIX;
if (connect(fd, (struct sockaddr *) &un, sizeof(un)) != 0) {
close(fd);
+ diag_set(SystemError, "connect");
return -1;
}
return fd;
@@ -506,7 +509,7 @@ syslog_connect_remote(const char *server_address)
hints.ai_protocol = IPPROTO_UDP;
ret = getaddrinfo(remote, portnum, &hints, &inf);
- if (ret < 0) {
+ if (ret != 0) {
errno = EIO;
diag_set(SystemError, "getaddrinfo: %s",
gai_strerror(ret));
@@ -593,6 +596,11 @@ log_syslog_init(struct log *log, const char *init_str)
say_free_syslog_opts(&opts);
log->fd = log_syslog_connect(log);
if (log->fd < 0) {
+ /*
+ * We need to log a diagnostics here until stacked
+ * diagnostics will be implemented (#1148).
+ */
+ diag_log();
/* syslog indent is freed in atexit(). */
diag_set(SystemError, "syslog logger: %s", strerror(errno));
return -1;
diff --git a/test/unit/coio.cc b/test/unit/coio.cc
index bb8bd7131..3758c8857 100644
--- a/test/unit/coio.cc
+++ b/test/unit/coio.cc
@@ -72,7 +72,7 @@ static void
test_getaddrinfo(void)
{
header();
- plan(1);
+ plan(3);
const char *host = "127.0.0.1";
const char *port = "3333";
struct addrinfo *i;
@@ -81,6 +81,12 @@ test_getaddrinfo(void)
is(rc, 0, "getaddrinfo");
freeaddrinfo(i);
+ /* gh-4138: Check getaddrinfo() retval. */
+ rc = coio_getaddrinfo("non_exists_hostname", port, NULL, &i, 1);
+ isnt(rc, 0, "getaddrinfo retval");
+ isnt(strstr(diag_get()->last->errmsg, "getaddrinfo"), NULL,
+ "getaddrinfo error message");
+
/*
* gh-4209: 0 timeout should not be a special value and
* detach a task. Before a fix it led to segfault
diff --git a/test/unit/coio.result b/test/unit/coio.result
index 5019fa48a..90b567140 100644
--- a/test/unit/coio.result
+++ b/test/unit/coio.result
@@ -7,6 +7,8 @@
# call done with res 0
*** test_call_f: done ***
*** test_getaddrinfo ***
-1..1
+1..3
ok 1 - getaddrinfo
+ok 2 - getaddrinfo retval
+ok 3 - getaddrinfo error message
*** test_getaddrinfo: done ***
More information about the Tarantool-patches
mailing list