[Tarantool-patches] [PATCH v3 3/4] crash: use errstat code in fatal signals
Cyrill Gorcunov
gorcunov at gmail.com
Sat Dec 5 23:04:11 MSK 2020
On Sat, Dec 05, 2020 at 07:33:49PM +0100, Vladislav Shpilevoy wrote:
> Thanks for the patch!
>
> See 2 comments below.
>
> On 04.12.2020 16:30, Cyrill Gorcunov via Tarantool-patches wrote:
> > In errstat code we fetch the signal statistic and
> > generate a backtrace for report. We don't send this
> > data right now but can reuse this code to not decode
> > registers and generate backtrace twice.
> >
> > Part-of #5261
> >
> > Signed-off-by: Cyrill Gorcunov <gorcunov at gmail.com>
> > ---
> > src/main.cc | 83 ++++++++++++++++++++++++++++++-----------------------
> > 1 file changed, 47 insertions(+), 36 deletions(-)
> >
> > diff --git a/src/main.cc b/src/main.cc
> > index 2f48f474c..260b9a0ff 100644
> > --- a/src/main.cc
> > +++ b/src/main.cc> @@ -253,6 +253,10 @@ sig_fatal_cb(int signo, siginfo_t *siginfo, void *context)
> > }
> >
> > in_cb = 1;
> > + /*
> > + * Notify errstat engine about the crash.
> > + */
> > + errstat_collect_crash(signo, siginfo, context);
>
> 1. So I probably see now. You have collect and send separated,
> because there is also printing below. In that case I can propose
> to make it more explicit somehow. Because when I firstly looked
> at that, it took quite some time to realize, that 'cinfo' variable
> you obtain above it filled inside errstat_collect_crash. It just is
> not passed to there.
>
> For example, you can delete errstat_get(), and make
> collect_crash() return struct errstat_crash*, which you will use
> below for the printing.
>
> Another option - move entire signal handler to your module, and then
> access any internals of it. This option looks the best for me. Having
> platform-dependent code both in the new module and in main.cc doing
> basically the same but for stderr vs network looks strange.
Thanks! I'll try to implement. Looks reasonable to me.
> > fdprintf(fd, "Current time: %u\n", (unsigned) time(0));
> > @@ -290,8 +294,14 @@ sig_fatal_cb(int signo, siginfo_t *siginfo, void *context)
> > #ifdef ENABLE_BACKTRACE
> > fdprintf(fd, "Attempting backtrace... Note: since the server has "
> > "already crashed, \nthis may fail as well\n");
> > - print_backtrace();
> > + fdprintf(STDERR_FILENO, "%s", cinfo->backtrace_buf);
> > #endif
> > + /*
> > + * If sending crash report to the feedback server is
> > + * allowed we won't be generating local core dump but
> > + * rather try to send data and exit.
>
> 2. But why?
>
> If as a dump you mean the text printed above, it is still printed.
> If you mean the dump with the data, your module does not send it,
> and it is still dumped locally, below.
Actually this comment left from previous version which I didn't send.
There I was simply using execve, instead of vfork+execve. Thus once
execve passed no more local abort() call happened. Thanks for pointing,
need to drop this comment.
> Btw, having errstat_exec_send_crash() with void args makes existence
> of errstat_get() even stranger.
Indeed. I'll think. Thanks for all your comments!
More information about the Tarantool-patches
mailing list