From: Sergey Kaplun via Tarantool-patches <tarantool-patches@dev.tarantool.org> To: Sergey Bronnikov <estetus@gmail.com> Cc: tarantool-patches@dev.tarantool.org Subject: Re: [Tarantool-patches] [PATCH luajit] Handle all stack layouts in (delayed) TRACE vmevent. Date: Wed, 5 Feb 2025 12:48:17 +0300 [thread overview] Message-ID: <Z6Mz4WyMAk0i90gk@root> (raw) In-Reply-To: <ab0a1de0bf8685c27ae12ffb9352ba754c846cd4.1738143971.git.sergeyb@tarantool.org> Hi, Segey! Thanks for the patch! It is really nice that we covered all branches, nice work! LGTM, after fixing a bunch of nits below. Side note: it is better to send v2 in the separate letter to glance it if necessary. On 29.01.25, Sergey Bronnikov wrote: > From: Mike Pall <mike> > > Thanks to Sergey Bronnikov and Peter Cawley. > > (cherry picked from commit b138ccfa918518a152bc830fef3d53cd0a922e36) > > When recording a trace using a Lua code like > `repeat until `a >= 'b' > 'b'` a Lua error is encountered > (`attempt to compare string with nil`), which (along with > raising the error) causes an asynchronous trace abort. > The trace abort remains pending until the call of > `lua_pcall(L, 2, 0, 0)` that actually calls `jit.attach(trace_cb, nil)`. I would rather say: | ... until the next reentrance to the LuaJIT VM via `lj_vm_call()` or | `lj_vm_pcall(). Also, please add this comment to the C test header to make it clear why do we use additional `lua_pcall()`-s with dummy functions. > > On handling abort LuaJIT is searching for the topmost Lua frame > on the stack, that was active when the trace abort happened, > it is needed to generate a better error message. > Unfortunately, because the abort was due to an error, and > the error was caught by a `lua_pcall` with unspecified Typo: s/unspecified/an unspecified/ > error function (4th argument), the Lua frame that caused the abort > was already removed as part of error processing, so the search > cannot find it. Furthermore, in this particular case, there are > no Lua frames on the stack, which isn't something the search code > had considered possible. > > Sergey Bronnikov: > * added the description and the test for the problem > > Part of tarantool/tarantool#10709 This is 11055 ticket now. > --- > Branch: https://github.com/tarantool/luajit/tree/ligurio/lj-1087-vm-handler-call > > Related issues: > - https://github.com/LuaJIT/LuaJIT/issues/1087 > - https://github.com/tarantool/tarantool/issues/10709 > > src/lj_trace.c | 26 +-- > test/tarantool-c-tests/CMakeLists.txt | 4 + > .../lj-1087-vm-handler-call.test.c | 170 ++++++++++++++++++ > 3 files changed, 190 insertions(+), 10 deletions(-) > create mode 100644 test/tarantool-c-tests/lj-1087-vm-handler-call.test.c > > diff --git a/src/lj_trace.c b/src/lj_trace.c > index 94cb27e5..6b97cc13 100644 > --- a/src/lj_trace.c > +++ b/src/lj_trace.c <snipped> > diff --git a/test/tarantool-c-tests/CMakeLists.txt b/test/tarantool-c-tests/CMakeLists.txt > index c4a402d0..0e7c4698 100644 > --- a/test/tarantool-c-tests/CMakeLists.txt > +++ b/test/tarantool-c-tests/CMakeLists.txt > @@ -36,6 +36,10 @@ add_test_suite_target(tarantool-c-tests > DEPENDS libluajit libtest tarantool-c-tests-build > ) > > +if(NOT LUAJIT_DISABLE_JIT) > + AppendFlags(TESTS_C_FLAGS "-DLJ_HASJIT") > +endif(NOT LUAJIT_DISABLE_JIT) > + It is better just to include <lj_arch.h> in the test instead. See the example of usage in the <misclib-getmetrics-capi.test.c>. Therefore this part is excess. > set(CTEST_SRC_SUFFIX ".test.c") > file(GLOB tests "${CMAKE_CURRENT_SOURCE_DIR}/*${CTEST_SRC_SUFFIX}") > foreach(test_source ${tests}) > diff --git a/test/tarantool-c-tests/lj-1087-vm-handler-call.test.c b/test/tarantool-c-tests/lj-1087-vm-handler-call.test.c > new file mode 100644 > index 00000000..b7db9857 > --- /dev/null > +++ b/test/tarantool-c-tests/lj-1087-vm-handler-call.test.c > @@ -0,0 +1,170 @@ > +#include <assert.h> It is better to undef the `NDEBUG` macro for the non-Debug build to avoid hidden errors in the tests. See other C tests for examples. > +#include <stdbool.h> This include looks excess. > + > +#include "lua.h" > +#include "lauxlib.h" > + > +#include "test.h" > +#include "utils.h" > + > +/* > + * Test file to demonstrate a segmentation fault under > + * AddressSanitizer, when C function is used as a VM handler I suppose ASan isn't required to reproduce the issue. > + * in LuaJIT: > + * > + * Program received signal SIGSEGV, Segmentation fault. > + * 0x000055555557e77d in trace_abort (J=0x7ffff7f9b6b8) at lj_trace.c:615 > + * 615 lj_vmevent_send(L, TRACE, > + * (gdb) bt > + * > + * See details in https://github.com/LuaJIT/LuaJIT/issues/1087. > + */ > + > +#define UNUSED(x) ((void)(x)) Friendly reminder: You may rebase on the current master and not define this macro here. > + Minor: Please add the comment that `event == NULL` disables the corresponding handler. > +static void jit_attach(lua_State *L, void *cb, const char *event) Nit: Why don't you use `lua_CFunction` here to avoid the cast to it below? > +{ > + lua_getglobal(L, "jit"); > + lua_getfield(L, -1, "attach"); > + lua_pushcfunction(L, (lua_CFunction)cb); > + if (event != NULL) { Nit: Brackets are excess. Feel free to ignore. > + lua_pushstring(L, event); > + } else { > + lua_pushnil(L); > + } > + lua_pcall(L, 2, 0, 0); > +} > + > +static int trace_cb(lua_State *L) { > + UNUSED(L); > + return 0; > +} > + > +static int handle_luafunc_frame(void *test_state) > +{ <snipped> > +} > + > +static int nop(lua_State *L) > +{ Looks like here should be `UNUSED(L)` too. > + return 0; > +} This function is similar to the `trace_cb()` maybe we just use `nop()` as a trace callback always? > + > +static int cframe(lua_State *L) > +{ > + int rc = luaL_dostring(L, "repeat until nil > 1"); > + assert(rc == 1); > + lua_pop(L, 1); /* Remove errmsg. */ Nit: Please move the comment to the previous line. > + > + lua_pushcfunction(L, nop); > + lua_call(L, 0, 0); > + > + return 0; > +} > + > +static int handle_c_frame(void *test_state) > +{ > + /* Setup. */ > + lua_State *L = test_state; > + jit_attach(L, (void *)trace_cb, "trace"); > + > + lua_pushcfunction(L, cframe); > + lua_call(L, 0, 0); > + > + /* Teardown. */ > + lua_settop(L, 0); > + > + return TEST_EXIT_SUCCESS; > +} > + > +static int global_f(lua_State* L) > +{ > + lua_pushstring(L, "М"); > + lua_pushstring(L, "И"); > + lua_pushstring(L, "Р"); Please use only ASCII characters in the strings. > + lua_concat(L, 3); All this content looks excess, may we just use `nop()` as a `global_f()` instead? > + > + return 1; > +} > + > +static int handle_cont_frame(void *test_state) > +{ > + const char lua_chunk[] = > + "local t = setmetatable({}, { __index = global_f })" > + "for i = 1, 4 do" > + " _ = t[1]" > + "end"; > + > + /* Setup. */ > + lua_State *L = test_state; > + jit_attach(L, (void *)trace_cb, "trace"); > + int res = luaL_dostring(L, "jit.opt.start('minstitch=16')"); It is better to use the 32767 (REF_DROP - REF_BIAS) instead of 16 (with the corresponding comment). See the commit 0fdf06b456e6a8492308e53c96316a25eb058f30 ("test: relax JIT setup in misc.getmetrics test") for the details. > + assert(res == 0); > + lua_pushcfunction(L, global_f); > + lua_setglobal(L, "global_f"); > + > + res = luaL_loadstring(L, lua_chunk); > + assert(res == 0); > + lua_pcall(L, 0, LUA_MULTRET, 0); > + > + /* Teardown. */ > + lua_settop(L, 0); It is better to restore the default minstitch value here too. > + > + return TEST_EXIT_SUCCESS; > +} > + > +static int handle_bottom_frame(void *test_state) > +{ > + lua_State *L = test_state; > + > + /* Attach VM call handler. */ > + jit_attach(L, (void *)trace_cb, "trace"); > + > + /* Load a Lua code that generate a trace abort. */ > + int rc = luaL_dostring(L, "repeat until nil > 1"); > + assert(rc == 1); > + > + /* Triggers segmentation fault. */ > + jit_attach(L, (void *)trace_cb, NULL); > + > + /* Teardown. */ > + lua_settop(L, 0); > + > + return TEST_EXIT_SUCCESS; > +} > + > +int main(void) > +{ > +#if !defined(LJ_HASJIT) It is better to use early return here `if (!LJ_HASJIT)`. > + return skip_all("JIT is disabled"); > +#else /* LJ_HASJIT */ > + lua_State *L = utils_lua_init(); > + const struct test_unit tgroup[] = { > + test_unit_def(handle_luafunc_frame), > + test_unit_def(handle_bottom_frame), > + test_unit_def(handle_cont_frame), > + test_unit_def(handle_c_frame), > + }; > + luaL_openlibs(L); `luaL_openlibs()` is already called inside `utils_lua_init()`. > + int res = luaL_dostring(L, "jit.opt.start('hotloop=1')"); > + assert(res == 0); > + const int test_result = test_run_group(tgroup, L); > + utils_lua_close(L); > + return test_result; > +#endif /* LJ_HASJIT */ > +} > -- > 2.34.1 > -- Best regards, Sergey Kaplun
next prev parent reply other threads:[~2025-02-05 9:49 UTC|newest] Thread overview: 3+ messages / expand[flat|nested] mbox.gz Atom feed top 2025-01-29 9:46 Sergey Bronnikov via Tarantool-patches 2025-02-05 9:48 ` Sergey Kaplun via Tarantool-patches [this message] 2025-02-05 14:32 ` Sergey Bronnikov via Tarantool-patches
Reply instructions: You may reply publicly to this message via plain-text email using any one of the following methods: * Save the following mbox file, import it into your mail client, and reply-to-all from there: mbox Avoid top-posting and favor interleaved quoting: https://en.wikipedia.org/wiki/Posting_style#Interleaved_style * Reply using the --to, --cc, and --in-reply-to switches of git-send-email(1): git send-email \ --in-reply-to=Z6Mz4WyMAk0i90gk@root \ --to=tarantool-patches@dev.tarantool.org \ --cc=estetus@gmail.com \ --cc=skaplun@tarantool.org \ --subject='Re: [Tarantool-patches] [PATCH luajit] Handle all stack layouts in (delayed) TRACE vmevent.' \ /path/to/YOUR_REPLY https://kernel.org/pub/software/scm/git/docs/git-send-email.html * If your mail client supports setting the In-Reply-To header via mailto: links, try the mailto: link
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox