Tarantool development patches archive
 help / color / mirror / Atom feed
From: Sergey Kaplun via Tarantool-patches <tarantool-patches@dev.tarantool.org>
To: Sergey Bronnikov <estetus@gmail.com>
Cc: tarantool-patches@dev.tarantool.org
Subject: Re: [Tarantool-patches] [PATCH luajit] Handle all stack layouts in (delayed) TRACE vmevent.
Date: Wed, 5 Feb 2025 12:48:17 +0300	[thread overview]
Message-ID: <Z6Mz4WyMAk0i90gk@root> (raw)
In-Reply-To: <ab0a1de0bf8685c27ae12ffb9352ba754c846cd4.1738143971.git.sergeyb@tarantool.org>

Hi, Segey!
Thanks for the patch!
It is really nice that we covered all branches, nice work!

LGTM, after fixing a bunch of nits below.

Side note: it is better to send v2 in the separate letter to glance it
if necessary.

On 29.01.25, Sergey Bronnikov wrote:
> From: Mike Pall <mike>
> 
> Thanks to Sergey Bronnikov and Peter Cawley.
> 
> (cherry picked from commit b138ccfa918518a152bc830fef3d53cd0a922e36)
> 
> When recording a trace using a Lua code like
> `repeat until `a >= 'b' > 'b'` a Lua error is encountered
> (`attempt to compare string with nil`), which (along with
> raising the error) causes an asynchronous trace abort.
> The trace abort remains pending until the call of
> `lua_pcall(L, 2, 0, 0)` that actually calls `jit.attach(trace_cb, nil)`.

I would rather say:
| ... until the next reentrance to the LuaJIT VM via `lj_vm_call()` or
| `lj_vm_pcall().

Also, please add this comment to the C test header to make it clear why
do we use additional `lua_pcall()`-s with dummy functions.

> 
> On handling abort LuaJIT is searching for the topmost Lua frame
> on the stack, that was active when the trace abort happened,
> it is needed to generate a better error message.
> Unfortunately, because the abort was due to an error, and
> the error was caught by a `lua_pcall` with unspecified

Typo: s/unspecified/an unspecified/

> error function (4th argument), the Lua frame that caused the abort
> was already removed as part of error processing, so the search
> cannot find it. Furthermore, in this particular case, there are
> no Lua frames on the stack, which isn't something the search code
> had considered possible.
> 
> Sergey Bronnikov:
> * added the description and the test for the problem
> 
> Part of tarantool/tarantool#10709

This is 11055 ticket now.

> ---
> Branch: https://github.com/tarantool/luajit/tree/ligurio/lj-1087-vm-handler-call
> 
> Related issues:
> - https://github.com/LuaJIT/LuaJIT/issues/1087
> - https://github.com/tarantool/tarantool/issues/10709
> 
>  src/lj_trace.c                                |  26 +--
>  test/tarantool-c-tests/CMakeLists.txt         |   4 +
>  .../lj-1087-vm-handler-call.test.c            | 170 ++++++++++++++++++
>  3 files changed, 190 insertions(+), 10 deletions(-)
>  create mode 100644 test/tarantool-c-tests/lj-1087-vm-handler-call.test.c
> 
> diff --git a/src/lj_trace.c b/src/lj_trace.c
> index 94cb27e5..6b97cc13 100644
> --- a/src/lj_trace.c
> +++ b/src/lj_trace.c

<snipped>

> diff --git a/test/tarantool-c-tests/CMakeLists.txt b/test/tarantool-c-tests/CMakeLists.txt
> index c4a402d0..0e7c4698 100644
> --- a/test/tarantool-c-tests/CMakeLists.txt
> +++ b/test/tarantool-c-tests/CMakeLists.txt
> @@ -36,6 +36,10 @@ add_test_suite_target(tarantool-c-tests
>    DEPENDS libluajit libtest tarantool-c-tests-build
>  )
>  
> +if(NOT LUAJIT_DISABLE_JIT)
> +  AppendFlags(TESTS_C_FLAGS "-DLJ_HASJIT")
> +endif(NOT LUAJIT_DISABLE_JIT)
> +

It is better just to include <lj_arch.h> in the test instead. See the example
of usage in the <misclib-getmetrics-capi.test.c>. Therefore this part is
excess.

>  set(CTEST_SRC_SUFFIX ".test.c")
>  file(GLOB tests "${CMAKE_CURRENT_SOURCE_DIR}/*${CTEST_SRC_SUFFIX}")
>  foreach(test_source ${tests})
> diff --git a/test/tarantool-c-tests/lj-1087-vm-handler-call.test.c b/test/tarantool-c-tests/lj-1087-vm-handler-call.test.c
> new file mode 100644
> index 00000000..b7db9857
> --- /dev/null
> +++ b/test/tarantool-c-tests/lj-1087-vm-handler-call.test.c
> @@ -0,0 +1,170 @@
> +#include <assert.h>

It is better to undef the `NDEBUG` macro for the non-Debug build to
avoid hidden errors in the tests. See other C tests for examples.

> +#include <stdbool.h>

This include looks excess.

> +
> +#include "lua.h"
> +#include "lauxlib.h"
> +
> +#include "test.h"
> +#include "utils.h"
> +
> +/*
> + * Test file to demonstrate a segmentation fault under
> + * AddressSanitizer, when C function is used as a VM handler

I suppose ASan isn't required to reproduce the issue.

> + * in LuaJIT:
> + *
> + * Program received signal SIGSEGV, Segmentation fault.
> + * 0x000055555557e77d in trace_abort (J=0x7ffff7f9b6b8) at lj_trace.c:615
> + * 615         lj_vmevent_send(L, TRACE,
> + * (gdb) bt
> + *
> + * See details in https://github.com/LuaJIT/LuaJIT/issues/1087.
> + */
> +
> +#define UNUSED(x) ((void)(x))

Friendly reminder: You may rebase on the current master and not define
this macro here.

> +

Minor: Please add the comment that `event == NULL` disables the
corresponding handler.

> +static void jit_attach(lua_State *L, void *cb, const char *event)

Nit: Why don't you use `lua_CFunction` here to avoid the cast to it
below?

> +{
> +	lua_getglobal(L, "jit");
> +	lua_getfield(L, -1, "attach");
> +	lua_pushcfunction(L, (lua_CFunction)cb);
> +	if (event != NULL) {

Nit: Brackets are excess.
Feel free to ignore.

> +		lua_pushstring(L, event);
> +	} else {
> +		lua_pushnil(L);
> +	}
> +	lua_pcall(L, 2, 0, 0);
> +}
> +
> +static int trace_cb(lua_State *L) {
> +	UNUSED(L);
> +	return 0;
> +}
> +
> +static int handle_luafunc_frame(void *test_state)
> +{

<snipped>

> +}
> +
> +static int nop(lua_State *L)
> +{

Looks like here should be `UNUSED(L)` too.

> +	return 0;
> +}

This function is similar to the `trace_cb()` maybe we just use `nop()`
as a trace callback always?

> +
> +static int cframe(lua_State *L)
> +{
> +	int rc = luaL_dostring(L, "repeat until nil > 1");
> +	assert(rc == 1);
> +	lua_pop(L, 1);		/* Remove errmsg. */

Nit: Please move the comment to the previous line.

> +
> +	lua_pushcfunction(L, nop);
> +	lua_call(L, 0, 0);
> +
> +	return 0;
> +}
> +
> +static int handle_c_frame(void *test_state)
> +{
> +	/* Setup. */
> +	lua_State *L = test_state;
> +	jit_attach(L, (void *)trace_cb, "trace");
> +
> +	lua_pushcfunction(L, cframe);
> +	lua_call(L, 0, 0);
> +
> +	/* Teardown. */
> +	lua_settop(L, 0);
> +
> +	return TEST_EXIT_SUCCESS;
> +}
> +
> +static int global_f(lua_State* L)
> +{
> +	lua_pushstring(L, "М");
> +	lua_pushstring(L, "И");
> +	lua_pushstring(L, "Р");

Please use only ASCII characters in the strings.

> +	lua_concat(L, 3);

All this content looks excess, may we just use `nop()` as a `global_f()`
instead?

> +
> +	return 1;
> +}
> +
> +static int handle_cont_frame(void *test_state)
> +{
> +	const char lua_chunk[] =
> +		"local t = setmetatable({}, { __index = global_f })"
> +		"for i = 1, 4 do"
> +		"    _ = t[1]"
> +		"end";
> +
> +	/* Setup. */
> +	lua_State *L = test_state;
> +	jit_attach(L, (void *)trace_cb, "trace");
> +	int res = luaL_dostring(L, "jit.opt.start('minstitch=16')");

It is better to use the 32767 (REF_DROP - REF_BIAS) instead of 16 (with
the corresponding comment). See the commit
0fdf06b456e6a8492308e53c96316a25eb058f30 ("test: relax JIT setup in
misc.getmetrics test") for the details.

> +	assert(res == 0);
> +	lua_pushcfunction(L, global_f);
> +	lua_setglobal(L, "global_f");
> +
> +	res = luaL_loadstring(L, lua_chunk);
> +	assert(res == 0);
> +	lua_pcall(L, 0, LUA_MULTRET, 0);
> +
> +	/* Teardown. */
> +	lua_settop(L, 0);

It is better to restore the default minstitch value here too.

> +
> +	return TEST_EXIT_SUCCESS;
> +}
> +
> +static int handle_bottom_frame(void *test_state)
> +{
> +	lua_State *L = test_state;
> +
> +	/* Attach VM call handler. */
> +	jit_attach(L, (void *)trace_cb, "trace");
> +
> +	/* Load a Lua code that generate a trace abort. */
> +	int rc = luaL_dostring(L, "repeat until nil > 1");
> +	assert(rc == 1);
> +
> +	/* Triggers segmentation fault. */
> +	jit_attach(L, (void *)trace_cb, NULL);
> +
> +	/* Teardown. */
> +	lua_settop(L, 0);
> +
> +	return TEST_EXIT_SUCCESS;
> +}
> +
> +int main(void)
> +{
> +#if !defined(LJ_HASJIT)

It is better to use early return here `if (!LJ_HASJIT)`.

> +	return skip_all("JIT is disabled");
> +#else /* LJ_HASJIT */
> +	lua_State *L = utils_lua_init();
> +	const struct test_unit tgroup[] = {
> +		test_unit_def(handle_luafunc_frame),
> +		test_unit_def(handle_bottom_frame),
> +		test_unit_def(handle_cont_frame),
> +		test_unit_def(handle_c_frame),
> +	};
> +	luaL_openlibs(L);

`luaL_openlibs()` is already called inside `utils_lua_init()`.

> +	int res = luaL_dostring(L, "jit.opt.start('hotloop=1')");
> +	assert(res == 0);
> +	const int test_result = test_run_group(tgroup, L);
> +	utils_lua_close(L);
> +	return test_result;
> +#endif /* LJ_HASJIT */
> +}
> -- 
> 2.34.1
> 

-- 
Best regards,
Sergey Kaplun

  reply	other threads:[~2025-02-05  9:49 UTC|newest]

Thread overview: 3+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2025-01-29  9:46 Sergey Bronnikov via Tarantool-patches
2025-02-05  9:48 ` Sergey Kaplun via Tarantool-patches [this message]
2025-02-05 14:32   ` Sergey Bronnikov via Tarantool-patches

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=Z6Mz4WyMAk0i90gk@root \
    --to=tarantool-patches@dev.tarantool.org \
    --cc=estetus@gmail.com \
    --cc=skaplun@tarantool.org \
    --subject='Re: [Tarantool-patches] [PATCH luajit] Handle all stack layouts in (delayed) TRACE vmevent.' \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox