[Tarantool-patches] [PATCH luajit] Avoid out-of-range PC for stack overflow error from snapshot restore.
Sergey Bronnikov
sergeyb at tarantool.org
Mon Sep 8 18:10:45 MSK 2025
Hi, Sergey,
thanks for the patch! LGTM
Sergey
On 8/19/25 20:11, Sergey Kaplun wrote:
> From: Mike Pall <mike>
>
> Reported by Sergey Kaplun.
>
> (cherry picked from commit e3fa3c48d8a4aadcf86429e9f7f6f1171914b15a)
>
> When the saved PC in the snapshot is the first PC (index 0) in the
> prototype, such as JFUNC*, the subtraction that computes the previous
> PC in `debug_framepc()` wraps around and yields `NO_BCPOS`. As a
> result, `pos` is greater than `sizebc`, so the code below may interpret
> the bits of `pt->varinfo` as a return bytecode (via `bc_isret()`) and
> assign an invalid value to the returned `pos`. This may in turn lead to
> an assertion failure in `lj_debug_frameline()`.
>
> This patch fixes it by treating this case as the first non-header
> bytecode in the prototype. It also removes the skipcond introduced in
> commit a74e5be07d54b4e98b85493de73317db520b3f71
> ("test: conditionally disable flaky lj-1196"). No new test is added
> since the assertion failure depends on the specific memory address of
> `varinfo`, so it is too hard to create a stable reproducer.
>
> Sergey Kaplun:
> * added the description for the problem
>
> Part of tarantool/tarantool#11691
> ---
>
> Branch: https://github.com/tarantool/luajit/tree/skaplun/lj-1369-stackov-invalid-bc
> Related issues:
> * https://github.com/tarantool/tarantool/issues/11691
> * https://github.com/LuaJIT/LuaJIT/issues/1369
> * https://github.com/LuaJIT/LuaJIT/issues/1359
> * https://github.com/LuaJIT/LuaJIT/issues/1196
>
> src/lj_debug.c | 1 +
> .../lj-1196-partial-snap-restore.test.lua | 10 +---------
> 2 files changed, 2 insertions(+), 9 deletions(-)
>
> diff --git a/src/lj_debug.c b/src/lj_debug.c
> index 76e48aca..bc057cf6 100644
> --- a/src/lj_debug.c
> +++ b/src/lj_debug.c
> @@ -101,6 +101,7 @@ static BCPos debug_framepc(lua_State *L, GCfunc *fn, cTValue *nextframe)
>    pt = funcproto(fn);
>    pos = proto_bcpos(pt, ins) - 1;
>  #if LJ_HASJIT
> +  if (pos == NO_BCPOS) return 1;  /* Pretend it's the first bytecode. */
>    if (pos > pt->sizebc) {  /* Undo the effects of lj_trace_exit for JLOOP. */
>      if (bc_isret(bc_op(ins[-1]))) {
>        GCtrace *T = (GCtrace *)((char *)(ins-1) - offsetof(GCtrace, startins));
> diff --git a/test/tarantool-tests/lj-1196-partial-snap-restore.test.lua b/test/tarantool-tests/lj-1196-partial-snap-restore.test.lua
> index 5199ca00..a74f97bd 100644
> --- a/test/tarantool-tests/lj-1196-partial-snap-restore.test.lua
> +++ b/test/tarantool-tests/lj-1196-partial-snap-restore.test.lua
> @@ -4,15 +4,7 @@ local tap = require('tap')
> -- in case of the stack overflow.
> -- See also: https://github.com/LuaJIT/LuaJIT/issues/1196.
>
> -local test = tap.test('lj-1196-partial-snap-restore'):skipcond({
> -  -- Disable test for Tarantool to avoid failures, see also:
> -  -- https://github.com/LuaJIT/LuaJIT/issues/1369.
> -  ['Disabled for Tarantool due to lj-1369'] = _TARANTOOL,
> -  -- Also, it may fail on some non-arm64 runners stable after
> -  -- adding the skip condition above.
> -  ['Disabled for x86/x64 due to lj-1369'] = jit.arch ~= 'arm64',
> -})
> -
> +local test = tap.test('lj-1196-partial-snap-restore')
> test:plan(1)
>
> -- XXX: The reproducer below uses several stack slot offsets to
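
For reference, the wraparound described in the commit message can be
sketched in isolation. This is a minimal standalone illustration, not
LuaJIT code: `BCPos` and `NO_BCPOS` are redefined locally on the
assumption that they match LuaJIT's definitions (a 32-bit unsigned type
and its all-ones value), and `prev_pos_unguarded()` and
`prev_pos_guarded()` are hypothetical helpers that stand in for the
`proto_bcpos(pt, ins) - 1` computation before and after the patch.

```c
#include <stdint.h>

/* Assumption: mirrors LuaJIT's BCPos (32-bit unsigned bytecode position). */
typedef uint32_t BCPos;
/* Assumption: mirrors LuaJIT's NO_BCPOS, i.e. all bits set. */
#define NO_BCPOS (~(BCPos)0)

/* Hypothetical helper: "previous PC" without the guard.
** For bcpos == 0 the unsigned subtraction wraps around to NO_BCPOS.
*/
static BCPos prev_pos_unguarded(BCPos bcpos)
{
  return bcpos - 1;
}

/* Hypothetical helper: the same computation with the guard from the patch.
** A wrapped position is reported as 1, the first non-header bytecode.
*/
static BCPos prev_pos_guarded(BCPos bcpos)
{
  BCPos pos = bcpos - 1;
  if (pos == NO_BCPOS) return 1;  /* Pretend it's the first bytecode. */
  return pos;
}
```

With `bcpos == 0`, the unguarded helper returns `NO_BCPOS`, which
compares greater than any realistic `sizebc` and would send
`debug_framepc()` into the JLOOP branch. The guarded helper returns 1
instead. Other inputs give the same result from both helpers.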