Tarantool development patches archive
 help / color / mirror / Atom feed
* [Tarantool-patches] [PATCH luajit] Fix stack allocation after on-trace stack check.
@ 2024-09-10 14:05 Sergey Kaplun via Tarantool-patches
  2024-09-17  7:45 ` Sergey Bronnikov via Tarantool-patches
                   ` (2 more replies)
  0 siblings, 3 replies; 4+ messages in thread
From: Sergey Kaplun via Tarantool-patches @ 2024-09-10 14:05 UTC (permalink / raw)
  To: Maxim Kokryashkin, Sergey Bronnikov; +Cc: tarantool-patches

From: Mike Pall <mike>

(cherry picked from commit 204cee2c917f55f288c0b166742e56c134fe578c)

It is possible that a snapshot topslot is less than the possible topslot
of the Lua stack. In that case, if the Lua stack overflows in
`lj_vmevent_prepare()`, the error is raised inside
`lj_vm_exit_handler()`, which has no corresponding DWARF eh_frame [1],
so it leads to the crash.

This patch fix-ups the topslot of the snapshot on trace exit to the
maximum possible one.

Sergey Kaplun:
* added the description and the test for the problem

[1]: https://refspecs.linuxfoundation.org/LSB_3.0.0/LSB-Core-generic/LSB-Core-generic/ehframechpt.html

Part of tarantool/tarantool#10199
---

Branch: https://github.com/tarantool/luajit/tree/skaplun/fix-stack-alloc-on-trace
Issue: https://github.com/tarantool/tarantool/issues/10199

 src/lj_trace.c                                |  6 ++-
 .../fix-stack-alloc-on-trace-exit.test.lua    | 53 +++++++++++++++++++
 2 files changed, 58 insertions(+), 1 deletion(-)
 create mode 100644 test/tarantool-tests/fix-stack-alloc-on-trace-exit.test.lua

diff --git a/src/lj_trace.c b/src/lj_trace.c
index 20014ecb..94cb27e5 100644
--- a/src/lj_trace.c
+++ b/src/lj_trace.c
@@ -522,7 +522,11 @@ static void trace_stop(jit_State *J)
     lj_assertJ(J->parent != 0 && J->cur.root != 0, "not a side trace");
     lj_asm_patchexit(J, traceref(J, J->parent), J->exitno, J->cur.mcode);
     /* Avoid compiling a side trace twice (stack resizing uses parent exit). */
-    traceref(J, J->parent)->snap[J->exitno].count = SNAPCOUNT_DONE;
+    {
+      SnapShot *snap = &traceref(J, J->parent)->snap[J->exitno];
+      snap->count = SNAPCOUNT_DONE;
+      if (J->cur.topslot > snap->topslot) snap->topslot = J->cur.topslot;
+    }
     /* Add to side trace chain in root trace. */
     {
       GCtrace *root = traceref(J, J->cur.root);
diff --git a/test/tarantool-tests/fix-stack-alloc-on-trace-exit.test.lua b/test/tarantool-tests/fix-stack-alloc-on-trace-exit.test.lua
new file mode 100644
index 00000000..ca04e54e
--- /dev/null
+++ b/test/tarantool-tests/fix-stack-alloc-on-trace-exit.test.lua
@@ -0,0 +1,53 @@
+local tap = require('tap')
+
+-- Test file to demonstrate incorrect Lua stack restoration on
+-- exit from trace by the stack overflow.
+
+local test = tap.test('fix-stack-alloc-on-trace-exit'):skipcond({
+  ['Test requires JIT enabled'] = not jit.status(),
+})
+
+local jit_dump = require('jit.dump')
+
+test:plan(2)
+
+-- Before the patch, it is possible that a snapshot topslot is
+-- less than the possible topslot of the Lua stack. In that case,
+-- if the Lua stack overflows in `lj_vmevent_prepare()`, the error
+-- is raised inside `lj_vm_exit_handler()`, which has no
+-- corresponding DWARF eh_frame, so it leads to the crash.
+
+-- Need for the stack growing in `lj_vmevent_prepare`.
+jit_dump.start('x', '/dev/null')
+
+-- Create a coroutine with a fixed stack size.
+local coro = coroutine.create(function()
+  jit.opt.start('hotloop=1', 'hotexit=1', 'callunroll=1')
+
+  -- `math.modf` recording is NYI.
+  -- Local `math_modf` simplifies `jit.dump()` output.
+  local math_modf = math.modf
+
+  local function trace(n)
+    n = n + 1
+    -- luacheck: ignore
+    -- Start a side trace here.
+    if n % 2 == 0 then end
+    -- Stop the recording of the side trace and a main trace,
+    -- stitching.
+    math_modf(1, 1)
+    -- Grow stack, avoid tail calls.
+    local unused = trace(n)
+    return unused
+  end
+
+  local n = 0
+  trace(n)
+end)
+
+local result, errmsg = coroutine.resume(coro)
+
+test:ok(not result, 'correct status and no crash')
+test:like(errmsg, 'stack overflow', 'correct error message')
+
+test:done(true)
-- 
2.46.0


^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [Tarantool-patches] [PATCH luajit] Fix stack allocation after on-trace stack check.
  2024-09-10 14:05 [Tarantool-patches] [PATCH luajit] Fix stack allocation after on-trace stack check Sergey Kaplun via Tarantool-patches
@ 2024-09-17  7:45 ` Sergey Bronnikov via Tarantool-patches
  2024-09-23  7:10 ` Maxim Kokryashkin via Tarantool-patches
  2024-10-18 15:15 ` Sergey Kaplun via Tarantool-patches
  2 siblings, 0 replies; 4+ messages in thread
From: Sergey Bronnikov via Tarantool-patches @ 2024-09-17  7:45 UTC (permalink / raw)
  To: Sergey Kaplun, Maxim Kokryashkin; +Cc: tarantool-patches

[-- Attachment #1: Type: text/plain, Size: 3974 bytes --]

Hi, Sergey,

thanks for the patch! LGTM

On 10.09.2024 17:05, Sergey Kaplun wrote:
> From: Mike Pall <mike>
>
> (cherry picked from commit 204cee2c917f55f288c0b166742e56c134fe578c)
>
> It is possible that a snapshot topslot is less than the possible topslot
> of the Lua stack. In that case, if the Lua stack overflows in
> `lj_vmevent_prepare()`, the error is raised inside
> `lj_vm_exit_handler()`, which has no corresponding DWARF eh_frame [1],
> so it leads to the crash.
>
> This patch fix-ups the topslot of the snapshot on trace exit to the
> maximum possible one.
>
> Sergey Kaplun:
> * added the description and the test for the problem
>
> [1]:https://refspecs.linuxfoundation.org/LSB_3.0.0/LSB-Core-generic/LSB-Core-generic/ehframechpt.html
>
> Part of tarantool/tarantool#10199
> ---
>
> Branch:https://github.com/tarantool/luajit/tree/skaplun/fix-stack-alloc-on-trace
> Issue:https://github.com/tarantool/tarantool/issues/10199
>
>   src/lj_trace.c                                |  6 ++-
>   .../fix-stack-alloc-on-trace-exit.test.lua    | 53 +++++++++++++++++++
>   2 files changed, 58 insertions(+), 1 deletion(-)
>   create mode 100644 test/tarantool-tests/fix-stack-alloc-on-trace-exit.test.lua
>
> diff --git a/src/lj_trace.c b/src/lj_trace.c
> index 20014ecb..94cb27e5 100644
> --- a/src/lj_trace.c
> +++ b/src/lj_trace.c
> @@ -522,7 +522,11 @@ static void trace_stop(jit_State *J)
>       lj_assertJ(J->parent != 0 && J->cur.root != 0, "not a side trace");
>       lj_asm_patchexit(J, traceref(J, J->parent), J->exitno, J->cur.mcode);
>       /* Avoid compiling a side trace twice (stack resizing uses parent exit). */
> -    traceref(J, J->parent)->snap[J->exitno].count = SNAPCOUNT_DONE;
> +    {
> +      SnapShot *snap = &traceref(J, J->parent)->snap[J->exitno];
> +      snap->count = SNAPCOUNT_DONE;
> +      if (J->cur.topslot > snap->topslot) snap->topslot = J->cur.topslot;
> +    }
>       /* Add to side trace chain in root trace. */
>       {
>         GCtrace *root = traceref(J, J->cur.root);
> diff --git a/test/tarantool-tests/fix-stack-alloc-on-trace-exit.test.lua b/test/tarantool-tests/fix-stack-alloc-on-trace-exit.test.lua
> new file mode 100644
> index 00000000..ca04e54e
> --- /dev/null
> +++ b/test/tarantool-tests/fix-stack-alloc-on-trace-exit.test.lua
> @@ -0,0 +1,53 @@
> +local tap = require('tap')
> +
> +-- Test file to demonstrate incorrect Lua stack restoration on
> +-- exit from trace by the stack overflow.
> +
> +local test = tap.test('fix-stack-alloc-on-trace-exit'):skipcond({
> +  ['Test requires JIT enabled'] = not jit.status(),
> +})
> +
> +local jit_dump = require('jit.dump')
> +
> +test:plan(2)
> +
> +-- Before the patch, it is possible that a snapshot topslot is
> +-- less than the possible topslot of the Lua stack. In that case,
> +-- if the Lua stack overflows in `lj_vmevent_prepare()`, the error
> +-- is raised inside `lj_vm_exit_handler()`, which has no
> +-- corresponding DWARF eh_frame, so it leads to the crash.
> +
> +-- Need for the stack growing in `lj_vmevent_prepare`.
> +jit_dump.start('x', '/dev/null')
> +
> +-- Create a coroutine with a fixed stack size.
> +local coro = coroutine.create(function()
> +  jit.opt.start('hotloop=1', 'hotexit=1', 'callunroll=1')
> +
> +  -- `math.modf` recording is NYI.
> +  -- Local `math_modf` simplifies `jit.dump()` output.
> +  local math_modf = math.modf
> +
> +  local function trace(n)
> +    n = n + 1
> +    -- luacheck: ignore
> +    -- Start a side trace here.
> +    if n % 2 == 0 then end
> +    -- Stop the recording of the side trace and a main trace,
> +    -- stitching.
> +    math_modf(1, 1)
> +    -- Grow stack, avoid tail calls.
> +    local unused = trace(n)
> +    return unused
> +  end
> +
> +  local n = 0
> +  trace(n)
> +end)
> +
> +local result, errmsg = coroutine.resume(coro)
> +
> +test:ok(not result, 'correct status and no crash')
> +test:like(errmsg, 'stack overflow', 'correct error message')
> +
> +test:done(true)

[-- Attachment #2: Type: text/html, Size: 4662 bytes --]

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [Tarantool-patches] [PATCH luajit] Fix stack allocation after on-trace stack check.
  2024-09-10 14:05 [Tarantool-patches] [PATCH luajit] Fix stack allocation after on-trace stack check Sergey Kaplun via Tarantool-patches
  2024-09-17  7:45 ` Sergey Bronnikov via Tarantool-patches
@ 2024-09-23  7:10 ` Maxim Kokryashkin via Tarantool-patches
  2024-10-18 15:15 ` Sergey Kaplun via Tarantool-patches
  2 siblings, 0 replies; 4+ messages in thread
From: Maxim Kokryashkin via Tarantool-patches @ 2024-09-23  7:10 UTC (permalink / raw)
  To: Sergey Kaplun; +Cc: tarantool-patches

Hi, Sergey!
Thanks for the patch!
LGTM

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [Tarantool-patches] [PATCH luajit] Fix stack allocation after on-trace stack check.
  2024-09-10 14:05 [Tarantool-patches] [PATCH luajit] Fix stack allocation after on-trace stack check Sergey Kaplun via Tarantool-patches
  2024-09-17  7:45 ` Sergey Bronnikov via Tarantool-patches
  2024-09-23  7:10 ` Maxim Kokryashkin via Tarantool-patches
@ 2024-10-18 15:15 ` Sergey Kaplun via Tarantool-patches
  2 siblings, 0 replies; 4+ messages in thread
From: Sergey Kaplun via Tarantool-patches @ 2024-10-18 15:15 UTC (permalink / raw)
  To: Maxim Kokryashkin, Sergey Bronnikov; +Cc: tarantool-patches

I've applied the patch into all long-term branches in tarantool/luajit
and bumped a new version in master [1], release/3.2 [2] and
release/2.11 [3].

[1]: https://github.com/tarantool/tarantool/pull/10712
[2]: https://github.com/tarantool/tarantool/pull/10713
[3]: https://github.com/tarantool/tarantool/pull/10714

-- 
Best regards,
Sergey Kaplun

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2024-10-18 15:16 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2024-09-10 14:05 [Tarantool-patches] [PATCH luajit] Fix stack allocation after on-trace stack check Sergey Kaplun via Tarantool-patches
2024-09-17  7:45 ` Sergey Bronnikov via Tarantool-patches
2024-09-23  7:10 ` Maxim Kokryashkin via Tarantool-patches
2024-10-18 15:15 ` Sergey Kaplun via Tarantool-patches

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox