[Tarantool-patches] [PATCH luajit] Ensure correct stack top for OOM error message.
Igor Munkin
imun at tarantool.org
Fri Nov 11 11:53:27 MSK 2022
Sergey,
Thanks for the patch!
On 09.11.22, Sergey Kaplun wrote:
> From: Mike Pall <mike>
>
> Reported by Sergey Kaplun.
>
> (cherry picked from commit ca8d3257bb44e42100c7910c47dcdcf01f494187)
>
> `lj_err_mem()` doesn't set up `L->top` for Lua frames, but uses it for
I believe it "doesn't set `L->top` for the current Lua frame, but...",
doesn't it?
> pushing error message on the stack. So, when we call some routine that
Typo: s/some routine/a routine/ or s/some routine/arbitrary routine/.
> does some allocations, it can raise the OOM error (like `lj_tab_dup()`
Strictly saying it's LUA_ERRMEM, not OOM.
> in `BC_TDUP`) and this error may corrupt stack for unwind in situations
> when `L->top` < `L->base`.
>
> This patch restores `L->top` for Lua frames when raise the error via
Typo: s/raise the error/the error is raised/.
> `lj_err_mem()`.
>
> Sergey Kaplun:
> * added the description and the test for the problem
>
> Resolves tarantool/tarantool#3840
> Part of tarantool/tarantool#7230
> ---
>
> Issues:
> * https://github.com/LuaJIT/LuaJIT/issues/906
> * https://github.com/tarantool/tarantool/issues/7230
> * https://github.com/tarantool/tarantool/issues/3840
> PR: https://github.com/tarantool/tarantool/pull/7915
> Branch: https://github.com/tarantool/luajit/tree/skaplun/lj-906-fix-err-mem
>
> Red LuaJIT CI for MacOS Release builds is a known issue with self-hosted
> runners, as Igor has said before.
Everything is fixed at the moment, please try to rebase.
>
> src/lj_err.c | 1 +
> .../lj-906-fix-err-mem.test.lua | 90 +++++++++++++++++++
> 2 files changed, 91 insertions(+)
> create mode 100644 test/tarantool-tests/lj-906-fix-err-mem.test.lua
>
<snipped>
> diff --git a/test/tarantool-tests/lj-906-fix-err-mem.test.lua b/test/tarantool-tests/lj-906-fix-err-mem.test.lua
> new file mode 100644
> index 00000000..a139e1c9
> --- /dev/null
> +++ b/test/tarantool-tests/lj-906-fix-err-mem.test.lua
> @@ -0,0 +1,90 @@
> +local tap = require('tap')
> +local ffi = require('ffi')
> +local table_new = require('table.new')
> +
> +-- Avoid test to be killed.
> +require('utils').skipcond(ffi.abi('gc64'), 'test is not GC64 only')
> +
> +local test = tap.test('lj-906-fix-err-mem')
> +test:plan(1)
> +
> +local KB = 1024
> +local MB = 1024 * KB
> +
> +-- The maximum available table size, taking into account created
> +-- constants for one function.
> +local TNEW_SIZE = 511
> +
> +local gc_anchor = {}
> +
> +-- This function works until raises the error.
> +local function eat_chunks(size)
> + -- Need raise the OOM error inside TDUP, not TNEW, so reserve
> + -- memory for it.
> + -- luacheck: no unused
> + local tnew_anchor = table_new(TNEW_SIZE, 0)
> + while true do
> + table.insert(gc_anchor, ffi.new('char [?]', size))
> + end
> +end
> +
> +-- Function to format inner tab leading to TDUP emitting.
> +local function format_inner_tab()
Minor: Maybe <make_deep_table(depth)> instead of <format_inner_tab()>?
Just asking, feel free to ignore.
> + local inner_tab = ''
> + local inner_depth = 128
Why 128? Please, drop a few words.
> + -- Repeate table template for TDUP.
> + for _ = 1, inner_depth do
> + inner_tab = inner_tab .. '{a ='
> + end
> + inner_tab = inner_tab .. '{}'
> + for _ = 1, inner_depth do
> + inner_tab = inner_tab .. '},'
> + end
> + return inner_tab
> +end
> +
Minor: Same (also think about moving these helpers to utils.*).
Maybe <make_flat_table(size)> instead of <format_TDUP_chunk()>? And
mention TDUP specifics in the comment.
Just asking too, feel free to ignore.
> +local function format_TDUP_chunk()
> + local big_tab = 'local _ = {\n'
> + local inner_tab = format_inner_tab()
> + for _ = 1, TNEW_SIZE do
> + big_tab = big_tab .. inner_tab .. '\n'
> + end
> + big_tab = big_tab .. '}'
> + return big_tab
> +end
> +
> +local TDUP, err = loadstring(format_TDUP_chunk())
> +assert(TDUP, err)
> +
> +local function frame_before_TDUP()
> + -- Stack slots are needed for coredump in case of misbehaviour.
Why are they needed? Please drop few more words regarding this.
> + -- luacheck: no unused
> + local frame_slot1, frame_slot2
> + TDUP()
> + return frame_slot1, frame_slot2
> +end
> +
Minor: I believe you can pack these two lines into something one can
call <janitor>:
| local function janitor()
| collectgarbage('collect')
| collectgarbage('stop')
| end
Anyway, you can leave this as is.
> +collectgarbage()
> +collectgarbage('stop')
> +
> +-- Avoid OOM on traces.
> +jit.off()
> +
> +-- Stack slots are needed for coredump in case of misbehaviour.
> +-- luacheck: no unused
> +local r, e = pcall(eat_chunks, 8 * MB)
> +collectgarbage()
> +pcall(eat_chunks, 8 * KB)
> +collectgarbage()
> +pcall(eat_chunks, 8)
> +collectgarbage()
> +
> +pcall(frame_before_TDUP)
Do we need to check the status of the <pcall> above?
> +
> +-- Release memory for `tap` functions.
> +gc_anchor = nil
> +collectgarbage()
Is this step required? I doubt.
> +
> +test:ok(true, 'correctly throw memory error')
> +
> +os.exit(test:check() and 0 or 1)
> --
> 2.34.1
>
--
Best regards,
IM
More information about the Tarantool-patches
mailing list