[Tarantool-patches] [PATCH luajit] ARM64: Fix assembly of HREFK.

Sergey Bronnikov sergeyb at tarantool.org
Thu Jun 26 12:47:20 MSK 2025


LGTM, thanks!

On 6/25/25 17:41, Sergey Kaplun wrote:
> Hi, Sergey!
> Thanks for the review!
> Please consider my answers below.
>
> On 25.06.25, Sergey Bronnikov wrote:
>> Hi, Sergey,
>>
>> thanks for the patch! Please see my comments below.
>>
>> Sergey
>>
>> On 6/12/25 12:36, Sergey Kaplun wrote:
>>> From: Mike Pall <mike>
>>>
>>> Reported by caohongqing.
>>> Fix contributed by Peter Cawley.
>>>
>>> (cherry picked from commit 8fbd576fb9414a5fa70dfa6069733d3416a78269)
>>>
>>> `asm_hrefk()` uses the check for the offset for the corresponding node
>>> structure. However, the target load is performed from its inner `key`
>>> field with the offset 8. In the case of a huge table, it is possible
>>> that the offset of the node (4095 * 8) is less than 4096 * 8 and can be
>>> emitted via the corresponding instruction as an immediate offset, but
>>> the offset of the `key` field is not. This leads to the corresponding
>>> assertion failure in `emit_lso()`.
>> The issue [1] contains yet another fix in the same place [2]. We decided
>> to backport the patch
>>
>> separately. But please mention this in commit message.
>>
>>
>> 1.https://github.com/LuaJIT/LuaJIT/issues/1026
>>
>> 2.
>> https://github.com/LuaJIT/LuaJIT/commit/93ce12ee15abf28ef4cb24ae7e4b8a5b73d75c85
> These issues are completely independent, IMO. I would rather not mention
> it. Otherwise, by this logic, we should mention every problem related to
> the HREFK here.
ok
>>> This patch fixes this behaviour by the correct check.
>>>
>>> Sergey Kaplun:
>>> * added the description and the test for the problem
>>>
>>> Part of tarantool/tarantool#11278
>>> ---
>>>
>>> Related issues:
>>> *https://github.com/LuaJIT/LuaJIT/issues/1026
>>> *https://github.com/tarantool/tarantool/issues/11278
>>> Branch:https://github.com/tarantool/luajit/tree/skaplun/lj-1026-arm64-invalid-hrefk-offset-check
>>>
>>>    src/lj_asm_arm64.h                            |  2 +-
>>>    ...-arm64-invalid-hrefk-offset-check.test.lua | 48 +++++++++++++++++++
>>>    2 files changed, 49 insertions(+), 1 deletion(-)
>>>    create mode 100644 test/tarantool-tests/lj-1026-arm64-invalid-hrefk-offset-check.test.lua
>>>
>>> diff --git a/src/lj_asm_arm64.h b/src/lj_asm_arm64.h
>>> index 6c7b011f..a7f059a2 100644
>>> --- a/src/lj_asm_arm64.h
>>> +++ b/src/lj_asm_arm64.h
>>> @@ -885,7 +885,7 @@ static void asm_hrefk(ASMState *as, IRIns *ir)
> <snipped>
>
>>> diff --git a/test/tarantool-tests/lj-1026-arm64-invalid-hrefk-offset-check.test.lua b/test/tarantool-tests/lj-1026-arm64-invalid-hrefk-offset-check.test.lua
>>> new file mode 100644
>>> index 00000000..de243814
>>> --- /dev/null
>>> +++ b/test/tarantool-tests/lj-1026-arm64-invalid-hrefk-offset-check.test.lua
>>> @@ -0,0 +1,48 @@
>>> +local tap = require('tap')
>>> +
>>> +-- Test file to demonstrate LuaJIT misbehaviour when assembling
>>> +-- HREFK instruction on arm64 with the huge offset.
>>> +-- Seealso:https://github.com/LuaJIT/LuaJIT/issues/1026.
>>> +local test = tap.test('lj-1026-arm64-invalid-hrefk-offset-check'):skipcond({
>>> +  ['Test requires JIT enabled'] = not jit.status(),
>> It is an ARM-specific patch, should we add a condition for ARM here?
> It is a good question. It was once discussed, and we decided not to add
> the skip condition to make other architectures more covered by tests too
> (for example, we may check MIPS/PPC in the same test if we want to
> support them).
Ok.
>>> +})
>>> +
>>> +test:plan(1)
>>> +
>>> +-- The assertion fails since in HREFK we are checking the offset
>>> +-- from the hslots of the table of the Node structure itself
>> s/Node/`Node`/
> Fixed. See the iterative patch below.
>
> ===================================================================
> diff --git a/test/tarantool-tests/lj-1026-arm64-invalid-hrefk-offset-check.test.lua b/test/tarantool-tests/lj-1026-arm64-invalid-hrefk-offset-check.test.lua
> index de243814..caa6291d 100644
> --- a/test/tarantool-tests/lj-1026-arm64-invalid-hrefk-offset-check.test.lua
> +++ b/test/tarantool-tests/lj-1026-arm64-invalid-hrefk-offset-check.test.lua
> @@ -10,7 +10,7 @@ local test = tap.test('lj-1026-arm64-invalid-hrefk-offset-check'):skipcond({
>   test:plan(1)
>   
>   -- The assertion fails since in HREFK we are checking the offset
> --- from the hslots of the table of the Node structure itself
> +-- from the hslots of the table of the `Node` structure itself
>   -- instead of its inner field `key` (with additional 8 bytes).
>   -- So to test this, we generate a big table with constant keys
>   -- and compile a trace for each HREFK possible.
> ===================================================================
>
> Branch is force-pushed.
>
>>> +-- instead of its inner field `key` (with additional 8 bytes).
>>> +-- So to test this, we generate a big table with constant keys
>>> +-- and compile a trace for each HREFK possible.
>>> +
>>> +local big_tab = {}
>>> +-- The map of the characters to generate constant string keys.
>>> +-- The offset of the node should be 4096 * 8. It takes at least
>>> +-- 1365 keys to hit this value. The maximum possible slots in the
>>> +-- hash part is 2048, so to fill it with the maximum density (with
>>> +-- the way below), we need 45 * 45 = 2025 keys.
>>> +local chars = 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRS'
>>> +for c1inchars:gmatch('.') do
>>> +  for c2inchars:gmatch('.') do
>>> +    big_tab[c1 .. c2] = 1
>>> +  end
>>> +end
>>> +
>>> +jit.opt.start('hotloop=1')
>>> +
>>> +-- Generate bunch of traces.
>>> +for c1inchars:gmatch('.') do
>>> +  for c2inchars:gmatch('.') do
>>> +    loadstring([=[
>>> +      local t = ...
>>> +      for i = 1, 4 do
>>> +        -- HREFK generation.
>>> +        t[ ']=] .. c1 .. c2 .. [=[' ] = i
>>> +      end
>>> +    ]=])(big_tab)
>>> +  end
>>> +end
>>> +
>>> +test:ok(true, 'no assertion failed')
>> I would replace testcase description to something like "emitted assembly
>> is correct".
>>
>> Feel free to ignore.
> It triggers the assertion in the first place, so ignoring.
>
>>> +
>>> +test:done(true)
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.tarantool.org/pipermail/tarantool-patches/attachments/20250626/6f513e4c/attachment.htm>


More information about the Tarantool-patches mailing list