From: Sergey Kaplun via Tarantool-patches <tarantool-patches@dev.tarantool.org> To: sergos <sergos@tarantool.org> Cc: tarantool-patches@dev.tarantool.org Subject: Re: [Tarantool-patches] [PATCH luajit] ARM64: Avoid side-effects of constant rematerialization. Date: Wed, 28 Sep 2022 11:44:22 +0300 [thread overview] Message-ID: <YzQJZlOxme8qfuAD@root> (raw) In-Reply-To: <DEB757EB-6D1B-4B7D-ACAE-A91EF7A267B9@tarantool.org> Hi, Sergos! Thanks for the review! On 06.09.22, sergos wrote: > Hi! > > Thanks for the patch! > As I can’t say much about the patch from Mike, LGTM. > Just some nits in the comment. > > Sergos > > > > On 31 Aug 2022, at 12:52, Sergey Kaplun <skaplun@tarantool.org> wrote: > > > > From: Mike Pall <mike> > > > > Thanks to Patrick Galizia. > > > > (cherry picked from commit b33e3f2d441590f4de0d189bd9a65661824a48f6) > > > > Constant rematerialization must not use other registers that contain > > constants, if the register is in-flight. When we have the high > ^^^^^^ > in use? Fixed. > > regitster pressure we can face the following issue: > > > > The assembly of an IR instruction allocates a constant into a free > > register. Then it spills another register (due to high register > > pressure), which is rematerialized using the same constant (which it > > assumes is now in the allocated register). In case when the first > > register also happens to be the destination register, the constant value > > is modified before the rematerialization. > > > > For the code in the test for this commit we get the following register > > allocation order (read from top to bottom (DBG RA reversed)): > > | current IR | operation | IR ref | register > > | 0048 alloc 0038 x0 > > | 0048 remat K038 x0 > > | 0048 alloc K023 x4 > > > > Which leads to the following asembly: > > | ... > > | add x4, x4, x0 # x4 modified before x0 rematerialization > > | ldrb w4, [x4, #24] > > | add x0, x4, #24 # constant x0 rematerialization > > | ... > > As a result, the value register x0 holding is incorrect. > > > > This patch moves allocation of constants for earlier to be sure that the > ^^^ remove it Fixed, thanks! > > > rematerialization can not make use of the same constant as one of the > > sources of the IR instruction. > > > > After the patch register allocation order is the following: > > | current IR | operation | IR ref | register > > | 0048 alloc K023 x4 > > | 0048 alloc 0038 x0 > > | 0048 remat K038 x0 > > > > Also, this patch fixes the `asm_fusexref()` logic for the `IR_STRREF` in > > case, when both operands don't fit in 32-bit constants (`asm_isk32()` > > fails). We want to use the IR operand holds the referenced value in > holding Fixed, thanks! > > > `ra_alloc1()` as one having the hint set (`ra_hashint()` check passes). > > It is set for the operand with a non constant value (`irref_isk()` > > fails). The code assumes that this is always the `ir->op1` operand, so > it Fixed. > > > for cases when this value holds `ir->op2` operand register allocator > the case the Fixed, thanks! Branch is force-pushed. > > > misses the aforementioned hint in `ir->op2`. As the result the wrong > > register is selected. This patch adds the corresponding `irref_isk()` > > check for the `ir->op1` to detect which operand contains the value with > > the hint. > > > > After the patch the resulting assembly is the following: > > | ... > > | add x4, x0, x4 > > | ldrb w4, [x4, #24] > > | add x0, x1, #112 > > | ... > > > > As we can see the constant is rematerialized from another, non-modified > > register. > > > > Sergey Kaplun: > > * added the description and the test for the problem > > > > Part of tarantool/tarantool#7230 > > --- > > > > The test case leads to the coredump when compile with > > -DCMAKE_BUILD_TYPE=[Release, RelWithDebInfo]. > > > > Issue: https://github.com/tarantool/tarantool/issues/7230 > > PRs: > > * https://github.com/LuaJIT/LuaJIT/pull/438 > > * https://github.com/LuaJIT/LuaJIT/pull/479 > > Branch: https://github.com/tarantool/luajit/tree/skaplun/lj-438-arm64-constant-rematerialization-full-ci > > Tarantool PR: https://github.com/tarantool/tarantool/pull/7628 > > > > src/lj_asm_arm64.h | 46 +++++--- > > ...-arm64-constant-rematerialization.test.lua | 102 ++++++++++++++++++ > > 2 files changed, 131 insertions(+), 17 deletions(-) > > create mode 100644 test/tarantool-tests/lj-438-arm64-constant-rematerialization.test.lua > > > > diff --git a/src/lj_asm_arm64.h b/src/lj_asm_arm64.h > > index da0ee4bb..a4de187f 100644 > > --- a/src/lj_asm_arm64.h > > +++ b/src/lj_asm_arm64.h <snipped> > > diff --git a/test/tarantool-tests/lj-438-arm64-constant-rematerialization.test.lua b/test/tarantool-tests/lj-438-arm64-constant-rematerialization.test.lua > > new file mode 100644 > > index 00000000..ffc449bc > > --- /dev/null > > +++ b/test/tarantool-tests/lj-438-arm64-constant-rematerialization.test.lua > > @@ -0,0 +1,102 @@ <snipped> > -- Best regards, Sergey Kaplun
next prev parent reply other threads:[~2022-09-28 8:47 UTC|newest] Thread overview: 5+ messages / expand[flat|nested] mbox.gz Atom feed top 2022-08-31 9:52 Sergey Kaplun via Tarantool-patches 2022-09-06 8:43 ` sergos via Tarantool-patches 2022-09-28 8:44 ` Sergey Kaplun via Tarantool-patches [this message] 2023-03-10 18:00 ` Igor Munkin via Tarantool-patches 2023-03-30 17:38 ` Igor Munkin via Tarantool-patches
Reply instructions: You may reply publicly to this message via plain-text email using any one of the following methods: * Save the following mbox file, import it into your mail client, and reply-to-all from there: mbox Avoid top-posting and favor interleaved quoting: https://en.wikipedia.org/wiki/Posting_style#Interleaved_style * Reply using the --to, --cc, and --in-reply-to switches of git-send-email(1): git send-email \ --in-reply-to=YzQJZlOxme8qfuAD@root \ --to=tarantool-patches@dev.tarantool.org \ --cc=sergos@tarantool.org \ --cc=skaplun@tarantool.org \ --subject='Re: [Tarantool-patches] [PATCH luajit] ARM64: Avoid side-effects of constant rematerialization.' \ /path/to/YOUR_REPLY https://kernel.org/pub/software/scm/git/docs/git-send-email.html * If your mail client supports setting the In-Reply-To header via mailto: links, try the mailto: link
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox