Tarantool development patches archive
From: Sergey Bronnikov via Tarantool-patches <tarantool-patches@dev.tarantool.org>
To: Sergey Kaplun <skaplun@tarantool.org>,
	Evgeniy Temirgaleev <e.temirgaleev@tarantool.org>
Cc: tarantool-patches@dev.tarantool.org
Subject: Re: [Tarantool-patches] [PATCH luajit 3/4] dbg: introduce lj-gco command
Date: Fri, 5 Jun 2026 18:02:28 +0300
Message-ID: <298f77c0-d555-4364-a6d1-c96ab322daf4@tarantool.org>
In-Reply-To: <20260604093052.2221827-4-skaplun@tarantool.org>

Hi, Sergey,

thanks for the patch! LGTM

On 6/4/26 12:30, Sergey Kaplun wrote:
> Our GDB extension already has dumpers for TValues. But sometimes it
> may be useful to dump GC objects (GCobj) without stack context. This
> patch adds additional wrappers around dumpers for GC objects to get the
> corresponding GC object from a TValue. Also, the lj-gco command is
> introduced. It allows dumping GC objects without stack context. The
> output format is the same as for the lj-tv command.
>
> Part of tarantool/tarantool#4808
> ---
>   src/luajit_dbg.py                             | 182 ++++++++++++------
>   .../debug-extension-tests.py                  |  83 ++++++--
>   2 files changed, 190 insertions(+), 75 deletions(-)
>
> diff --git a/src/luajit_dbg.py b/src/luajit_dbg.py
> index 300d65e9..f5868e61 100644
> --- a/src/luajit_dbg.py
> +++ b/src/luajit_dbg.py
> @@ -882,44 +882,29 @@ def lightudV(tv):
>   
>   # Dumpers.
>   
> +# GCobj dumpers.
>   
> -def dump_lj_tnil(tv):
> -    return 'nil'
> -
> -
> -def dump_lj_tfalse(tv):
> -    return 'false'
> -
> -
> -def dump_lj_ttrue(tv):
> -    return 'true'
> -
> -
> -def dump_lj_tlightud(tv):
> -    return 'light userdata @ {}'.format(strx64(lightudV(tv)))
> -
> -
> -def dump_lj_tstr(tv):
> +def dump_lj_gco_str(gcobj):
>       return 'string {body} @ {address}'.format(
> -        body=strdata(gcval(tv['gcr'])),
> -        address=strx64(gcval(tv['gcr']))
> +        body=strdata(gcobj),
> +        address=strx64(gcobj)
>       )
>   
>   
> -def dump_lj_tupval(tv):
> -    return 'upvalue @ {}'.format(strx64(gcval(tv['gcr'])))
> +def dump_lj_gco_upval(gcobj):
> +    return 'upvalue @ {}'.format(strx64(gcobj))
>   
>   
> -def dump_lj_tthread(tv):
> -    return 'thread @ {}'.format(strx64(gcval(tv['gcr'])))
> +def dump_lj_gco_thread(gcobj):
> +    return 'thread @ {}'.format(strx64(gcobj))
>   
>   
> -def dump_lj_tproto(tv):
> -    return 'proto @ {}'.format(strx64(gcval(tv['gcr'])))
> +def dump_lj_gco_proto(gcobj):
> +    return 'proto @ {}'.format(strx64(gcobj))
>   
>   
> -def dump_lj_tfunc(tv):
> -    func = dbg.cast('struct GCfuncC *', gcval(tv['gcr']))
> +def dump_lj_gco_func(gcobj):
> +    func = dbg.cast('struct GCfuncC *', gcobj)
>       ffid = func['ffid']
>   
>       if ffid == 0:
> @@ -936,20 +921,20 @@ def dump_lj_tfunc(tv):
>           return 'fast function #{}'.format(int(ffid))
>   
>   
> -def dump_lj_ttrace(tv):
> -    trace = dbg.cast('struct GCtrace *', gcval(tv['gcr']))
> +def dump_lj_gco_trace(gcobj):
> +    trace = dbg.cast('struct GCtrace *', gcobj)
>       return 'trace {traceno} @ {addr}'.format(
>           traceno=strx64(trace['traceno']),
>           addr=strx64(trace)
>       )
>   
>   
> -def dump_lj_tcdata(tv):
> -    return 'cdata @ {}'.format(strx64(gcval(tv['gcr'])))
> +def dump_lj_gco_cdata(gcobj):
> +    return 'cdata @ {}'.format(strx64(gcobj))
>   
>   
> -def dump_lj_ttab(tv):
> -    table = dbg.cast('GCtab *', gcval(tv['gcr']))
> +def dump_lj_gco_tab(gcobj):
> +    table = dbg.cast('GCtab *', gcobj)
>       return 'table @ {gcr} (asize: {asize}, hmask: {hmask})'.format(
>           gcr=strx64(table),
>           asize=table['asize'],
> @@ -957,41 +942,94 @@ def dump_lj_ttab(tv):
>       )
>   
>   
> -def dump_lj_tudata(tv):
> -    return 'userdata @ {}'.format(strx64(gcval(tv['gcr'])))
> +def dump_lj_gco_udata(gcobj):
> +    return 'userdata @ {}'.format(strx64(gcobj))
> +
> +
> +def dump_lj_gco_invalid(gcobj):
> +    return 'not valid type @ {}'.format(strx64(gcobj))
> +
> +
> +# TValue dumpers
> +
> +def dump_lj_tv_nil(tv):
> +    return 'nil'
> +
> +
> +def dump_lj_tv_false(tv):
> +    return 'false'
> +
> +
> +def dump_lj_tv_true(tv):
> +    return 'true'
> +
> +
> +def dump_lj_tv_lightud(tv):
> +    return 'light userdata @ {}'.format(strx64(lightudV(tv)))
> +
> +
> +# Generate wrappers for TValues containing GCobj.
> +gco_fn_dumpers = [
> +    fn for fn in globals().keys() if fn.startswith('dump_lj_gco')
> +]
> +for fn_name in gco_fn_dumpers:
> +    wrapped_fn_name = fn_name.replace('gco', 'tv')
> +    # A lambda captures `fn_name` by reference (late binding), so an
> +    # extra lambda is needed to bind the current name to each wrapper.
> +    globals()[wrapped_fn_name] = (lambda f: (
> +        lambda tv: globals()[f](gcval(tv['gcr']))
> +    ))(fn_name)
>   
>   
> -def dump_lj_tnumx(tv):
> +def dump_lj_tv_numx(tv):
>       if tvisint(tv):
>           return 'integer {}'.format(dbg.cast('int32_t', tv['i']))
>       else:
>           return 'number {}'.format(dbg.cast('double', tv['n']))
>   
>   
> -def dump_lj_invalid(tv):
> -    return 'not valid type @ {}'.format(strx64(gcval(tv['gcr'])))
> -
> -
> -dumpers = {
> -    'LJ_TNIL':     dump_lj_tnil,
> -    'LJ_TFALSE':   dump_lj_tfalse,
> -    'LJ_TTRUE':    dump_lj_ttrue,
> -    'LJ_TLIGHTUD': dump_lj_tlightud,
> -    'LJ_TSTR':     dump_lj_tstr,
> -    'LJ_TUPVAL':   dump_lj_tupval,
> -    'LJ_TTHREAD':  dump_lj_tthread,
> -    'LJ_TPROTO':   dump_lj_tproto,
> -    'LJ_TFUNC':    dump_lj_tfunc,
> -    'LJ_TTRACE':   dump_lj_ttrace,
> -    'LJ_TCDATA':   dump_lj_tcdata,
> -    'LJ_TTAB':     dump_lj_ttab,
> -    'LJ_TUDATA':   dump_lj_tudata,
> -    'LJ_TNUMX':    dump_lj_tnumx,
> +gco_dumpers = {
> +    'LJ_TSTR':     dump_lj_gco_str,
> +    'LJ_TUPVAL':   dump_lj_gco_upval,
> +    'LJ_TTHREAD':  dump_lj_gco_thread,
> +    'LJ_TPROTO':   dump_lj_gco_proto,
> +    'LJ_TFUNC':    dump_lj_gco_func,
> +    'LJ_TTRACE':   dump_lj_gco_trace,
> +    'LJ_TCDATA':   dump_lj_gco_cdata,
> +    'LJ_TTAB':     dump_lj_gco_tab,
> +    'LJ_TUDATA':   dump_lj_gco_udata,
>   }
>   
>   
> +tv_dumpers = {
> +    'LJ_TNIL':     dump_lj_tv_nil,
> +    'LJ_TFALSE':   dump_lj_tv_false,
> +    'LJ_TTRUE':    dump_lj_tv_true,
> +    'LJ_TLIGHTUD': dump_lj_tv_lightud,
> +    'LJ_TSTR':     dump_lj_tv_str,  # noqa: F821 # Generated.
> +    'LJ_TUPVAL':   dump_lj_tv_upval,  # noqa: F821 # Generated.
> +    'LJ_TTHREAD':  dump_lj_tv_thread,  # noqa: F821 # Generated.
> +    'LJ_TPROTO':   dump_lj_tv_proto,  # noqa: F821 # Generated.
> +    'LJ_TFUNC':    dump_lj_tv_func,  # noqa: F821 # Generated.
> +    'LJ_TTRACE':   dump_lj_tv_trace,  # noqa: F821 # Generated.
> +    'LJ_TCDATA':   dump_lj_tv_cdata,  # noqa: F821 # Generated.
> +    'LJ_TTAB':     dump_lj_tv_tab,  # noqa: F821 # Generated.
> +    'LJ_TUDATA':   dump_lj_tv_udata,  # noqa: F821 # Generated.
> +    'LJ_TNUMX':    dump_lj_tv_numx,
> +}
> +
> +
> +def dump_gcobj(gcobj):
> +    return gco_dumpers.get(
> +        typenames(i2notu32(gcobj['gch']['gct'])), dump_lj_gco_invalid
> +    )(gcobj)
> +
> +
>   def dump_tvalue(tvalue):
> -    return dumpers.get(typenames(itypemap(tvalue)), dump_lj_invalid)(tvalue)
> +    return tv_dumpers.get(
> +        typenames(itypemap(tvalue)),
> +        dump_lj_tv_invalid  # noqa: F821 # Generated.
> +    )(tvalue)
>   
>   
>   def dump_framelink_slot_address(fr):
> @@ -1011,7 +1049,7 @@ def dump_framelink(L, fr):
>               p='P' if frame_typep(fr) & FRAME_P else ''
>           ),
>           d=dbg.cast('TValue *', fr) - dbg.cast('TValue *', frame_prev(fr)),
> -        f=dump_lj_tfunc(fr - LJ_FR2),
> +        f=dump_lj_tv_func(fr - LJ_FR2),  # noqa: F821 # Generated.
>       )
>   
>   
> @@ -1141,6 +1179,35 @@ The command requires no args and dumps current GC stats:
>           ))
>   
>   
> +class LJDumpGCobj(dbg.LJBase):
> +    '''
> +lj-gco <GCobj *>
> +
> +The command receives a pointer to <GCobj> (GCobj address) and dumps
> +the type and some info related to it.
> +
> +* LJ_TSTR: string <string payload> @ <gcr>
> +* LJ_TUPVAL: upvalue @ <gcr>
> +* LJ_TTHREAD: thread @ <gcr>
> +* LJ_TPROTO: proto @ <gcr>
> +* LJ_TFUNC: <LFUNC|CFUNC|FFUNC>
> +  <LFUNC>: Lua function @ <gcr>, <nupvals> upvalues,<chunk:line>
> +  <CFUNC>: C function <mcode address>
> +  <FFUNC>: fast function #<ffid>
> +* LJ_TTRACE: trace <traceno> @ <gcr>
> +* LJ_TCDATA: cdata @ <gcr>
> +* LJ_TTAB: table @ <gcr> (asize: <asize>, hmask: <hmask>)
> +* LJ_TUDATA: userdata @ <gcr>
> +
> +If the type of the object at the given address differs from those listed
> +above, an error message is printed.
> +    '''
> +
> +    def execute(self, arg):
> +        gcobj = dbg.cast('GCobj *', dbg.eval(arg))
> +        dbg.write('{}\n'.format(dump_gcobj(gcobj)))
> +
> +
>   class LJDumpStack(dbg.LJBase):
>       '''
>   lj-stack [<lua_State *>]
> @@ -1302,6 +1369,7 @@ def load(event=None):
>       dbg.initialize_extension({
>           'lj-arch':  LJDumpArch,
>           'lj-gc':    LJGC,
> +        'lj-gco':   LJDumpGCobj,
>           'lj-stack': LJDumpStack,
>           'lj-state': LJState,
>           'lj-str':   LJDumpString,
> diff --git a/test/tarantool-debugger-tests/debug-extension-tests.py b/test/tarantool-debugger-tests/debug-extension-tests.py
> index 06a118ff..7e2b5ac4 100644
> --- a/test/tarantool-debugger-tests/debug-extension-tests.py
> +++ b/test/tarantool-debugger-tests/debug-extension-tests.py
> @@ -138,6 +138,17 @@ class TestCaseBase(unittest.TestCase):
>               self.assertRegex(self.output, self.pattern.strip())
>   
>   
> +# LLDB + Clang on macOS can't produce debug info for C-defined
> +# macros. Thus, the `gcval()` expansion is hardcoded here.
> +def gcval(arg):
> +    if sys.platform == 'darwin':
> +        # Assume GC64 build only.
> +        LJ_GCVMASK = '(((uint64_t)1 << 47) - 1)'
> +        return '(((' + arg + ')->gcr).gcptr64 & ' + LJ_GCVMASK + ')'
> +    else:
> +        return 'gcval(' + arg + ')'
> +
> +
>   class TestLoad(TestCaseBase):
>       extension_cmds = ''
>       location = 'lj_cf_print'
> @@ -145,6 +156,7 @@ class TestLoad(TestCaseBase):
>       pattern = (
>           r'lj-arch command initialized\n'
>           r'lj-gc command initialized\n'
> +        r'lj-gco command initialized\n'
>           r'lj-stack command initialized\n'
>           r'lj-state command initialized\n'
>           r'lj-str command initialized\n'
> @@ -223,6 +235,31 @@ class TestLJStackFunc(TestCaseBase):
>       pattern = STACK_RX
>   
>   
> +# Sorted in LJT order.
> +GCO_ARGS = (
> +    '"hello",\n'
> +    'coroutine.create(function() end),\n'
> +    'function() end,\n'
> +    'require,\n'
> +    'print,\n'
> +    'ffi.new("int*"),\n'
> +    '{1},\n'
> +    'newproxy(),\n'
> +)
> +
> +
> +GCO_RX = (
> +    r'string \"hello\" @ ' + RX_ADDR + r'\n'
> +    r'thread @ ' + RX_ADDR + r'\n'
> +    r'Lua function @ ' + RX_ADDR + r', [0-9]+ upvalues, .+:[0-9]+\n'
> +    r'C function @ ' + RX_ADDR + r'\n'
> +    r'fast function #[0-9]+\n'
> +    r'cdata @ ' + RX_ADDR + r'\n'
> +    r'table @ ' + RX_ADDR + r' \(asize: \d+, hmask: ' + RX_HASH + r'\)\n'
> +    r'userdata @ ' + RX_ADDR + r'\n'
> +)
> +
> +
>   class TestLJTV(TestCaseBase):
>       location = 'lj_cf_print'
>       extension_cmds = (
> @@ -249,15 +286,8 @@ class TestLJTV(TestCaseBase):
>           '  nil,\n'
>           '  false,\n'
>           '  true,\n'
> -        '  debug.upvalueid(print, 1), \n'  # lightuserdata
> -        '  "hello",\n'
> -        '  coroutine.create(function() end),\n'
> -        '  function() end,\n'
> -        '  require,\n'
> -        '  print,\n'
> -        '  ffi.new("int*"),\n'
> -        '  {1},\n'
> -        '  newproxy(),\n'
> +        '  debug.upvalueid(print, 1), \n' +  # lightuserdata
> +        GCO_ARGS +
>           '  1,\n'
>           '  1.1\n'
>           ')\n'
> @@ -267,15 +297,8 @@ class TestLJTV(TestCaseBase):
>           r'nil\n'
>           r'false\n'
>           r'true\n'
> -        r'light userdata @ ' + RX_ADDR + r'\n'
> -        r'string \"hello\" @ ' + RX_ADDR + r'\n'
> -        r'thread @ ' + RX_ADDR + r'\n'
> -        r'Lua function @ ' + RX_ADDR + r', [0-9]+ upvalues, .+:[0-9]+\n'
> -        r'C function @ ' + RX_ADDR + r'\n'
> -        r'fast function #[0-9]+\n'
> -        r'cdata @ ' + RX_ADDR + r'\n'
> -        r'table @ ' + RX_ADDR + r' \(asize: \d+, hmask: ' + RX_HASH + r'\)\n'
> -        r'userdata @ ' + RX_ADDR + r'\n'
> +        r'light userdata @ ' + RX_ADDR + r'\n' +
> +        GCO_RX +
>           RX_INT + r' .*1.*\n'
>           r'number 1.1\d+\n'
>       )
> @@ -312,6 +335,30 @@ class TestLJTab(TestCaseBase):
>       )
>   
>   
> +class TestLJGCo(TestCaseBase):
> +    location = 'lj_cf_print'
> +    extension_cmds = (
> +        'lj-gco ' + gcval('L->base + 0') + '\n'
> +        'lj-gco ' + gcval('L->base + 1') + '\n'
> +        'lj-gco ' + gcval('L->base + 2') + '\n'
> +        'lj-gco ' + gcval('L->base + 3') + '\n'
> +        'lj-gco ' + gcval('L->base + 4') + '\n'
> +        'lj-gco ' + gcval('L->base + 5') + '\n'
> +        'lj-gco ' + gcval('L->base + 6') + '\n'
> +        'lj-gco ' + gcval('L->base + 7') + '\n'
> +    )
> +
> +    lua_script = (
> +        'local ffi = require("ffi")\n'
> +        'print(\n' +
> +        GCO_ARGS +
> +        '  1\n'  # Stub for the pattern.
> +        ')\n'
> +    )
> +
> +    pattern = GCO_RX
> +
> +
>   for test_cls in TestCaseBase.__subclasses__():
>       test_cls.test = lambda self: self.check()
>   
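For readers wondering why the wrapper generation above needs the
double lambda: a minimal sketch of the late-binding pitfall it avoids.
It runs in plain Python without a debugger. The toy dumpers, the
GCO_DUMPERS dict and the dict-based "TValue" are hypothetical
stand-ins, not code from the patch.

```python
def dump_gco_str(gcobj):
    return 'string @ {}'.format(gcobj)


def dump_gco_tab(gcobj):
    return 'table @ {}'.format(gcobj)


# Hypothetical stand-in for the GCobj dumpers from the patch.
GCO_DUMPERS = {'str': dump_gco_str, 'tab': dump_gco_tab}


def make_wrappers_naive(dumpers):
    wrappers = {}
    for name in dumpers:
        # `name` is looked up when the lambda is called, i.e. after
        # the loop has finished, so every wrapper sees the last key.
        wrappers[name] = lambda tv: dumpers[name](tv['gcr'])
    return wrappers


def make_wrappers_fixed(dumpers):
    wrappers = {}
    for name in dumpers:
        # The outer lambda is called at once, so `n` is bound to the
        # current key before the next iteration changes `name`.
        wrappers[name] = (
            lambda n: (lambda tv: dumpers[n](tv['gcr']))
        )(name)
    return wrappers


tv = {'gcr': '0x1234'}  # Toy TValue: a dict instead of a debugger value.
naive = make_wrappers_naive(GCO_DUMPERS)
fixed = make_wrappers_fixed(GCO_DUMPERS)
print(naive['str'](tv))  # table @ 0x1234 (wrong dumper)
print(fixed['str'](tv))  # string @ 0x1234
```

A default argument (`lambda tv, n=name: ...`) or `functools.partial`
would bind the value as well; the immediately-called outer lambda is
simply the form the patch uses.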

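The hardcoded gcval() expansion in the test helper can be illustrated
with plain integers. This is only a sketch under the same "GC64 build
only" assumption as the patch. The mask value is taken from the
helper. The sample address, the tag value and the idea that the tag
occupies the bits above bit 46 are assumptions made for illustration.

```python
# Mirrors the mask hardcoded in the test helper: the low 47 bits.
LJ_GCVMASK = (1 << 47) - 1


def gcval(raw):
    """Strip the upper (tag) bits from a 64-bit TValue payload."""
    return raw & LJ_GCVMASK


ptr = 0x7f1234567890     # Hypothetical GCobj address (fits in 47 bits).
tag = 0x1fff4            # Hypothetical 17-bit type tag.
raw = (tag << 47) | ptr  # What the 64-bit slot would hold.

print(hex(gcval(raw)))   # 0x7f1234567890
```

On platforms where the debugger can evaluate the `gcval()` macro
itself, the helper passes the expression through unchanged, so this
arithmetic only matters for the macOS branch.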

Thread overview: 13+ messages
2026-06-04  9:30 [Tarantool-patches] [PATCH luajit 0/4] Introduce dumpers for bytecodes in debuggers Sergey Kaplun via Tarantool-patches
2026-06-04  9:30 ` [Tarantool-patches] [PATCH luajit 1/4] dbg: fix lj-stack command for LLDB Sergey Kaplun via Tarantool-patches
2026-06-05 14:55   ` Sergey Bronnikov via Tarantool-patches
2026-06-04  9:30 ` [Tarantool-patches] [PATCH luajit 2/4] dbg: fix DUALNUM detection " Sergey Kaplun via Tarantool-patches
2026-06-05 14:57   ` Sergey Bronnikov via Tarantool-patches
2026-06-05 16:01     ` Sergey Kaplun via Tarantool-patches
2026-06-04  9:30 ` [Tarantool-patches] [PATCH luajit 3/4] dbg: introduce lj-gco command Sergey Kaplun via Tarantool-patches
2026-06-05 15:02   ` Sergey Bronnikov via Tarantool-patches [this message]
2026-06-04  9:30 ` [Tarantool-patches] [PATCH luajit 4/4] dbg: introduce lj-bc, lj-func and lj-proto dumpers Sergey Kaplun via Tarantool-patches
2026-06-05 15:07   ` Sergey Bronnikov via Tarantool-patches
2026-06-05 16:10     ` Sergey Kaplun via Tarantool-patches
2026-06-05 14:55 ` [Tarantool-patches] [PATCH luajit 0/4] Introduce dumpers for bytecodes in debuggers Sergey Bronnikov via Tarantool-patches
2026-06-05 16:03 ` [Tarantool-patches] [PATCH luajit 3/5] dbg: update help for the lj-arch command Sergey Kaplun via Tarantool-patches