Tarantool development patches archive
 help / color / mirror / Atom feed
From: Vladislav Shpilevoy via Tarantool-patches <tarantool-patches@dev.tarantool.org>
To: Mergen Imeev <imeevma@tarantool.org>
Cc: tarantool-patches@dev.tarantool.org
Subject: Re: [Tarantool-patches] [PATCH v1 2/2] sql: introduce syntax for MAP values
Date: Thu, 9 Dec 2021 01:31:39 +0100	[thread overview]
Message-ID: <9b75890d-32e4-aaa9-c377-01854278bc76@tarantool.org> (raw)
In-Reply-To: <20211202083851.GB8207@tarantool.org>

Thanks for the fixes!

>>>>> diff --git a/src/box/sql/expr.c b/src/box/sql/expr.c
>>>>> index 74a98c550..789d8906c 100644
>>>>> --- a/src/box/sql/expr.c
>>>>> +++ b/src/box/sql/expr.c
>>>>> @@ -3432,6 +3432,35 @@ expr_code_array(struct Parse *parser, struct Expr *expr, int reg)
>>>>>  	sqlVdbeAddOp3(vdbe, OP_Array, count, reg, values_reg);
>>>>>  }
>>>>>  
>>>>> +static void
>>>>> +expr_code_map(struct Parse *parser, struct Expr *expr, int reg)
>>>>
>>>> 1. I thought the policy was that we name functions, generating VDBE code,
>>>> using 'emit' suffix. For instance, `vdbe_emit_map()` or `sql_emit_map()`.
>>>> Don't know about prefix though. I see both vdbe_ and sql_ are used.
>>>>
>>> This is usually true, but this function is actually part of sqlExprCodeTarget().
>>> I believe these functions were created to make sqlExprCodeTarget() more
>>> readable. All such functions are named sqlExprCode*(), code*() or
>>> expr_code _*(), for example: sqlExprCodeGetColumn(), codeReal(),
>>> expr_code_int().
>>>
>>> Since all these functions are static, I think we should drop "expr_" prefix for
>>> them. Not in this patch, though.
>>
>> If functions take Expr as an argument like these do, they could be
>> considered methods of Expr. In that case dropping the expr_ prefix would
>> violate our naming convention. It is not about static or global here.
>>
>> As an alternative they could be considered as methods of Parse, but
>> then they would need to have parse_ prefix.
>>
>> For 'code' vs 'emit' - 'code' is fine by me as long as it is static. But
>> if it goes public, then either 'code' or 'emit' must be chosen as one
>> correct suffix. Not a mix.
>>
> After some thought, I think you are right. However, I would suggest removing the
> parser and vdbe from these functions and converting them to proper struct expr
> methods. This way we can make these functions return a value (most likely as an
> "out" argument). For example expr_code_dec() should give us DECIMAL. In this
> case we can make some improvements, for example we can remove "is_neg" from
> expr_code_int() and turn it into expr_code_uint(), since we know that this '-'
> sign will be specified as another expr. Also, since these will be valid expr
> methods, we can drop "static" from their definition. We then should name them
> accordingly, for  example "expr_code_dec" may be named "expr_to_dec".

AFAIU, their value is not only in converting the value, but also in doing
the sqlVdbeAddOp action. You will need to duplicate this call in all places
where expr_to_dec() would be used. The aspiration for refactoring this code
is righteous anyway though.

> diff --git a/src/box/sql/vdbe.c b/src/box/sql/vdbe.c
> index 55e494332..86de3f98a 100644
> --- a/src/box/sql/vdbe.c
> +++ b/src/box/sql/vdbe.c
> @@ -1438,6 +1438,26 @@ case OP_Array: {
>  	break;
>  }
>  
> +/**
> + * Opcode: Map P1 P2 P3 * *
> + * Synopsis: r[P2] = map(P3@P1)
> + *
> + * Construct an MAP value from P1 registers starting at reg(P3).
> + */
> +case OP_Map: {
> +	pOut = &aMem[pOp->p2];
> +
> +	uint32_t size;
> +	struct region *region = &fiber()->gc;
> +	size_t svp = region_used(region);
> +	char *val = mem_encode_map(&aMem[pOp->p3], pOp->p1, &size, region);
> +	if (val == NULL || mem_copy_map(pOut, val, size) != 0) {
> +		region_truncate(region, svp);
> +		goto abort_due_to_error;
> +	}

You should probably truncate the region regardless of the result. Otherwise
in case of success you will leak the region inside of the query bit by bit
while SELECT works:

	box.execute('CREATE TABLE test (id INTEGER PRIMARY KEY)')
	box.execute('INSERT INTO test VALUES (1), (2), (3)')
	box.execute('SELECT {id: id} FROM test')

Here you will do OP_Map 3 times, all will leave some region memory leaked
every time. It is freed in the end of execution probably, but it might do
some big usage while the request is in progress when the row count is much
bigger than 3.

Btw, worth adding a multirow test. All current map tests select a single row.

  reply	other threads:[~2021-12-09  0:32 UTC|newest]

Thread overview: 17+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2021-11-18 14:08 [Tarantool-patches] [PATCH v1 0/2] Introduce syntax for MAP values is SQL Mergen Imeev via Tarantool-patches
2021-11-18 14:08 ` [Tarantool-patches] [PATCH v1 1/2] sql: properly check bind variable names Mergen Imeev via Tarantool-patches
2021-11-20  0:45   ` Vladislav Shpilevoy via Tarantool-patches
2021-11-25  8:33     ` Mergen Imeev via Tarantool-patches
2021-11-30 22:02       ` Vladislav Shpilevoy via Tarantool-patches
2021-12-02  8:32         ` Mergen Imeev via Tarantool-patches
2021-12-09  0:31           ` Vladislav Shpilevoy via Tarantool-patches
2021-12-13  7:34             ` Mergen Imeev via Tarantool-patches
2021-12-13 21:47               ` Vladislav Shpilevoy via Tarantool-patches
2021-11-18 14:08 ` [Tarantool-patches] [PATCH v1 2/2] sql: introduce syntax for MAP values Mergen Imeev via Tarantool-patches
2021-11-20  0:46   ` Vladislav Shpilevoy via Tarantool-patches
2021-11-25  8:55     ` Mergen Imeev via Tarantool-patches
2021-11-30 22:04       ` Vladislav Shpilevoy via Tarantool-patches
2021-12-02  8:38         ` Mergen Imeev via Tarantool-patches
2021-12-09  0:31           ` Vladislav Shpilevoy via Tarantool-patches [this message]
2021-12-13  7:42             ` Mergen Imeev via Tarantool-patches
2021-12-13 21:48               ` Vladislav Shpilevoy via Tarantool-patches

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=9b75890d-32e4-aaa9-c377-01854278bc76@tarantool.org \
    --to=tarantool-patches@dev.tarantool.org \
    --cc=imeevma@tarantool.org \
    --cc=v.shpilevoy@tarantool.org \
    --subject='Re: [Tarantool-patches] [PATCH v1 2/2] sql: introduce syntax for MAP values' \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox