[Tarantool-patches] [PATCH 2/2] uuid: fix unaligned memory access

Aleksandr Lyapunov alyapunov at tarantool.org
Mon May 18 15:55:38 MSK 2020


Thanks for the patch! see my comment below.

On 5/16/20 2:03 AM, Vladislav Shpilevoy wrote:
>   tt_uuid_is_nil(const struct tt_uuid *uu)
>   {
> -	const uint64_t *p = (const uint64_t *) uu;
> -	return !p[0] && !p[1];
> +	const uint32_t *p = (const uint32_t *) uu;
> +	return p[0] == 0 && p[1] == 0 && p[2] == 0 && p[3] == 0;
>   }
>   
>   /**
> @@ -172,9 +172,10 @@ tt_uuid_is_nil(const struct tt_uuid *uu)
>   inline bool
>   tt_uuid_is_equal(const struct tt_uuid *lhs, const struct tt_uuid *rhs)
>   {
> -	const uint64_t *lp = (const uint64_t *) lhs;
> -	const uint64_t *rp = (const uint64_t *) rhs;
> -	return lp[0] == rp[0] && lp[1] == rp[1];
> +	const uint32_t *lp = (const uint32_t *) lhs;
> +	const uint32_t *rp = (const uint32_t *) rhs;
> +	return lp[0] == rp[0] && lp[1] == rp[1] && lp[2] == rp[2] &&
> +	       lp[3] == rp[3];

It seems that we degrade performance just for clang to be happy..
I would suggest to use memcmp in this case.
It's portable and allows a compiler to generate the best possible code.
I've measured it (gcc) and memcmp version is twice faster than your 
solution.
Even for _is_nil method it's better to use memcmp with statically 
allocated zero buffer.



More information about the Tarantool-patches mailing list