From: Serge Petrenko via Tarantool-patches <tarantool-patches@dev.tarantool.org>
To: Timur Safin <tsafin@tarantool.org>
Cc: tarantool-patches@dev.tarantool.org, v.shpilevoy@tarantool.org
Subject: Re: [Tarantool-patches] [PATCH v5 0/8] Initial datetime implementation
Date: Tue, 17 Aug 2021 15:15:05 +0300 [thread overview]
Message-ID: <65b51a68-cdc9-2333-09c3-56c9e6be2411@tarantool.org> (raw)
In-Reply-To: <cover.1629071531.git.tsafin@tarantool.org>
16.08.2021 02:59, Timur Safin via Tarantool-patches пишет:
> * Version #5 changes:
>
> - After performance evaluations it was decided to switch data type used
> by `datetime.secs` from `int64_t` (which is boxed in LuaJIT) to
> `double` (which is unboxed number).
> - please see perf/datetime*.cc perf test as a benchmark we used for
> comparison of 2 veriants of C implementation. Results are here -
> https://gist.github.com/tsafin/618fbf847d258f6e7f5a75fdf9ea945b
Hi! Thanks for the patchset!
Generally looks ok. Most of my comments are style related,
except the ones covering patches 4, 5.
I like the amount of tests you introduce.
> * Version #4 changes:
>
> A lot.
>
> - Got rid of closures usage here and there. Benchmarks revealed there is
> 20x speed difference of hash with closure handlers to the case when we
> check keys using sequence of simple ifs.
> And despite the fact that it looks ugler, we have updated all places which
> used to use closures - so much performance impact does not worth it;
>
> - c-dt library now is accompanied with corresponding unit-tests;
>
> - due to a new (old) way of exporting public symbols, which is now used by
> Tarantool build, we have modified c-dt sources to use approach similar
> to that used by xxhash (or ICU) - "namespaces" for public symbols.
> If there is DT_NAMESPACE macro (e.g. DT_NAMESPACE=tnt_) defined at the
> moment of compilation we rename all relevant public symbols to
> use that "namespace" as prefix
>
> #define dt_from_rdn DT_NAME(DT_NAMESPACE, dt_from_rdn)
>
> C sources continue to use original name (now as a macro), but in Lua ffi
> we have to use renamed symbol - `tnt_dt_from_rdn`.
>
> - We have separated code which implement MessagePack related funcionality
> from generic (i.e. stringization) support for datetime types.
> Stringization is in `src/core/datetime.c`, but MessagePack support is in
> `src/core/mp_datetime.c`.
>
> - `ctime` and `asctime` now use reentrant versions (i.e. `ctime_r` instead of
> `ctime`, and `asctime_r` instead of `asctime`). This allows us to avoid
> nasty problem with GC where it was possible to corrupt returned
> static buffer regardless the fact that it was immediately passed
> to `ffi.string`.
> `test/app-tap/gh-5632-6050-6259-gc-buf-reuse.test.lua` updated to check our
> new api `datetime.ctime`, and `datetime.asctime`.
>
> - Introduced MessagePack unit-test which check C implementation (mp
> encode/decode, mp_snprint, mp_fprint)
>
> - Some special efforts have been taken to make sure we deal correctly
> with datetime stamps which were before Unix Epoch (1970-01-01), i.e.
> for negative stamps or timezones against epoch.
>
> - And last, but not the least: in addition to usual getters like
> `timestamp`, `nanoseconds`, `epoch`, we have introduced function
> modifiers `to_utc()` or `to_tz(offset)` which allow to manipulate with
> datetime timezone `.offset` field. Actually it does not do much now
> (only reattributing object with different offset, but not changed
> actual timestamp, which is UTC normalized). But different timestamp
> modifies textual representation, which simplify debugging.
>
> * Version #3 changes:
>
> - renamed `struct datetime_t` to `struct datetime`, and `struct
> datetime_interval_t` to `struct datetime_interval`;
> - significantly reworked arguments checks in module api entries;
> - fixed datetime comparisons;
> - changed hints calculation to take into account fractional part;
> - provided more comments here and there;
>
> NB! There are MacOSX problems due to GLIBC specific code used (as Vlad
> has already pointed out) - so additional patch, making it more
> cross-compatible coming here soon...
>
> * Version #2 changes:
>
> - fixed problem with overloaded '-' and '+' operations for datetime
> arguments;
> - fixed messagepack serialization problems;
> - heavily documented MessagePack serialization schema in the code;
> - introduced working implementation of datetime hints for storage engines;
> - made interval related names be more consistent, renamed durations and period
> to intervals, i.e. t_datetime_duration to datetime_interval_t,
> duration_* to interval_*, period to interval;
> - properly implemented all reasonable cases of datetime+interval
> arithmetic;
> - moved all initialization code to utils.c;
> - renamed core/mp_datetime.c to core/datetime.c because it makes more
> sense now;
>
> * Version #1 - initial RFC
>
> In brief
> --------
>
> This patchset implements datetime lua support in box, with serialization
> to messagepack, yaml, json and lua mode. Also it contains storage
> engines' indices implementation for datetime type introduced.
>
> * Current implementation is heavily influenced by Sci-Lua lua-time module
> https://github.com/stepelu/lua-time
> e.g. you could find very similar approach for handling of operations
> with year or month long intervals (which should be handled differently than
> usual intervals of seconds, or days).
>
> * But internally we actually use Christian Hanson' c-dt module
> https://github.com/chansen/c-dt
> (though it has been modified slightly for cleaner integration
> into our cmake-based build process).
>
>
> Datetime Module API
> -------------------
>
> Small draft of documentation for our `datetime` module API has been
> extracted to the discussion topic there -
> https://github.com/tarantool/tarantool/discussions/6244#discussioncomment-1043988
> We try to update documentation there once any relevant change introdcued
> to the implementaton.
>
> Messagepack serialization schema
> --------------------------------
>
> In short it looks like:
> - now we introduce new MP_EXT extension type #4;
> - we may save 1 required and 2 optional fields for datetime field;
>
> In all gory details it's explained in MessagePack serialization schema depicted here:
> https://github.com/tarantool/tarantool/discussions/6244#discussioncomment-1043990
>
>
> https://github.com/tarantool/tarantool/issues/5941
> https://github.com/tarantool/tarantool/issues/5946
>
> https://github.com/tarantool/tarantool/tree/tsafin/gh-5941-datetime-v4
>
> Timur Safin (8):
> build: add Christian Hansen c-dt to the build
> lua: built-in module datetime
> lua, datetime: display datetime
> box, datetime: messagepack support for datetime
> box, datetime: datetime comparison for indices
> lua, datetime: time intervals support
> datetime: perf test for datetime parser
> datetime: changelog for datetime module
>
> .gitmodules | 3 +
> CMakeLists.txt | 8 +
> .../gh-5941-datetime-type-support.md | 4 +
> cmake/BuildCDT.cmake | 10 +
> extra/exports | 33 +
> perf/CMakeLists.txt | 3 +
> perf/datetime-common.h | 105 +++
> perf/datetime-compare.cc | 213 +++++
> perf/datetime-parser.cc | 105 +++
> src/CMakeLists.txt | 5 +-
> src/box/field_def.c | 53 +-
> src/box/field_def.h | 4 +
> src/box/lua/serialize_lua.c | 7 +-
> src/box/memtx_space.c | 3 +-
> src/box/msgpack.c | 7 +-
> src/box/tuple_compare.cc | 77 ++
> src/box/vinyl.c | 3 +-
> src/lib/core/CMakeLists.txt | 5 +-
> src/lib/core/datetime.c | 176 ++++
> src/lib/core/datetime.h | 115 +++
> src/lib/core/mp_datetime.c | 189 ++++
> src/lib/core/mp_datetime.h | 89 ++
> src/lib/core/mp_extension_types.h | 1 +
> src/lib/mpstream/mpstream.c | 11 +
> src/lib/mpstream/mpstream.h | 4 +
> src/lua/datetime.lua | 880 ++++++++++++++++++
> src/lua/init.c | 4 +-
> src/lua/msgpack.c | 12 +
> src/lua/msgpackffi.lua | 18 +
> src/lua/serializer.c | 4 +
> src/lua/serializer.h | 2 +
> src/lua/utils.c | 28 +-
> src/lua/utils.h | 12 +
> test/app-tap/datetime.test.lua | 370 ++++++++
> .../gh-5632-6050-6259-gc-buf-reuse.test.lua | 74 +-
> test/engine/datetime.result | 77 ++
> test/engine/datetime.test.lua | 35 +
> test/unit/CMakeLists.txt | 3 +-
> test/unit/datetime.c | 381 ++++++++
> test/unit/datetime.result | 471 ++++++++++
> third_party/c-dt | 1 +
> third_party/lua-cjson/lua_cjson.c | 8 +
> third_party/lua-yaml/lyaml.cc | 6 +-
> 43 files changed, 3591 insertions(+), 28 deletions(-)
> create mode 100644 changelogs/unreleased/gh-5941-datetime-type-support.md
> create mode 100644 cmake/BuildCDT.cmake
> create mode 100644 perf/datetime-common.h
> create mode 100644 perf/datetime-compare.cc
> create mode 100644 perf/datetime-parser.cc
> create mode 100644 src/lib/core/datetime.c
> create mode 100644 src/lib/core/datetime.h
> create mode 100644 src/lib/core/mp_datetime.c
> create mode 100644 src/lib/core/mp_datetime.h
> create mode 100644 src/lua/datetime.lua
> create mode 100755 test/app-tap/datetime.test.lua
> create mode 100644 test/engine/datetime.result
> create mode 100644 test/engine/datetime.test.lua
> create mode 100644 test/unit/datetime.c
> create mode 100644 test/unit/datetime.result
> create mode 160000 third_party/c-dt
>
--
Serge Petrenko
next prev parent reply other threads:[~2021-08-17 12:15 UTC|newest]
Thread overview: 50+ messages / expand[flat|nested] mbox.gz Atom feed top
2021-08-15 23:59 Timur Safin via Tarantool-patches
2021-08-15 23:59 ` [Tarantool-patches] [PATCH v5 1/8] build: add Christian Hansen c-dt to the build Timur Safin via Tarantool-patches
2021-08-17 12:15 ` Serge Petrenko via Tarantool-patches
2021-08-17 23:24 ` Safin Timur via Tarantool-patches
2021-08-18 8:56 ` Serge Petrenko via Tarantool-patches
2021-08-17 15:50 ` Vladimir Davydov via Tarantool-patches
2021-08-18 10:04 ` Safin Timur via Tarantool-patches
2021-08-15 23:59 ` [Tarantool-patches] [PATCH v5 2/8] lua: built-in module datetime Timur Safin via Tarantool-patches
2021-08-17 12:15 ` Serge Petrenko via Tarantool-patches
2021-08-17 23:30 ` Safin Timur via Tarantool-patches
2021-08-18 8:56 ` Serge Petrenko via Tarantool-patches
2021-08-17 16:52 ` Vladimir Davydov via Tarantool-patches
2021-08-17 19:16 ` Vladimir Davydov via Tarantool-patches
2021-08-18 13:38 ` Safin Timur via Tarantool-patches
2021-08-18 10:03 ` Safin Timur via Tarantool-patches
2021-08-18 10:06 ` Safin Timur via Tarantool-patches
2021-08-18 11:45 ` Vladimir Davydov via Tarantool-patches
2021-08-15 23:59 ` [Tarantool-patches] [PATCH v5 3/8] lua, datetime: display datetime Timur Safin via Tarantool-patches
2021-08-17 12:15 ` Serge Petrenko via Tarantool-patches
2021-08-17 23:32 ` Safin Timur via Tarantool-patches
2021-08-17 17:06 ` Vladimir Davydov via Tarantool-patches
2021-08-18 14:10 ` Safin Timur via Tarantool-patches
2021-08-15 23:59 ` [Tarantool-patches] [PATCH v5 4/8] box, datetime: messagepack support for datetime Timur Safin via Tarantool-patches
2021-08-16 0:20 ` Safin Timur via Tarantool-patches
2021-08-17 12:15 ` Serge Petrenko via Tarantool-patches
2021-08-17 12:16 ` Serge Petrenko via Tarantool-patches
2021-08-17 23:42 ` Safin Timur via Tarantool-patches
2021-08-18 9:01 ` Serge Petrenko via Tarantool-patches
2021-08-17 18:36 ` Vladimir Davydov via Tarantool-patches
2021-08-18 14:27 ` Safin Timur via Tarantool-patches
2021-08-15 23:59 ` [Tarantool-patches] [PATCH v5 5/8] box, datetime: datetime comparison for indices Timur Safin via Tarantool-patches
2021-08-17 12:16 ` Serge Petrenko via Tarantool-patches
2021-08-17 23:43 ` Safin Timur via Tarantool-patches
2021-08-18 9:03 ` Serge Petrenko via Tarantool-patches
2021-08-17 19:05 ` Vladimir Davydov via Tarantool-patches
2021-08-18 17:18 ` Safin Timur via Tarantool-patches
2021-08-15 23:59 ` [Tarantool-patches] [PATCH v5 6/8] lua, datetime: time intervals support Timur Safin via Tarantool-patches
2021-08-17 12:16 ` Serge Petrenko via Tarantool-patches
2021-08-17 23:44 ` Safin Timur via Tarantool-patches
2021-08-17 18:52 ` Vladimir Davydov via Tarantool-patches
2021-08-15 23:59 ` [Tarantool-patches] [PATCH v5 7/8] datetime: perf test for datetime parser Timur Safin via Tarantool-patches
2021-08-17 19:13 ` Vladimir Davydov via Tarantool-patches
2021-08-15 23:59 ` [Tarantool-patches] [PATCH v5 8/8] datetime: changelog for datetime module Timur Safin via Tarantool-patches
2021-08-17 12:16 ` Serge Petrenko via Tarantool-patches
2021-08-17 23:44 ` Safin Timur via Tarantool-patches
2021-08-18 9:04 ` Serge Petrenko via Tarantool-patches
2021-08-17 12:15 ` Serge Petrenko via Tarantool-patches [this message]
[not found] ` <20210818082222.mofgheciutpipelz@esperanza>
2021-08-18 8:25 ` [Tarantool-patches] [PATCH v5 0/8] Initial datetime implementation Vladimir Davydov via Tarantool-patches
2021-08-18 13:24 ` Safin Timur via Tarantool-patches
2021-08-18 14:22 ` Vladimir Davydov via Tarantool-patches
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=65b51a68-cdc9-2333-09c3-56c9e6be2411@tarantool.org \
--to=tarantool-patches@dev.tarantool.org \
--cc=sergepetrenko@tarantool.org \
--cc=tsafin@tarantool.org \
--cc=v.shpilevoy@tarantool.org \
--subject='Re: [Tarantool-patches] [PATCH v5 0/8] Initial datetime implementation' \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox