From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: From: Kirill Shcherbatov Subject: [PATCH v4 00/14] box: indexes by JSON path Date: Thu, 11 Oct 2018 10:58:40 +0300 Message-Id: To: tarantool-patches@freelists.org Cc: vdavydov.dev@gmail.com, Kirill Shcherbatov List-ID: http://github.com/tarantool/tarantool/tree/kshch/gh-1012-json-indexes https://github.com/tarantool/tarantool/issues/1012 Sometimes field data could have complex document structure. When this structure is consistent across whole document, you are able to create an index by JSON path. This came possible with auxiliary JSON tree structure that contain intermediate path fields and hashtable. To speed-up data access by JSON index key_part structure extended with offset_slot cache that points to field_map item containing data offset for current tuple. Initialization of the field map is done by traversing the tree to detect vertices that are missing in the msgpack. Introduced offset_slot_cache in key_part to tune data access for typical scenario of using tuples that have same format. Kirill Shcherbatov (14): box: refactor key_def_find routine box: introduce key_def_parts_are_sequential box: introduce tuple_field_by_relative_path box: introduce tuple_format_add_key_part box: introduce tuple_format_sizeof routine box: move tuple_field_go_to_{index,key} definition box: drop format const qualifier in *init_field_map lib: implement JSON tree class for json library lib: introduce json_path_normalize routine box: introduce JSON indexes box: introduce has_json_paths flag in templates box: tune tuple_field_raw_by_path for indexed data box: introduce offset slot cache in key_part box: specify indexes in user-friendly form src/box/alter.cc | 7 +- src/box/blackhole.c | 5 +- src/box/engine.h | 11 +- src/box/errcode.h | 2 +- src/box/index_def.c | 6 +- src/box/key_def.c | 216 ++++++++-- src/box/key_def.h | 52 ++- src/box/lua/index.c | 60 +++ src/box/lua/schema.lua | 22 +- src/box/lua/space.cc | 5 + src/box/memtx_engine.c | 6 +- src/box/memtx_space.c | 5 +- src/box/memtx_space.h | 2 +- src/box/schema.cc | 4 +- src/box/space.c | 4 +- src/box/space.h | 8 +- src/box/sysview.c | 3 +- src/box/tuple.c | 21 +- src/box/tuple_compare.cc | 119 ++++-- src/box/tuple_extract_key.cc | 131 ++++-- src/box/tuple_format.c | 864 +++++++++++++++++++++++++++++++--------- src/box/tuple_format.h | 73 +++- src/box/tuple_hash.cc | 47 ++- src/box/vinyl.c | 9 +- src/box/vy_log.c | 3 +- src/box/vy_lsm.c | 40 +- src/box/vy_point_lookup.c | 2 - src/box/vy_stmt.c | 184 +++++++-- src/lib/json/CMakeLists.txt | 2 + src/lib/json/path.c | 25 ++ src/lib/json/path.h | 18 + src/lib/json/tree.c | 300 ++++++++++++++ src/lib/json/tree.h | 260 ++++++++++++ test/box/misc.result | 57 +-- test/engine/tuple.result | 417 +++++++++++++++++++ test/engine/tuple.test.lua | 120 ++++++ test/unit/json_path.c | 243 ++++++++++- test/unit/json_path.result | 60 ++- test/unit/vy_iterators_helper.c | 6 +- test/unit/vy_mem.c | 2 +- test/unit/vy_point_lookup.c | 2 +- 41 files changed, 2983 insertions(+), 440 deletions(-) create mode 100644 src/lib/json/tree.c create mode 100644 src/lib/json/tree.h -- 2.7.4