From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from localhost (localhost [127.0.0.1]) by turing.freelists.org (Avenir Technologies Mail Multiplex) with ESMTP id A9792294AA for ; Mon, 27 Aug 2018 03:37:33 -0400 (EDT) Received: from turing.freelists.org ([127.0.0.1]) by localhost (turing.freelists.org [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id UKD7AA0gPVOp for ; Mon, 27 Aug 2018 03:37:33 -0400 (EDT) Received: from smtpng2.m.smailru.net (smtpng2.m.smailru.net [94.100.179.3]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by turing.freelists.org (Avenir Technologies Mail Multiplex) with ESMTPS id 5F363294A9 for ; Mon, 27 Aug 2018 03:37:33 -0400 (EDT) From: Kirill Shcherbatov Subject: [tarantool-patches] [PATCH v3 0/4] box: indexes by JSON path Date: Mon, 27 Aug 2018 10:37:26 +0300 Message-Id: Sender: tarantool-patches-bounce@freelists.org Errors-to: tarantool-patches-bounce@freelists.org Reply-To: tarantool-patches@freelists.org List-help: List-unsubscribe: List-software: Ecartis version 1.0.0 List-Id: tarantool-patches List-subscribe: List-owner: List-post: List-archive: To: tarantool-patches@freelists.org Cc: v.shpilevoy@tarantool.org, Kirill Shcherbatov Branch: http://github.com/tarantool/tarantool/tree/kshch/gh-1012-json-indexes Issue: https://github.com/tarantool/tarantool/issues/1012 Sometimes field data could have complex document structure. When this structure is consistent across whole document, you are able to create an index by JSON path. This came possible with auxiliary structures per tuple_format: tree of intermediate path fields and hashtable refers to leaf field that use path as key. To speed-up data access by JSON index key_part structure extended with offset_slot cache that points to field_map item containing data offset for current tuple. RFC contains detailed description of those concepts. Finally, supported ability to define JSON paths in user-friendly form containing format field name(that could be changed). Changes in v3: - fixed JSON nullable fields - multiple binary optimizations for field map init for fields that have JSON paths - don't store same JSON path of different indexes twice in format for now - optimized and simplified tuple_field_by_part_raw - new teplate-based comparators/extractors - removed update space epoch to ModifySpaceFormat alter op - fixed key_def_find routine - much informative comments - exported LUA path resolve to core Kirill Shcherbatov (4): rfc: describe a Tarantool JSON indexes box: introduce slot_cache in key_part box: introduce JSON indexes box: specify indexes in user-friendly form doc/rfc/1012-json-indexes.md | 188 +++++++++++ src/box/alter.cc | 38 +++ src/box/errcode.h | 2 +- src/box/index_def.c | 10 +- src/box/key_def.c | 296 +++++++++++++++-- src/box/key_def.h | 42 ++- src/box/lua/index.c | 74 +++++ src/box/lua/schema.lua | 20 +- src/box/lua/space.cc | 5 + src/box/memtx_bitset.c | 8 +- src/box/memtx_engine.c | 5 + src/box/memtx_rtree.c | 6 +- src/box/schema.cc | 12 +- src/box/tuple.c | 11 +- src/box/tuple_compare.cc | 142 ++++++-- src/box/tuple_extract_key.cc | 146 ++++++--- src/box/tuple_format.c | 765 +++++++++++++++++++++++++++++++++++++++---- src/box/tuple_format.h | 80 ++++- src/box/tuple_hash.cc | 67 +++- src/box/vinyl.c | 5 + src/box/vy_log.c | 3 +- src/box/vy_lsm.c | 44 +++ src/box/vy_point_lookup.c | 2 - src/box/vy_stmt.c | 124 +++++-- src/box/vy_stmt.h | 7 +- test/box/misc.result | 57 ++-- test/engine/iterator.result | 2 +- test/engine/tuple.result | 353 ++++++++++++++++++++ test/engine/tuple.test.lua | 99 ++++++ 29 files changed, 2332 insertions(+), 281 deletions(-) create mode 100644 doc/rfc/1012-json-indexes.md -- 2.7.4