[PATCH] lua: escape trigraphs in bundled lua sources

Alexander Turenko alexander.turenko at tarantool.org
Tue Jun 18 15:57:16 MSK 2019


Built-in modules are bundled into tarantool in the following way. A lua
file from src/lua or src/box/lua is stored as a string literal in a C
file, then built and linked into tarantool. During startup tarantool
calls luaL_loadbuffer() on this string.

When a Lua source is converted to a C literal, proper escaping is
performed. However there is one case, which was not covered: trigraphs.
The patch adds escaping of question mark symbols to avoid matching ??X
sequences as trigraphs by C preprocessor.

The most simple way to check that it works is to apply the following
patch:

 | diff --git a/src/lua/string.lua b/src/lua/string.lua
 | index 6e12c59ae..2da2dbf4d 100644
 | --- a/src/lua/string.lua
 | +++ b/src/lua/string.lua
 | @@ -425,3 +425,6 @@ string.fromhex    = string_fromhex
 |  string.strip      = string_strip
 |  string.lstrip      = string_lstrip
 |  string.rstrip      = string_rstrip
 | +string.foo = function()
 | +    return '??('
 | +end

And call the function like so:

 | ./src/tarantool -e 'print(string.foo()) os.exit()'

If it printfs `??(`, then everything is okay. If it prints `[`, then
`??(` was preprocessed as the trigraph.

We hit this problem when tried to bundle luarocks-3: it contains
"^(.-)(%??)$" regexp, where `??)` was interpreted as `]`. Debug build or
a build with -DENABLE_WERROR reports an error in the case, but usual
RelWithDebInfo build passes (with -Wtrigraphs warnings) and can show
this unexpected behaviour.

Fixes #4291.
---

https://github.com/tarantool/tarantool/issues/4291
https://github.com/tarantool/tarantool/tree/Totktonada/gh-4291-fix-bundled-lua-escaping

 extra/txt2c.c | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/extra/txt2c.c b/extra/txt2c.c
index e1bad0825..3f701d611 100644
--- a/extra/txt2c.c
+++ b/extra/txt2c.c
@@ -70,6 +70,8 @@ int main(int argc, char** argv) {
 		case '\r': fputs("\\r", out); break;
 		case '\\': fputs("\\\\", out); break;
 		case '\"': fputs("\\\"", out); break;
+		/* Don't interpret ??X as a trigraph. */
+		case '?': fputs("\\\?", out); break;
 		default: fputc(c, out); break;
 		}
 	}
-- 
2.21.0




More information about the Tarantool-patches mailing list