[PATCH] Removed the unsafe ngx_memcmp() wrapper for memcmp(3)
Maxim Dounin
mdounin at mdounin.ru
Sat Nov 5 02:39:26 UTC 2022
Hello!
On Fri, Nov 04, 2022 at 04:24:14PM +0100, Alejandro Colomar wrote:
> The casts are unnecessary, since memcmp(3)'s arguments are
> 'const void *', which allows implicit conversion from any pointer type.
> It might have been necessary in the times of K&R C, where 'void *'
> didn't exist yet, up until the early 2000s, because some old systems
> still had limited or no support for ISO C89. Those systems passed away
> a long time ago, and current systems, even the oldest living ones, have
> support for ISO C89.
>
> The changes, apart from the removal of the macro itself, were scripted:
>
> $ find src/ -type f \
> | grep '\.[ch]$' \
> | xargs sed -i 's/ngx_memcmp/memcmp/';
>
> The cast in this case is (almost) innocuous, because it only hides
> warnings for conversions from integer to pointer such as:
>
> nxt_memcmp(n, "foo", 3); // no warnings
> memcmp(n, "foo", 3); // warning: integer to pointer conversion
>
> which is a difficult bug to write, since it's too obvious. Such code
> will probably be caught in a code review, but there's always a small
> risk. Since there's no reason to keep the small risk around, when we
> can just avoid it by removing the cast.
>
> In general, it's better to avoid a cast if possible, since casts will
> disable many compiler warnings regarding type safety.
Thanks for the patch.
The ngx_memcpy() macro was introduced in 107:b5be4b0448d3
(nginx-0.0.1-2003-07-01-19:00:03 import) as an alias to memcmp(),
with the following explanation why it's just an alias:
+/* msvc and icc compile memcmp() to inline loop */
+#define ngx_memcmp memcmp
This was along with the first use of memcmp in nginx, so clearly
the "ngx_" prefix is not an accident.
Indeed, the prefix is used in multiple places to provide optimized
or instrumented functions, even if normally they map to generally
available functions such as memcmp(). For example, ngx_memcpy()
usually maps to memcpy(), but instrumented to detect large copies
if NGX_MEMCPY_LIMIT is defined. Even if the particular function
is not instrumented, the prefix is important to make it possible
to change things without huge modifications of the code, like the
one in your patch. Further, the prefix is also important for code
consistency. And, last but not least, it makes it possible to add
casts if needed on some platforms (or remove them if they are
considered wrong).
In particular, casts to (const char *) for ngx_memcmp() were added
in 2007 by this commit (nginx 0.6.18):
changeset: 1648:89a47f19b9ec
user: Igor Sysoev <igor at sysoev.ru>
date: Fri Nov 23 17:00:11 2007 +0000
summary: update ngx_memcmp()
Unfortunately, this commit contains no details about the
particular system which needed the cast, though it looks like
there were some at least at that time.
Similar casts are used by almost all nginx string macro
definitions, since most C library functions accept "char *", while
nginx uses "u_char *" for strings, and this results in useless
warnings. While it might be seen as something better to avoid,
the only alternative would be to provide wrapper functions, which
might not be a good idea for performance and readability reasons.
On the other hand, macro definitions with casts are easily
readable, have no impact on performance, and provide limited to
no effect on the code quality as long as proper coding, testing
and review process is implemented.
Summing the above, certainly removing the "ngx_" prefix is a bad
idea. Removing casts from the particular macro might be
considered, though in general it looks like a quest to prove these
are not needed (or no longer needed) given the wide range of
platforms nginx supports.
Hope this helps.
[...]
--
Maxim Dounin
http://mdounin.ru/
More information about the nginx-devel
mailing list