bind9

Author	SHA1	Message	Date
Evan Hunt	9a372f2bce	Use different allocators for UDP and TCP Each worker has a receive buffer with space for 20 DNS messages of up to 2^16 bytes each, and the allocator function passed to uv_read_start() or uv_udp_recv_start() will reserve a portion of it for use by sockets. UDP can use recvmmsg() and so it needs that entire space, but TCP reads one message at a time. This commit introduces separate allocator functions for TCP and UDP setting different buffer size limits, so that libuv will provide the correct buffer sizes to each of them.	2020-08-05 12:57:58 +02:00
Witold Kręcicki	a12076cc52	netmgr: retry binding with IP_FREEBIND when EADDRNOTAVAIL is returned. When a new IPv6 interface/address appears it's first in a tentative state - in which we cannot bind to it, yet it's already being reported by the route socket. Because of that BIND9 is unable to listen on any newly detected IPv6 addresses. Fix it by setting IP_FREEBIND option (or equivalent option on other OSes) and then retrying bind() call. (cherry picked from commit `a0f7d28967`)	2020-07-31 13:33:06 +02:00
Evan Hunt	952461b6af	restore "blackhole" functionality the blackhole ACL was accidentally disabled with respect to client queries during the netmgr conversion. in order to make this work for TCP, it was necessary to add a return code to the accept callback functions passed to isc_nm_listentcp() and isc_nm_listentcpdns(). (cherry picked from commit `23c7373d68`)	2020-06-30 21:10:31 -07:00
Witold Kręcicki	4582ef3bb2	Fix a shutdown race in netmgr udp. We need to mark the socket as inactive early (and synchronously) in the stoplistening process - otherwise we might destroy the callback argument before actually stopping listening, and call the callback on a bad memory.	2020-06-26 01:44:03 -07:00
Witold Kręcicki	97e44fa3df	Make netmgr tcpdns send calls asynchronous. isc__nm_tcpdns_send() was not asynchronous and accessed socket internal fields in an unsafe manner, which could lead to a race condition and subsequent crash. Fix it by moving the whole tcpdns processing to a proper netmgr thread.	2020-06-26 01:18:27 -07:00
Ondřej Surý	8b4fe6c6c5	Add missing acquire memory barrier in isc_nmhandle_unref The ThreadSanitizer uses system synchronization primitives to check for data race. The netmgr handle->references was missing acquire memory barrier before resetting and reusing the memory occupied by isc_nmhandle_t. (cherry picked from commit `1013c0930e`)	2020-06-16 08:58:33 +02:00
Witold Kręcicki	aa2282853a	Fix a race in TCP accepting. There's a possibility of a race in TCP accepting code: T1 accepts a connection C1 T2 accepts a connection C2 T1 tries to accept a connection C3, but we hit a quota, isc_quota_cb_init() sets quota_accept_cb for the socket, we return from accept_connection T2 drops C2, but we race in quota_release with accepting C3 so we don't see quota->waiting is > 0, we don't launch the callback T1 accepts a connection C4, we are able to get the quota we clear the quota_accept_cb from sock->quotacb T1 drops C1, tries to call the callback which is zeroed, sigsegv.	2020-06-10 11:39:43 -07:00
Witold Kręcicki	091117b7ae	isc_uv_import must pass UV__IPC_SOCKET_XFER_TCP_CONNECTION, not SERVER. As a leftover from old TCP accept code isc_uv_import passed TCP_SERVER flag when importing a socket on Windows. Since now we're importing/exporting accepted connections it needs to pass TCP_CONNECTION flag. (cherry picked from commit `801f7af6e9`)	2020-06-03 23:27:24 +02:00
Witold Kręcicki	818afe613f	Redesigned TCP accepting: one listen/accept loop, passing the connected socket. Instead of using bind() and passing the listening socket to the children threads using uv_export/uv_import use one thread that does the accepting, and then passes the connected socket using uv_export/uv_import to a random worker. The previous solution had thundering herd problems (all workers waking up on one connection and trying to accept()), this one avoids this and is simpler. The tcp clients quota is simplified with isc_quota_attach_cb - a callback is issued when the quota is available. (cherry picked from commit `60629e5b0b`)	2020-06-03 23:00:52 +02:00
Ondřej Surý	1217916c1e	Don't check the result of setting SO_INCOMING_CPU The SO_INCOMING_CPU is available since Linux 3.19 for getting the value, but only since Linux 4.4 for setting the value (see below for a full description). BIND 9 should not fail when setting the option on the socket fails, as this is only an optimization and not hard requirement to run BIND 9. SO_INCOMING_CPU (gettable since Linux 3.19, settable since Linux 4.4) Sets or gets the CPU affinity of a socket. Expects an integer flag. int cpu = 1; setsockopt(fd, SOL_SOCKET, SO_INCOMING_CPU, &cpu, sizeof(cpu)); Because all of the packets for a single stream (i.e., all packets for the same 4-tuple) arrive on the single RX queue that is associated with a particular CPU, the typical use case is to employ one listening process per RX queue, with the incoming flow being handled by a listener on the same CPU that is handling the RX queue. This provides optimal NUMA behavior and keeps CPU caches hot. (cherry picked from commit `4ec357da0a`)	2020-06-03 12:47:21 +02:00
Witold Kręcicki	3461aab083	Clear sock->magic to 0 when destroying a netmgr socket (cherry picked from commit `7ef756f639`)	2020-05-30 07:50:30 +02:00
Witold Kręcicki	4ceddeee78	Add missing isc_mutex_destroy and isc_conditional_destroy calls. While harmless on Linux, missing isc_{mutex,conditional}_destroy causes a memory leak on *BSD. Missing calls were added. (cherry picked from commit `a8807d9a7b`)	2020-05-30 07:50:30 +02:00
Evan Hunt	00c816778d	change 'expr == true' to 'expr' in conditionals (cherry picked from commit `68a1c9d679`)	2020-05-25 17:03:59 -07:00
Ondřej Surý	af1b56240f	Resolve the overlinking of the system libraries Originally, every library and binaries got linked to everything, which creates unnecessary overlinking. This wasn't as straightforward as it should be as we still support configuration without libtool for 9.16. Couple of smaller issues related to include headers and an issue where sanitizer overload dlopen and dlclose symbols, so we were getting false negatives in the autoconf test.	2020-05-11 09:49:54 +02:00
Witold Kręcicki	444a16bff9	Don't set UDP recv/send buffer sizes - use system defaults (unless explicitly defined) (cherry picked from commit `fa02f6438b`)	2020-05-01 17:47:19 +02:00
Ondřej Surý	c56cd29bbb	Use SO_REUSEPORT only on Linux, use SO_REUSEPORT_LB on FreeBSD The SO_REUSEPORT socket option on Linux means something else on BSD based systems. On FreeBSD there's 1:1 option SO_REUSEPORT_LB, so we can use that. (cherry picked from commit `09ba47b067`)	2020-05-01 16:50:06 +02:00
Witold Kręcicki	786a289dfb	Don't free udp recv buffer if UV_UDP_MMSG_CHUNK is set (cherry picked from commit `83049ceabf`)	2020-05-01 11:27:46 +02:00
Ondřej Surý	cf7975400e	Use UV_UDP_RECVMMSG to enable mmsg support in libuv if available (cherry picked from commit `d5356a40ff`)	2020-05-01 11:27:46 +02:00
Ondřej Surý	0e9b0d79fb	Remove the extra decstats on STATID_ACTIVE for children sockets (cherry picked from commit `26842ac25c`)	2020-04-03 20:22:56 +02:00
Witold Kręcicki	365636dbc9	netmgr refactoring: use generic functions when operating on sockets. tcpdns used transport-specific functions to operate on the outer socket. Use generic ones instead, and select the proper call in netmgr.c. Make the missing functions (e.g. isc_nm_read) generic and add type-specific calls (isc__nm_tcp_read). This is the preparation for netmgr TLS layer. (cherry picked from commit `5fedd21e16`)	2020-04-03 13:44:28 +02:00
Witold Kręcicki	3274650123	Deactivate the handle before sending the async close callback. We could have a race between handle closing and processing async callback. Deactivate the handle before issuing the callback - we have the socket referenced anyway so it's not a problem.	2020-03-30 10:54:12 +00:00
Ondřej Surý	f3c2274479	Use the new sorting rules to regroup #include headers	2020-03-11 08:55:12 +00:00
Witold Kręcicki	5b22e3689d	Only use tcpdns timer if it's initialized. (cherry picked from commit `4b9962d4a3`)	2020-03-05 23:27:56 +00:00
Witold Kręcicki	b32b01d403	Fix TCPDNS socket closing issues (cherry picked from commit `ae1499ca19`)	2020-03-05 23:27:56 +00:00
Witold Kręcicki	11b80da9ff	Limit TCP connection quota logging to 1/s (cherry picked from commit `fc9792eae8`)	2020-03-05 23:27:56 +00:00
Witold Kręcicki	b85de76816	Proper accounting of active TCP connections (cherry picked from commit `fc9e2276ca`)	2020-03-05 23:27:56 +00:00
Evan Hunt	d794d85ce1	comments (cherry picked from commit `0b76d8a490`)	2020-02-28 10:05:25 +01:00
Witold Kręcicki	fbc81f4ed7	Increase inactivehandles and inactivereqs size for better reuse. (cherry picked from commit `4791263def`)	2020-02-28 10:05:25 +01:00
Witold Kręcicki	bd33adfb67	use SO_INCOMING_CPU for UDP sockets (cherry picked from commit `517e6eccdf`)	2020-02-28 10:05:25 +01:00
Witold Kręcicki	4e422b3f10	We don't need to fill udp local address every time since we are bound to it. (cherry picked from commit `a658f7976c`)	2020-02-28 10:05:25 +01:00
Witold Kręcicki	f7039eb27e	Use the original threadid when sending a UDP packet to decrease probability of context switching (cherry picked from commit `eb874608c1`)	2020-02-28 10:05:25 +01:00
Evan Hunt	11a0d771f9	fix spelling errors reported by Fossies. (cherry picked from commit `ba0313e649`)	2020-02-21 07:05:31 +00:00
Witold Kręcicki	32d00479e6	Use libuv-provided uv_{export,import} if available. We were using our own versions of isc_uv_{export,import} functions for multithreaded TCP listeners. Upcoming libuv version will contain proper uv_{export,import} functions - use them if they're available.	2020-02-18 14:21:16 +01:00
Witold Kręcicki	85c2f8dab5	Make nm->recvbuf larger and heap allocated, to allow uv_recvmmsg usage. Upcoming version of libuv will suport uv_recvmmsg and uv_sendmmsg. To use uv_recvmmsg we need to provide a larger buffer and be able to properly free it.	2020-02-18 14:21:16 +01:00
Ondřej Surý	829b461c54	Merge branch '46-enforce-clang-format-rules' into 'master' Start enforcing the clang-format rules on changed files Closes #46 See merge request isc-projects/bind9!3063 (cherry picked from commit `a04cdde45d`) `d2b5853b` Start enforcing the clang-format rules on changed files `618947c6` Switch AlwaysBreakAfterReturnType from TopLevelDefinitions to All `654927c8` Add separate .clang-format files for headers `5777c44a` Reformat using the new rules `60d29f69` Don't enforce copyrights on .clang-format	2020-02-14 08:45:59 +00:00
Ondřej Surý	cdef20bb66	Merge branch 'each-style-tweak' into 'master' adjust clang-format options to get closer to ISC style See merge request isc-projects/bind9!3061 (cherry picked from commit `d3b49b6675`) `0255a974` revise .clang-format and add a C formatting script in util `e851ed0b` apply the modified style	2020-02-14 05:35:29 +00:00
Ondřej Surý	2e55baddd8	Merge branch '46-add-curly-braces' into 'master' Add curly braces using uncrustify and then reformat with clang-format back Closes #46 See merge request isc-projects/bind9!3057 (cherry picked from commit `67b68e06ad`) `36c6105e` Use coccinelle to add braces to nested single line statement `d14bb713` Add copy of run-clang-tidy that can fixup the filepaths `056e133c` Use clang-tidy to add curly braces around one-line statements	2020-02-13 21:28:35 +00:00
Ondřej Surý	c931d8e417	Merge branch '46-just-use-clang-format-to-reformat-sources' into 'master' Reformat source code with clang-format Closes #46 See merge request isc-projects/bind9!2156 (cherry picked from commit `7099e79a9b`) `4c3b063e` Import Linux kernel .clang-format with small modifications `f50b1e06` Use clang-format to reformat the source files `11341c76` Update the definition files for Windows `df6c1f76` Remove tkey_test (which is no-op anyway)	2020-02-12 14:51:18 +00:00
Witold Kręcicki	a133239698	Don't limit the size of uvreq/nmhandle pool artificially. There was a hard limit set on number of uvreq and nmhandles that can be allocated by a pool, but we don't handle a situation where we can't get an uvreq. Don't limit the number at all, let the OS deal with it.	2020-02-11 12:10:57 +00:00
Ondřej Surý	bc1d4c9cb4	Clear the pointer to destroyed object early using the semantic patch Also disable the semantic patch as the code needs tweaks here and there because some destroy functions might not destroy the object and return early if the object is still in use.	2020-02-09 18:00:17 -08:00
Ondřej Surý	41fe9b7a14	Formatting issues found by local coccinelle run	2020-02-08 03:12:09 -08:00
Mark Andrews	0be2dc9f22	break was on wrong line. 959 break; CID 1457872 (#1 of 1): Structurally dead code (UNREACHABLE) unreachable: This code cannot be reached: isc__nm_incstats(sock->mgr,.... 960 isc__nm_incstats(sock->mgr, sock->statsindex[STATID_ACTIVE]); 961 default:	2020-02-05 18:37:17 +11:00
Witold Kręcicki	fd8788eb94	Fix possible race in socket destruction. When two threads unreferenced handles coming from one socket while the socket was being destructed we could get a use-after-free: Having handle H1 coming from socket S1, H2 coming from socket S2, S0 being a parent socket to S1 and S2: Thread A Thread B Unref handle H1 Unref handle H2 Remove H1 from S1 active handles Remove H2 from S2 active handles nmsocket_maybe_destroy(S1) nmsocket_maybe_destroy(S2) nmsocket_maybe_destroy(S0) nmsocket_maybe_destroy(S0) LOCK(S0->lock) Go through all children, figure out that we have no more active handles: sum of S0->children[i]->ah == 0 UNLOCK(S0->lock) destroy(S0) LOCK(S0->lock) - but S0 is already gone	2020-01-20 22:28:36 +01:00
Witold Kręcicki	42f0e25a4c	calling isc__nm_udp_send() on a non-udp socket is not 'unexpected', it's a critical failure	2020-01-20 22:28:36 +01:00
Witold Kręcicki	8d6dc8613a	clean up some handle/client reference counting errors in error cases. We weren't consistent about who should unreference the handle in case of network error. Make it consistent so that it's always the client code responsibility to unreference the handle - either in the callback or right away if send function failed and the callback will never be called.	2020-01-20 22:28:36 +01:00
Witold Kręcicki	f75a9e32be	netmgr: fix a non-thread-safe access to libuv structures In tcp and udp stoplistening code we accessed libuv structures from a different thread, which caused a shutdown crash when named was under load. Also added additional DbC checks making sure we're in a proper thread when accessing uv_ functions.	2020-01-20 22:28:36 +01:00
Witold Kręcicki	16908ec3d9	netmgr: don't send to an inactive (closing) udp socket We had a race in which n UDP socket could have been already closing by libuv but we still sent data to it. Mark socket as not-active when stopping listening and verify that socket is not active when trying to send data to it.	2020-01-20 22:28:36 +01:00
Witold Kręcicki	eda4300bbb	netmgr: have a single source of truth for tcpdns callback We pass interface as an opaque argument to tcpdns listening socket. If we stop listening on an interface but still have in-flight connections the opaque 'interface' is not properly reference counted, and we might hit a dead memory. We put just a single source of truth in a listening socket and make the child sockets use that instead of copying the value from listening socket. We clean the callback when we stop listening.	2020-01-15 17:22:13 +01:00
Witold Kręcicki	0d637b5985	netmgr: we can't uv_close(sock->timer) when in sock->timer close callback	2020-01-15 14:56:40 +01:00
Witold Kręcicki	525c583145	netmgr: - isc__netievent_storage_t was to small to contain isc__netievent__socket_streaminfo_t on Windows - handle isc_uv_export and isc_uv_import errors properly - rewrite isc_uv_export and isc_uv_import on Windows	2020-01-15 14:08:44 +01:00

1 2

94 Commits