bind9

Author	SHA1	Message	Date
Evan Hunt	73ff8850bf	ADB entries could be unlinked too soon Due to a typo in the code, ADB entries were unlinked from their entry buckets during shutdown if they had a nonzero reference count. They were only supposed to be unlinked if the reference count was exactly one (that being the reference held by the bucket itself).	2022-04-11 17:29:03 -07:00
Ondřej Surý	f981b52793	Don't destroy mctx and task pools until we are destroying zonemgr The mctx, zonetask and loadtask pools were being destroyed in the shutdown function where in theory a dangling zone could be still attached to it. Move the isc_mem_put() on the pools to the destroy() function.	2022-04-07 18:12:03 +02:00
Tony Finch	71ce8b0a51	Ensure that dns_request_createvia() has a retry limit There are a couple of problems with dns_request_createvia(): a UDP retry count of zero means unlimited retries (it should mean no retries), and the overall request timeout is not enforced. The combination of these bugs means that requests can be retried forever. This change alters calls to dns_request_createvia() to avoid the infinite retry bug by providing an explicit retry count. Previously, the calls specified infinite retries and relied on the limit implied by the overall request timeout and the UDP timeout (which did not work because the overall timeout is not enforced). The `udpretries` argument is also changed to be the number of retries; previously, zero was interpreted as infinity because of an underflow to UINT_MAX, which appeared to be a mistake. And `mdig` is updated to match the change in retry accounting. The bug could be triggered by zone maintenance queries, including NOTIFY messages, DS parental checks, refresh SOA queries and stub zone nameserver lookups. It could also occur with `nsupdate -r 0`. (But `mdig` had its own code to avoid the bug.)	2022-04-06 17:12:48 +01:00
Artem Boldariev	77b2db8246	Replace listener TLS contexts on reconfiguration This commit makes use of isc_nmsocket_set_tlsctx(). Now, instead of recreating TLS-enabled listeners (including the underlying TCP listener sockets), only the TLS context in use is replaced.	2022-04-06 18:45:57 +03:00
Artem Boldariev	df317184eb	Add isc_nmsocket_set_tlsctx() This commit adds isc_nmsocket_set_tlsctx() - an asynchronous function that replaces the TLS context within a given TLS-enabled listener socket object. It is based on the newly added reference counting functionality. The intention of adding this function is to add functionality to replace a TLS context without recreating the whole socket object, including the underlying TCP listener socket, as a BIND process might not have enough permissions to re-create it fully on reconfiguration.	2022-04-06 18:45:57 +03:00
Artem Boldariev	25609156a5	Maintain a per-thread TLS ctx reference in TLS stream code This commit changes the generic TLS stream code to maintain a per-worker thread TLS context reference.	2022-04-06 18:45:57 +03:00
Artem Boldariev	9256026d18	Use isc_tlsctx_attach() in TLS DNS code This commit adds proper reference counting for TLS contexts into generic TLS DNS (DoT) code.	2022-04-06 18:45:57 +03:00
Artem Boldariev	b52d46612f	Use isc_tlsctx_attach() in TLS stream code This commit adds proper reference counting for TLS contexts into generic TLS stream code.	2022-04-06 18:45:57 +03:00
Artem Boldariev	a7a482c1b1	Add isc_tlsctx_attach() The implementation is done on top of the reference counting functionality found in OpenSSL/LibreSSL, which allows for avoiding wrapping the object. Adding this function allows using reference counting for TLS contexts in BIND 9's codebase.	2022-04-06 18:45:57 +03:00
Ondřej Surý	7e71c4d0cc	Rename the configuration option to load balance sockets to reuseport After some back and forth, it was decided to match the configuration option with unbound ("so-reuseport"), PowerDNS ("reuseport") and/or nginx ("reuseport").	2022-04-06 17:03:57 +02:00
Mark Andrews	98718b3b4b	Unlink the timer event before trying to purge it as far as I can determine the order of operations is not important. *** CID 351372: Concurrent data access violations (ATOMICITY) /lib/isc/timer.c: 227 in timer_purge() 221 LOCK(&timer->lock); 222 if (!purged) { 223 /* 224 * The event has already been executed, but not 225 * yet destroyed. 226 */ >>> CID 351372: Concurrent data access violations (ATOMICITY) >>> Using an unreliable value of "event" inside the second locked section. If the data that "event" depends on was changed by another thread, this use might be incorrect. 227 timerevent_unlink(timer, event); 228 } 229 } 230 } 231 232 void	2022-04-06 07:33:41 +00:00
Mark Andrews	ed1e480c53	Move lock to before label to prevent duplicate lock *** CID 351370: Program hangs (LOCK) /lib/dns/adb.c: 2699 in dns_adb_cancelfind() 2693 2694 LOCK(&nbucket->lock); 2695 ISC_LIST_UNLINK(adbname->finds, find, plink); 2696 UNLOCK(&nbucket->lock); 2697 2698 cleanup: >>> CID 351370: Program hangs (LOCK) >>> "pthread_mutex_lock" locks "find->lock" while it is locked. 2699 LOCK(&find->lock); 2700 if (!FIND_EVENTSENT(find)) { 2701 ev = &find->event; 2702 task = ev->ev_sender; 2703 ev->ev_sender = find; 2704 ev->ev_type = DNS_EVENT_ADBCANCELED;	2022-04-06 12:56:17 +10:00
Mark Andrews	05e08a21d1	Remove unnecessary NULL test leading to REVERSE_INULL false positive *** CID 351371: Null pointer dereferences (REVERSE_INULL) /lib/dns/adb.c: 2615 in dns_adb_createfind() 2609 /* 2610 * Copy out error flags from the name structure into the find. 2611 */ 2612 find->result_v4 = find_err_map[adbname->fetch_err]; 2613 find->result_v6 = find_err_map[adbname->fetch6_err]; 2614 >>> CID 351371: Null pointer dereferences (REVERSE_INULL) >>> Null-checking "find" suggests that it may be null, but it has already been dereferenced on all paths leading to the check. 2615 if (find != NULL) { 2616 if (want_event) { 2617 INSIST((find->flags & DNS_ADBFIND_ADDRESSMASK) != 0); 2618 isc_task_attach(task, &(isc_task_t *){ NULL }); 2619 find->event.ev_sender = task; 2620 find->event.ev_action = action;	2022-04-06 12:54:08 +10:00
Artem Boldariev	f0ac4c47b0	Change X509_STORE_up_ref() shim return value X509_STORE_up_ref() must return 1 on success, while the previous implementation would return the references count. This commit fixes that.	2022-04-05 15:03:27 +03:00
Ondřej Surý	7868d8145b	Rename shutdown() to test_shutdown() in timer_test.c The shutdown() is part of standard library (POSIX-1), don't use such name in the timer_test.c, but rather rename it to test_shutdown().	2022-04-05 01:49:04 +02:00
Ondřej Surý	142c63dda8	Enable the load-balance-sockets configuration Previously, HAVE_SO_REUSEPORT_LB has been defined only in the private netmgr-int.h header file, making the configuration of load balanced sockets inoperable. Move the missing HAVE_SO_REUSEPORT_LB define to isc/netmgr.h and add the missing isc_nm_getloadbalancesockets() implementation.	2022-04-05 01:30:58 +02:00
Ondřej Surý	85c6e797aa	Add option to configure load balance sockets Previously, the option to enable kernel load balancing of the sockets was always enabled when supported by the operating system (SO_REUSEPORT on Linux and SO_REUSEPORT_LB on FreeBSD). It was reported that in scenarios where the networking threads are also responsible for processing long-running tasks (like RPZ processing, CATZ processing or large zone transfers), this could lead to intermittent brownouts for some clients, because the thread assigned by the operating system might be busy. In such scenarios, the overall performance would be better served by threads competing over the sockets because the idle threads can pick up the incoming traffic. Add new configuration option (`load-balance-sockets`) to allow enabling or disabling the load balancing of the sockets.	2022-04-04 23:10:04 +02:00
Ondřej Surý	f106d0ed2b	Run the RPZ update as offloaded work Previously, the RPZ updates ran quantized on the main nm_worker loops. As the quantum was set to 1024, this might lead to service interruptions when large RPZ update was processed. Change the RPZ update process to run as the offloaded work. The update and cleanup loops were refactored to do as little locking of the maintenance lock as possible for the shortest periods of time and the db iterator is being paused for every iteration, so we don't hold the rbtdb tree lock for prolonged periods of time.	2022-04-04 21:20:05 +02:00
Ondřej Surý	b6e885c97f	Refactor the dns_rpz_add/delete to use local rpz copy Previously dns_rpz_add() were passed dns_rpz_zones_t and index to .zones array. Because we actually attach to dns_rpz_zone_t, we should be using the local pointer instead of passing the index and "finding" the dns_rpz_zone_t again. Additionally, dns_rpz_add() and dns_rpz_delete() were used only inside rpz.c, so make them static.	2022-04-04 21:20:05 +02:00
Ondřej Surý	840179a247	General cleanup of dns_rpz implementation Do a general cleanup of lib/dns/rpz.c style: * Removed deprecated and unused functions * Unified dns_rpz_zone_t naming to rpz * Unified dns_rpz_zones_t naming to rpzs * Add and use rpz_attach() and rpz_attach_rpzs() functions * Shuffled variables to be more local (cppcheck cleanup)	2022-04-04 21:19:48 +02:00
Ondřej Surý	c0995bc380	Remove exclusive mode from ns_interfacemgr Now that dns_aclenv_t has properly rwlocked .localhost and .localnets members, we can remove the task exclusive mode use from the ns_interfacemgr. Some light related cleanup has also been done.	2022-04-04 19:27:00 +02:00
Ondřej Surý	8138a595d9	Add isc_rwlock around dns_aclenv .localhost and .localnets member In order to modify the .localhost and .localnets members of the dns_aclenv, all other processing on the netmgr loops needed to be stopped using the task exclusive mode. Add the isc_rwlock to the dns_aclenv, so any modifications to the .localhost and .localnets can be done under the write lock.	2022-04-04 19:27:00 +02:00
Ondřej Surý	ae01ec2823	Don't use reference counting in isc_timer unit The reference counting and isc_timer_attach()/isc_timer_detach() semantics are actually misleading because they cannot be used under normal conditions. Usually, the object that uses the timer is itself passed as the argument to the timer. This means that when the caller uses `isc_timer_detach()`, it needs the timer to stop, but isc_timer_detach() does that only if it drops the last reference. Unfortunately, this also means that if the timer is attached elsewhere and the timer fires, it will most likely cause a use-after-free, because the object used in the timer no longer exists. Remove the reference counting from the isc_timer unit, remove the isc_timer_attach() function and rename isc_timer_detach() to isc_timer_destroy() to better reflect how the API needs to be used. The only caveat is that an already executed event must be destroyed before isc_timer_destroy() is called because the timer is no longer attached to .ev_destroy_arg.	2022-04-02 01:23:15 +02:00
Ondřej Surý	30e0fd942b	Remove task privileged mode Previously, the task privileged mode has been used only when the named was starting up and loading the zones from the disk as the "first" thing to do. The privileged task was setup with quantum == 2, which made the taskmgr/netmgr spin around the privileged queue processing two events at the time. The same effect can be achieved by setting the quantum to UINT_MAX (e.g. practically unlimited) for the loadzone task, hence the privileged task mode was removed in favor of just processing all the events on the loadzone task in a single task_run().	2022-04-01 23:55:26 +02:00
Ondřej Surý	62a72211aa	Remove isc_pool API Since the last user of the isc_pool API is gone, remove the whole isc_pool API.	2022-04-01 23:50:34 +02:00
Ondřej Surý	2bc7303af2	Use isc_nm_getnworkers to manage zone resources Instead of passing the number of worker to the dns_zonemgr manually, get the number of nm threads using the new isc_nm_getnworkers() call. Additionally, remove the isc_pool API and manage the array of memory context, zonetasks and loadtasks directly in the zonemgr.	2022-04-01 23:50:34 +02:00
Ondřej Surý	2707d0eeb7	Set hard thread affinity for each zone After switching to per-thread resources in the zonemgr, the performance was decreased because the memory context, zonetask and loadtask was picked from the pool at random. Pin the zone to single threadid (.tid) and align the memory context, zonetask and loadtask to be the same, this sets the hard affinity of the zone to the netmgr thread.	2022-04-01 23:50:34 +02:00
Ondřej Surý	a94678ff77	Create per-thread task and memory context for zonemgr Previously, the zonemgr created 1 task per 100 zones and 1 memory context per 1000 zones (with minimum 10 tasks and 2 memory contexts) to reduce the contention between threads. Instead of reducing the contention by having many resources, create a per-nm_thread memory context, loadtask and zonetask and spread the zones between just per-thread resources. Note: this commit alone does decrease performance when loading the zones by a couple of seconds (in case of 1M zones) and thus there's more work in this whole MR fixing the performance.	2022-04-01 23:50:34 +02:00
Ondřej Surý	40971b22e7	Stop the zone timer before detaching the timer Previously, the zone timer was not stopped before detaching the timer. This could lead to a data race where the timer post_event() could fire before the timer was detached, but then the event would be executed after the zone was already destroyed. This was not noticed before because the timing or the ordering of the actions were different, but it was causing assertion failures in the libns tests now. Properly stop the zone timer before detaching the timer object from the dns_zone.	2022-04-01 23:45:23 +02:00
Ondřej Surý	87c4c24cde	Set quantum to infinity for the zone loading task When we are loading the zones, set the quantum to UINT_MAX, which makes task_run process all tasks at once. After the zone loading is finished the quantum will be dropped to 1 to not block server when we are loading new zones after reconfiguration.	2022-04-01 23:45:23 +02:00
Ondřej Surý	15ea6f002f	Add isc_task_setquantum() and use it for post-init zone loading Add isc_task_setquantum() function that modifies quantum for the future isc_task_run() invocations. NOTE: The current isc_task_run() caches the task->quantum into a local variable and therefore the current event loop is not affected by any quantum change.	2022-04-01 23:45:23 +02:00
Ondřej Surý	c17eee034b	Remove isc_task_purge() and isc_task_purgerange() The isc_task_purge() and isc_task_purgerange() were now unused, so sweep the task.c file. Additionally remove unused ISC_EVENTATTR_NOPURGE event attribute.	2022-04-01 23:45:23 +02:00
Ondřej Surý	9f7ba679ac	Purge the .resched_event in dns_cache Instead of sweeping the cache cleaner tasks, purge the more specific cleaner.resched_event event.	2022-04-01 23:45:23 +02:00
Ondřej Surý	48b2a5df97	Keep the list of scheduled events on the timer Instead of searching for the events to purge, keep the list of scheduled events on the timer list and purge the events that we have scheduled.	2022-04-01 23:45:23 +02:00
Ondřej Surý	17aed2f895	Repair isc_task_purgeevent(), clean isc_task_unsend{,range}() The isc_task_purgerange() was walking through all events on the task to find a matching task. Instead use the ISC_LINK_LINKED to find whether the event is active. Cleanup the related isc_task_unsend() and isc_task_unsendrange() functions that were not used anywhere.	2022-04-01 23:45:23 +02:00
Ondřej Surý	b84c9b2608	Turn isc_hash_bits32() into static inline function Adding extra val & 0xffff in the isc_hash_bits32() macros in the hotpath has significantly reduced the performance. Turn the macro into static inline function matching the previous hash_32() function used to compute hashval matching the hashtable->bits.	2022-04-01 23:04:24 +02:00
Artem Boldariev	3edf7a9fe7	Implement shim for SSL_CTX_set1_cert_store() (affects Debian 9) This commit implements a shim for SSL_CTX_set1_cert_store() for OpenSSL/LibreSSL versions where it is not available.	2022-04-01 16:33:43 +03:00
Mark Andrews	5abdee9004	Prevent arithmetic overflow of 'i' in master.c:generate The value of 'i' in generate could overflow when adding 'step' to it in the 'for' loop. Use an unsigned int for 'i' which will give an additional bit and prevent the overflow. The inputs are both less than 2^31 and the result will be less than 2^32-1.	2022-04-01 07:56:52 +00:00
Tony Finch	84c4eb02e7	Log "not authoritative for update zone" more clearly Ensure the update zone name is mentioned in the NOTAUTH error message in the server log, so that it is easier to track down problematic update clients. There are two cases: either the update zone is unrelated to any of the server's zones (previously no zone was mentioned); or the update zone is a subdomain of one or more of the server's zones (previously the name of the irrelevant parent zone was misleadingly logged). Closes #3209	2022-03-30 12:50:30 +01:00
Ondřej Surý	4f74e1010e	Remove task exclusive mode from ns_clientmgr The .lock, .exiting and .excl members were not used for anything other than starting task exclusive mode, setting .exiting to true and ending exclusive mode. Remove all the stray members and dead code, eliminating the task exclusive mode use from ns_clientmgr.	2022-03-30 12:41:55 +02:00
Evan Hunt	199be183fa	Add detailed ADB and entry attach/detach tracing To turn on detailed debug tracing of dns_adb and dns_adbentry reference counting, #define ADB_TRACE at the top of adb.c. This is off by default.	2022-03-30 10:12:25 +02:00
Evan Hunt	d48d8e1cf0	Refactor ADB reference counting, shutdown and locking The ADB previously used separate reference counters for internal and external references, plus additional counters for ADB find and namehook objects, and used all these counters to coordinate its shutdown process, which was a multi-stage affair involving a sequence of control events. It also used a complex interlocking set of static functions for referencing, dereferencing, linking, unlinking, and cleaning up various internal objects; these functions returned boolean values to their callers to indicate what additional processing was needed. The changes in the previous two commits destabilized this fragile system in a way that was difficult to recover from, so in this commit we refactor all of it. The dns_adb and dns_adbentry objects now use conventional attach and detach functions for reference counting, and the shutdown process is much more straightforward. Instead of handling shutdown asynchronously, we can just destroy the ADB when references reach zero. In addition, ADB locking has been simplified. Instead of a single `find_{name,entry}_and_lock()` function which searches for a name or entry's hash bucket, locks it, and then searches for the name or entry in the bucket, we now use one function to find the bucket (leaving it to the caller to do the locking) and another to find the name or entry. Instead of locking the entire ADB when modifying hash tables, we now use read-write locks around the specific hash table. The only remaining need for adb->lock is when modifying the `whenshutdown` list. Comments throughout the module have been improved.	2022-03-30 10:12:25 +02:00
Evan Hunt	76bcb4d16b	Refactor how ADB names and entries are stored in the dns_adb Replace adb->{names,entries} and related arrays (indexed by hashed bucket) with isc_ht hash tables storing the new struct adb{name,entry}bucket_t that wraps all the variables that were originally stored in arrays indexed by "bucket" number stored directly in the struct dns_adb. Previously, the task exclusive mode was used to grow the internal arrays used to store the name and entry objects. The isc_ht hash tables are now protected by the isc_rwlock instead and thus the usage of the task exclusive mode has been removed from the dns_adb. Co-authored-by: Ondřej Surý <ondrej@isc.org>	2022-03-30 10:09:18 +02:00
Evan Hunt	6e11211ac6	minor pre-refactoring cleanups the use of "result" as a variable name for a boolean return value was confusing; all 'result' variables that are not isc_result_t have been renamed to 'ret'. The static function print_dns_name() was a duplicate of dns_name_print(), so it has been replaced with that. Changed INSIST to REQUIRE where appropriate, and added NULL initialization for pointer variables.	2022-03-30 09:55:00 +02:00
Ondřej Surý	3a650d973f	Remove isc_appctx_t use in dns_client The use of isc_appctx_t in dns_client was used to wait for dns_client_startresolve() to finish the processing (the resolve_done() task callback). This has been replaced with standard bool+cond+lock combination removing the need of isc_appctx_t altogether.	2022-03-29 14:14:49 -07:00
Ondřej Surý	b05a991ad0	Make isc_ht optionally case insensitive Previously, the isc_ht API would always take the key as a literal input to the hashing function. Change the isc_ht_init() function to take an 'options' argument, in which ISC_HT_CASE_SENSITIVE or _INSENSITIVE can be specified, to determine whether to use case-sensitive hashing in isc_hash32() when hashing the key.	2022-03-28 15:02:18 -07:00
Evan Hunt	e9ef3defa4	consolidate fibonacci hashing in one place Fibonacci hashing was implemented in four separate places (rbt.c, rbtdb.c, resolver.c, zone.c). This commit combines them into a single implementation. The hash_32() function is now replaced with isc_hash_bits32().	2022-03-28 14:44:21 -07:00
Ondřej Surý	4dceab142d	Consistently use UNREACHABLE() instead of ISC_UNREACHABLE() In a couple of places, we have missed INSIST(0) or ISC_UNREACHABLE() replacement on some branches with UNREACHABLE(). Replace all ISC_UNREACHABLE() or INSIST(0) calls with UNREACHABLE().	2022-03-28 23:26:08 +02:00
Artem Boldariev	57f0251713	Add support for Strict/Mutual TLS into BIND This commit adds support for Strict/Mutual TLS into BIND. It does so by implementing the backing code for 'hostname' and 'ca-file' options of the 'tls' statement. The commit also updates the documentation accordingly.	2022-03-28 16:22:53 +03:00
Artem Boldariev	89d7059103	Restore disabled unused 'tls' options: 'ca-file' and 'hostname' This commit restores the 'tls' options disabled in `78b73d0865`.	2022-03-28 16:22:53 +03:00
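
Commit 71ce8b0a51 in the log above says that a UDP retry count of zero used to mean unlimited retries because of an underflow to UINT_MAX. The sketch below illustrates that class of bug in isolation. It is not the BIND code: `sends_old()`, `sends_new()` and the `cap` parameter are hypothetical names invented for this example, and the old accounting is only assumed to decrement the counter before testing it.

```c
/*
 * Hypothetical model of the old accounting: the counter is decremented
 * before it is tested. Starting from 0, the unsigned decrement wraps to
 * UINT_MAX, so the loop is effectively unbounded. `cap` exists only to
 * keep this demonstration finite.
 */
static unsigned int
sends_old(unsigned int udpcount, unsigned int cap) {
	unsigned int sends = 0;

	do {
		sends++; /* one transmission of the request */
	} while (--udpcount != 0 && sends < cap);

	return sends;
}

/*
 * Hypothetical model of the new accounting: the argument is the number
 * of retries after the initial attempt, so 0 means "send once".
 */
static unsigned int
sends_new(unsigned int udpretries, unsigned int cap) {
	unsigned int sends = 1; /* the initial attempt */

	while (udpretries > 0 && sends < cap) {
		udpretries--;
		sends++; /* one retry */
	}

	return sends;
}
```

Under these assumptions, `sends_old(0, 1000)` runs until it hits the cap, while `sends_new(0, 1000)` sends exactly once. A nonzero argument also changes meaning: in this model 3 gives three sends under the old accounting and four under the new one, which may be why the commit also updates `mdig` to match.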

13849 Commits