bind9

Author	SHA1	Message	Date
Mark Andrews	0f6ae9000a	initalise sock->cond	2021-05-11 14:06:26 +02:00
Ondřej Surý	3713a38689	Bump the netmgr quantum to 1024 During the stress testing, it was discovered that the default netmgr quantum of 128 is not enough and there was a performance drop for TCP on FreeBSD. Bumping the default quantum to 1024 solves the performance issue and is still enough to prevent the endless loops.	2021-05-10 21:32:31 +02:00
Ondřej Surý	e623c12757	Destroy reference to taskmgr after all tasks are done We were clearing the pointer to taskmgr as soon as isc_taskmgr_destroy() would be called and before all tasks were finished. Unfortunately, some tasks would use global named_g_taskmgr objects from inside the events and this would cause either a data race or NULL pointer dereference. This commit fixes the data race by moving the destruction of the referenced pointer to the time after all tasks are finished.	2021-05-10 12:13:27 -07:00
Ondřej Surý	6c57a6cc3d	Add isc_taskmgr_detach when task is created while shutting down When taskmgr is shutting down, the creating the task would attach to the taskmgr, but don't detach on error condition.	2021-05-10 11:39:51 +02:00
Ondřej Surý	0133096c88	improvements to socket_test - be more strict, but patient, waiting for event completion. - use an atomic pointer for the socket to silence TSAN warnings.	2021-05-07 14:28:33 -07:00
Ondřej Surý	365c6a9851	ensure interlocked netmgr events run on worker[0] Network manager events that require interlock (pause, resume, listen) are now always executed in the same worker thread, mgr->workers[0], to prevent races. "stoplistening" events no longer require interlock.	2021-05-07 14:28:32 -07:00
Evan Hunt	c44423127d	fix shutdown deadlocks - ensure isc_nm_pause() and isc_nm_resume() work the same whether run from inside or outside of the netmgr. - promote 'stop' events to the priority event level so they can run while the netmgr is pausing or paused. - when pausing, drain the priority queue before acquiring an interlock; this prevents a deadlock when another thread is waiting for us to complete a task. - release interlock after pausing, reacquire it when resuming, so that stop events can happen. some incidental changes: - use a function to enqueue pause and resume events (this was part of a different change attempt that didn't work out; I kept it because I thought was more readable). - make mgr->nworkers a signed int to remove some annoying integer casts.	2021-05-07 14:28:32 -07:00
Ondřej Surý	4c8f6ebeb1	Use barriers for netmgr synchronization The netmgr listening, stoplistening, pausing and resuming functions now use barriers for synchronization, which makes the code much simpler. isc/barrier.h defines isc_barrier macros as a front-end for uv_barrier on platforms where that works, and pthread_barrier where it doesn't (including TSAN builds).	2021-05-07 14:28:32 -07:00
Ondřej Surý	2eae7813b6	Run isc__nm_http_stoplistening() synchronously in netmgr When isc__nm_http_stoplistening() is run from inside the netmgr, we need to make sure it's run synchronously. This commit is just a band-aid though, as the desired behvaior for isc_nm_stoplistening() is not always the same: 1. When run from outside user of the interface, the call must be synchronous, e.g. the calling code expects the call to really stop listening on the interfaces. 2. But if there's a call from listen<proto> when listening fails, that needs to be scheduled to run asynchronously, because isc_nm_listen<proto> is being run in a paused (interlocked) netmgr thread and we could get stuck. The proper solution would be to make isc_nm_stoplistening() behave like uv_close(), i.e., to have a proper callback.	2021-05-07 14:28:32 -07:00
Evan Hunt	5c08f97791	only run tasks as privileged if taskmgr is in privileged mode all zone loading tasks have the privileged flag, but we only want them to run as privileged tasks when the server is being initialized; if we privilege them the rest of the time, the server may hang for a long time after a reload/reconfig. so now we call isc_taskmgr_setmode() to turn privileged execution mode on or off in the task manager. isc_task_privileged() returns true if the task's privilege flag is set and the taskmgr is in privileged execution mode. this is used to determine in which netmgr event queue the task should be run.	2021-05-07 14:28:30 -07:00
Ondřej Surý	29a208aaf7	Fix crash when allocating UDP socket fails on OpenBSD When socket() call fails, the UDP connect code would call the connectcb with empty req->handle. This has been fixed.	2021-05-07 14:28:30 -07:00
Ondřej Surý	dacf586e18	Make the netmgr queue processing quantized There was a theoretical possibility of clogging up the queue processing with an endless loop where currently processing netievent would schedule new netievent that would get processed immediately. This wasn't such a problem when only netmgr netievents were processed, but with the addition of the tasks, there are at least two situation where this could happen: 1. In lib/dns/zone.c:setnsec3param() the task would get re-enqueued when the zone was not yet fully loaded. 2. Tasks have internal quantum for maximum number of isc_events to be processed, when the task quantum is reached, the task would get rescheduled and then immediately processed by the netmgr queue processing. As the isc_queue doesn't have a mechanism to atomically move the queue, this commit adds a mechanism to quantize the queue, so enqueueing new netievents will never stop processing other uv_loop_t events. The default quantum size is 128. Since the queue used in the network manager allows items to be enqueued more than once, tasks are now reference-counted around task_ready() and task_run(). task_ready() now has a public API wrapper, isc_task_ready(), that the netmgr can use to reschedule processing of a task if the quantum has been reached. Incidental changes: Cleaned up some unused fields left in isc_task_t and isc_taskmgr_t after the last refactoring, and changed atomic flags to atomic_bools for easier manipulation.	2021-05-07 14:28:30 -07:00
Ondřej Surý	b5bf58b419	Destroy netmgr before destroying taskmgr With taskmgr running on top of netmgr, the ordering of how the tasks and netmgr shutdown interacts was wrong as previously isc_taskmgr_destroy() was waiting until all tasks were properly shutdown and detached. This responsibility was moved to netmgr, so we now need to do the following: 1. shutdown all the tasks - this schedules all shutdown events onto the netmgr queue 2. shutdown the netmgr - this also makes sure all the tasks and events are properly executed 3. Shutdown the taskmgr - this now waits for all the tasks to finish running before returning 4. Shutdown the netmgr - this call waits for all the netmgr netievents to finish before returning This solves the race when the taskmgr object would be destroyed before all the tasks were finished running in the netmgr loops.	2021-05-07 14:28:30 -07:00
Ondřej Surý	a011d42211	Add new isc_managers API to simplify <>mgr create/destroy Previously, netmgr, taskmgr, timermgr and socketmgr all had their own isc_<>mgr_create() and isc_<>mgr_destroy() functions. The new isc_managers_create() and isc_managers_destroy() fold all four into a single function and makes sure the objects are created and destroy in correct order. Especially now, when taskmgr runs on top of netmgr, the correct order is important and when the code was duplicated at many places it's easy to make mistake. The former isc_<>mgr_create() and isc_<*>mgr_destroy() functions were made private and a single call to isc_managers_create() and isc_managers_destroy() is required at the program startup / shutdown.	2021-05-07 10:19:05 -07:00
Artem Boldariev	8c0ea01f34	DoH: close active server streams when finishing session Under some circumstances a situation might occur when server-side session gets finished while there are still active HTTP/2 streams. This would lead to isc_nm_httpsocket object leaks. This commit fixes this behaviour as well as refactors failed_read_cb() to allow better code reuse.	2021-05-07 15:47:24 +03:00
Artem Boldariev	a9e97f28b7	Fix crash in client side DoH code This commit fixes a situation when a cstream object could get unlinked from the list as a result of a cstream->read_cb call. Thus, unlinking it after the call could crash the program.	2021-05-07 15:47:24 +03:00
Artem Boldariev	cd178043d9	Make some TLS tests actually use quota A directive to check quota was missing from some of the TLS tests which were supposed to test TLS code with quotas.	2021-05-07 15:47:24 +03:00
Artem Boldariev	22376fc69a	TLS: cancel reading on the underlying TCP socket after (see below) ... the last handle has been detached after calling write callback. That makes it possible to detach from the underlying socket and not to keep the socket object alive for too long. This issue was causing TLS tests with quota to fail because quota might not have been detached on time (because it was still referenced by the underlying TCP socket). One could say that this commit is an ideological continuation of: `513cdb52ec`.	2021-05-07 15:47:24 +03:00
Artem Boldariev	3bf331c453	Fix crashes in TLS when handling TLS shutdown messages This commit fixes some situations which could appear in TLS code when dealing with shutdown messages and lead to crashes.	2021-05-07 15:47:24 +03:00
Artem Boldariev	0d3f503dc9	Avoid creating connect netievents during low level failures in HTTP This way we create less netievent objects, not bombarding NM with the messages in case of numerous low-level errors (like too many open files) in e.g. unit tests.	2021-05-07 15:47:24 +03:00
Artem Boldariev	0e8ac61d6e	Avoid creating httpclose netievents in case of low level failures This way we create less load on NM workers by avoiding netievent creation.	2021-05-07 15:47:24 +03:00
Artem Boldariev	8510c5cd59	Always call TCP connect callback from within a worker context This change ensures that a TCP connect callback is called from within the context of a worker thread in case of a low-level error when descriptors cannot be created (e.g. when there are too many open file descriptors).	2021-05-07 15:47:24 +03:00
Artem Boldariev	1349142333	Got rid of tlsconnect event and corresponding code We do not need it since we decided to not return values from connect functions.	2021-05-07 15:47:24 +03:00
Artem Boldariev	39448c1581	Finish HTTP session on write failure Not doing so caused client-side code to not free file descriptors as soon as possible, that was causing unit tests to fail.	2021-05-07 15:47:24 +03:00
Artem Boldariev	4c5b36780b	Fix flawed DoH unit tests logic This commit fixes some logical mistakes in DoH unit tests logic, causing them either to fail or not to do what they are intended to do.	2021-05-07 15:47:24 +03:00
Matthijs Mekking	66f2cd228d	Use isdigit instead of checking character range When looking for key files, we could use isdigit rather than checking if the character is within the range [0-9]. Use (unsigned char) cast to ensure the value is representable in the unsigned char type (as suggested by the isdigit manpage). Change " & 0xff" occurrences to the recommended (unsigned char) type cast.	2021-05-05 19:15:33 +02:00
Matthijs Mekking	511bc1b882	Check for filename clashes /w dnssec-policy zones Just like with dynamic and/or inline-signing zones, check if no two or more zone configurations set the same filename. In these cases, the zone files are not read-only and named-checkconf should catch a configuration where multiple zone statements write to the same file. Add some bad configuration tests where KASP zones reference the same zone file. Update the good-kasp test to allow for two zones configure the same file name, dnssec-policy none.	2021-05-05 19:13:55 +02:00
Mark Andrews	4a8e33b9f0	Always perform a re-write when processing a version 1 journal version 1 journals may have a mix of type 1 and type 2 transaction headers so always use the recovery code.	2021-05-05 23:12:37 +10:00
Matthijs Mekking	22243ac804	Remove double "insecure" check This was a conflict between `dba13d280a` and `636ff1e15c`.	2021-05-05 12:45:47 +02:00
Matthijs Mekking	636ff1e15c	No longer need to strcmp for "none" When we introduced "dnssec-policy insecure" we could have removed the 'strcmp' check for "none", because if it was set to "none", the 'kasp' variable would have been set to NULL.	2021-05-05 11:23:53 +02:00
Matthijs Mekking	6a60bf637d	Update smart signing when key is offline BIND 9 is smart about when to sign with what key. If a key is offline, BIND will delete the old signature anyway if there is another key to sign the RRset with. With KASP we don't want to fallback to the KSK if the ZSK is missing, only for the SOA RRset. If the KSK is missing, but we do have a ZSK, deleting the signature is fine. Otherwise it depends on if we use KASP or not. Update the 'delsig_ok' function to reflect that.	2021-05-05 11:13:19 +02:00
Matthijs Mekking	3e6fc49c16	Don't roll offline keys When checking the current DNSSEC state against the policy, consider offline keys. If we didn't found an active key, check if the key is offline by checking the public key list. If there is a match in the public key list (the key data is retrieved from the .key and the .state files), treat the key as offline and don't create a successor key for it.	2021-05-05 11:13:19 +02:00
Matthijs Mekking	7ed089576f	Try to read state when reading keylist from rdata The function 'dns_dnssec_keylistfromrdataset()' creates a keylist from the DNSKEY RRset. If we attempt to read the private key, we also store the key state. However, if the private key is offline, the key state will not be stored. To fix this, first attempt to read the public key file. If then reading the private key file fails, and we do have a public key, add that to the keylist, with appropriate state. If we also failed to read the public key file, add the DNSKEY to the keylist, as we did before.	2021-05-05 11:13:19 +02:00
Matthijs Mekking	fa05c1b8da	When reading public key from file, also read state The 'dst_key_fromnamedfile()' function did not read and store the key state from the .state file when reading a public key file.	2021-05-05 11:13:19 +02:00
Matthijs Mekking	cf17698f87	Fix a kasp lock issue The kasp lock would stay locked if 'dns_keymgr_run' failed.	2021-05-05 11:13:19 +02:00
Mark Andrews	dba13d280a	named-checkconf now detects redefinition of dnssec-policy 'insecure'	2021-05-05 16:23:19 +10:00
Mark Andrews	b3301da262	inline-signing should have been in zone_only_clauses	2021-05-04 23:35:59 +00:00
Ondřej Surý	d2dba352cb	Fix RFC292 -> RFC952 typo	2021-05-04 17:21:52 +02:00
Ondřej Surý	dfd56b84f5	Add support for generating backtraces on Windows This commit adds support for generating backtraces on Windows and refactors the isc_backtrace API to match the Linux/BSD API (without the isc_ prefix) * isc_backtrace_gettrace() was renamed to isc_backtrace(), the third argument was removed and the return type was changed to int * isc_backtrace_symbols() was added * isc_backtrace_symbols_fd() was added and used as appropriate	2021-05-03 20:31:52 +02:00
Ondřej Surý	37c0d196e3	Use uv_sleep in the netmgr code libuv added uv_sleep(unsigned int msec) to the API since 1.34.0. Use that in the netmgr code and define usleep based shim for libuv << 1.34.0.	2021-05-03 20:22:54 +02:00
Ondřej Surý	c37ff5d188	Add nanosleep and usleep Windows shims This commit adds POSIX nanosleep() and usleep() shim implementation for Windows to help implementors use less #ifdef _WIN32 in the code.	2021-05-03 20:22:54 +02:00
Ondřej Surý	cd54bbbd9a	Add trampoline around iocompletionport_createthreads() On Windows, the iocompletionport_createthreads() didn't use isc_thread_create() to create new threads for processing IO, but just a simple CreateThread() function that completely circumvent the isc_trampoline mechanism to initialize global isc_tid_v. This lead to segmentation fault in isc_hp API because '-1' isn't valid index to the hazard pointer array. This commit changes the iocompletionport_createthreads() to use isc_thread_create() instead of CreateThread() to properly initialize isc_tid_v.	2021-05-03 20:21:15 +02:00
Diego Fronza	7729844150	Address comparison of integers with different signedess	2021-05-03 06:54:30 +00:00
Matthijs Mekking	2710d9a11d	Add built-in dnssec-policy "insecure" Add a new built-in policy "insecure", to be used to gracefully unsign a zone. Previously you could just remove the 'dnssec-policy' configuration from your zone statement, or remove it. The built-in policy "none" (or not configured) now actually means no DNSSEC maintenance for the corresponding zone. So if you immediately reconfigure your zone from whatever policy to "none", your zone will temporarily be seen as bogus by validating resolvers. This means we can remove the functions 'dns_zone_use_kasp()' and 'dns_zone_secure_to_insecure()' again. We also no longer have to check for the existence of key state files to figure out if a zone is transitioning to insecure.	2021-04-30 11:18:38 +02:00
Mark Andrews	8510ccaa54	Update ZONEMD to match RFC 8976 * The location of the digest type field has changed to where the reserved field was. * The reserved field is now called scheme and is where the digest type field was. * Digest type 2 has been defined (SHA256).	2021-04-30 10:43:37 +10:00
Mark Andrews	af02bbcdd6	Mark DNSSEC responses with NSEC3 records that exceed 150 as insecure	2021-04-29 13:43:40 +02:00
Matthijs Mekking	efa5d84dcf	dnssec-policy: reduce NSEC3 iterations to 150 When reducing the number of NSEC3 iterations to 150, commit `aa26cde2ae` added tests for dnssec-policy to check that a too high iteration count is a configuration failure. The test is not sufficient because 151 was always too high for ECDSAP256SHA256. The test should check for a different algorithm. There was an existing test case that checks for NSEC3 iterations. Update the test with the new maximum values. Update the code in 'kaspconf.c' to allow at most 150 iterations.	2021-04-29 10:41:16 +02:00
Mark Andrews	c1190a3fe0	Handle DNAME lookup via itself When answering a query, named should never attempt to add the same RRset to the ANSWER section more than once. However, such a situation may arise when chasing DNAME records: one of the DNAME records placed in the ANSWER section may turn out to be the final answer to a client query, but there is no way to know that in advance. Tweak the relevant INSIST assertion in query_respond() so that it handles this case properly. qctx->rdataset is freed later anyway, so there is no need to clean it up in query_respond().	2021-04-29 10:30:00 +02:00
Mark Andrews	5318627227	Unload a zone if a transfer breaks its SOA record If a zone transfer results in a zone not having any NS records, named stops serving it because such a zone is broken. Do the same if an incoming zone transfer results in a zone lacking an SOA record at the apex or containing more than one SOA record.	2021-04-29 10:30:00 +02:00
Mark Andrews	2ca0bd882d	Address inconsistencies in checking added RRsets loading_addrdataset() rejects SOA RRsets which are not at top of zone. addrdataset() should similarly reject such RRsets.	2021-04-29 10:30:00 +02:00

1 2 3 4 5 ...

13205 Commits