bind9

Author	SHA1	Message	Date
Ondřej Surý	f022d2db8c	Add CHANGES and release note for [GL #2638 ]	2021-04-20 23:22:28 +02:00
Ondřej Surý	b540722bc3	Refactor taskmgr to run on top of netmgr This commit changes the taskmgr to run the individual tasks on the netmgr internal workers. While an effort has been put into keeping the taskmgr interface intact, a couple of changes have been made: * The taskmgr has no concept of universal privileged mode - rather the tasks are either privileged or unprivileged (normal). The privileged tasks are run as a first thing when the netmgr is unpaused. There are now four different queues in the netmgr: 1. priority queue - netievents on the priority queue are run even when the taskmgr enters exclusive mode and netmgr is paused. This is needed to properly start listening on the interfaces, free resources and resume. 2. privileged task queue - only privileged tasks are queued here and this is the first queue that gets processed when network manager is unpaused using isc_nm_resume(). All netmgr workers need to clean the privileged task queue before they all proceed with normal operation. Both task queues are processed when the workers are finished. 3. task queue - only (traditional) tasks are scheduled here and this queue along with privileged task queues are processed when the netmgr workers are finishing. This is needed to process the task shutdown events. 4. normal queue - this is the queue with netmgr events, e.g. reading, sending, callbacks and pretty much everything is processed here. * The isc_taskmgr_create() now requires an initialized netmgr (isc_nm_t) object. * The isc_nm_destroy() function now waits for indefinite time, but it will print out the active objects when in tracing mode (-DNETMGR_TRACE=1 and -DNETMGR_TRACE_VERBOSE=1), the netmgr has been made a little bit more asynchronous and it might take longer time to shut down all the active networking connections. * Previously, the isc_nm_stoplistening() was a synchronous operation. This has been changed and the isc_nm_stoplistening() just schedules the child sockets to stop listening and exits. 
This was needed to prevent a deadlock as the (traditional) tasks are now executed on the netmgr threads. * The socket selection logic in isc__nm_udp_send() was flawed, but fortunately, it was broken, so we never hit the problem where we created uvreq_t on a socket from nmhandle_t, but then a different socket could be picked up and then we were trying to run the send callback on a socket that had different threadid than currently running.	2021-04-20 23:22:28 +02:00
Ondřej Surý	ae9edb1861	Merge branch '2630-tsan-error-in-xfrin-c' into 'main' Prevent the double xfrin_fail() call Closes #2630 See merge request isc-projects/bind9!4898	2021-04-20 12:13:35 +00:00
Ondřej Surý	30252aed88	Add CHANGES note for [GL #2630 ]	2021-04-20 14:12:26 +02:00
Ondřej Surý	6cf6de55bc	Prevent the double xfrin_fail() call When we are reading from the xfrin socket, and the transfer would be shut down, the shutdown function would call `xfrin_fail()` which in turn calls `xfrin_cancelio()` that causes the read callback to be invoked with `ISC_R_CANCELED` status code and that caused yet another `xfrin_fail()` call. The fix here is to ensure the `xfrin_fail()` would be run only once properly using better synchronization on xfr->shuttingdown flag.	2021-04-20 14:12:26 +02:00
Ondřej Surý	27960e4208	Merge branch '2637-threadsanitizer-lock-order-inversion-potential-deadlock-in-zone_refreshkeys' into 'main' Fix lock-order-inversion (potential deadlock) in dns_resolver_createfetch Closes #2637 See merge request isc-projects/bind9!4909	2021-04-19 20:29:56 +00:00
Ondřej Surý	25d27851d8	Fix lock-order-inversion (potential deadlock) in dns_resolver_createfetch There's a lock-order-inversion when running `zone_maintenance()` from the timer while shutting down the server `shutdown_server()`. This only happens when the taskmgr scheduling is more relaxed and parallelized, but the issue is real nevertheless. The associated ThreadSanitizer warning: WARNING: ThreadSanitizer: lock-order-inversion (potential deadlock) Cycle in lock order graph: M1 (0x000000000001) => M2 (0x000000000000) => M1 Mutex M2 acquired here while holding mutex M1 in thread T1: #0 pthread_mutex_lock <null> #1 dns_view_findzonecut lib/dns/view.c:1326:2 #2 fctx_create lib/dns/resolver.c:5144:13 #3 dns_resolver_createfetch lib/dns/resolver.c:10977:12 #4 zone_refreshkeys lib/dns/zone.c:10830:13 #5 zone_maintenance lib/dns/zone.c:11065:5 #6 zone_timer lib/dns/zone.c:14652:2 #7 task_run lib/isc/task.c:857:5 #8 isc_task_run lib/isc/task.c:944:10 #9 isc__nm_async_task lib/isc/netmgr/netmgr.c:730:24 #10 process_netievent lib/isc/netmgr/netmgr.c #11 process_queue lib/isc/netmgr/netmgr.c:885:8 #12 process_tasks_queue lib/isc/netmgr/netmgr.c:756:10 #13 process_queues lib/isc/netmgr/netmgr.c:772:7 #14 async_cb lib/isc/netmgr/netmgr.c:671:2 #15 uv__async_io /home/ondrej/Projects/tsan/libuv/src/unix/async.c:163:5 #16 uv__io_poll /home/ondrej/Projects/tsan/libuv/src/unix/linux-core.c:462:11 #17 uv_run /home/ondrej/Projects/tsan/libuv/src/unix/core.c:392:5 #18 nm_thread lib/isc/netmgr/netmgr.c:597:11 #19 isc__trampoline_run lib/isc/trampoline.c:184:11 Mutex M1 previously acquired by the same thread here: #0 pthread_mutex_lock <null> #1 zone_refreshkeys lib/dns/zone.c:10717:2 #2 zone_maintenance lib/dns/zone.c:11065:5 #3 zone_timer lib/dns/zone.c:14652:2 #4 task_run lib/isc/task.c:857:5 #5 isc_task_run lib/isc/task.c:944:10 #6 isc__nm_async_task lib/isc/netmgr/netmgr.c:730:24 #7 process_netievent lib/isc/netmgr/netmgr.c #8 process_queue lib/isc/netmgr/netmgr.c:885:8 #9 
process_tasks_queue lib/isc/netmgr/netmgr.c:756:10 #10 process_queues lib/isc/netmgr/netmgr.c:772:7 #11 async_cb lib/isc/netmgr/netmgr.c:671:2 #12 uv__async_io /home/ondrej/Projects/tsan/libuv/src/unix/async.c:163:5 #13 uv__io_poll /home/ondrej/Projects/tsan/libuv/src/unix/linux-core.c:462:11 #14 uv_run /home/ondrej/Projects/tsan/libuv/src/unix/core.c:392:5 #15 nm_thread lib/isc/netmgr/netmgr.c:597:11 #16 isc__trampoline_run lib/isc/trampoline.c:184:11 Mutex M1 acquired here while holding mutex M2 in thread T2: #0 pthread_mutex_lock <null> #1 dns_zone_flush lib/dns/zone.c:11443:2 #2 view_flushanddetach lib/dns/view.c:657:5 #3 dns_view_flushanddetach lib/dns/view.c:690:2 #4 shutdown_server bin/named/server.c:10056:4 #5 task_run lib/isc/task.c:857:5 #6 isc_task_run lib/isc/task.c:944:10 #7 isc__nm_async_task lib/isc/netmgr/netmgr.c:730:24 #8 process_netievent lib/isc/netmgr/netmgr.c #9 process_queue lib/isc/netmgr/netmgr.c:885:8 #10 process_tasks_queue lib/isc/netmgr/netmgr.c:756:10 #11 process_queues lib/isc/netmgr/netmgr.c:772:7 #12 async_cb lib/isc/netmgr/netmgr.c:671:2 #13 uv__async_io /home/ondrej/Projects/tsan/libuv/src/unix/async.c:163:5 #14 uv__io_poll /home/ondrej/Projects/tsan/libuv/src/unix/linux-core.c:462:11 #15 uv_run /home/ondrej/Projects/tsan/libuv/src/unix/core.c:392:5 #16 nm_thread lib/isc/netmgr/netmgr.c:597:11 #17 isc__trampoline_run lib/isc/trampoline.c:184:11 Mutex M2 previously acquired by the same thread here: #0 pthread_mutex_lock <null> #1 view_flushanddetach lib/dns/view.c:645:3 #2 dns_view_flushanddetach lib/dns/view.c:690:2 #3 shutdown_server bin/named/server.c:10056:4 #4 task_run lib/isc/task.c:857:5 #5 isc_task_run lib/isc/task.c:944:10 #6 isc__nm_async_task lib/isc/netmgr/netmgr.c:730:24 #7 process_netievent lib/isc/netmgr/netmgr.c #8 process_queue lib/isc/netmgr/netmgr.c:885:8 #9 process_tasks_queue lib/isc/netmgr/netmgr.c:756:10 #10 process_queues lib/isc/netmgr/netmgr.c:772:7 #11 async_cb lib/isc/netmgr/netmgr.c:671:2 #12 
uv__async_io /home/ondrej/Projects/tsan/libuv/src/unix/async.c:163:5 #13 uv__io_poll /home/ondrej/Projects/tsan/libuv/src/unix/linux-core.c:462:11 #14 uv_run /home/ondrej/Projects/tsan/libuv/src/unix/core.c:392:5 #15 nm_thread lib/isc/netmgr/netmgr.c:597:11 #16 isc__trampoline_run lib/isc/trampoline.c:184:11 Thread T2 (running) created by main thread at: #0 pthread_create <null> #1 isc_thread_create lib/isc/pthreads/thread.c:79:8 #2 isc_nm_start lib/isc/netmgr/netmgr.c:303:3 #3 create_managers bin/named/main.c:957:15 #4 setup bin/named/main.c:1267:11 #5 main bin/named/main.c:1558:2 Thread T2 (running) created by main thread at: #0 pthread_create <null> #1 isc_thread_create lib/isc/pthreads/thread.c:79:8 #2 isc_nm_start lib/isc/netmgr/netmgr.c:303:3 #3 create_managers bin/named/main.c:957:15 #4 setup bin/named/main.c:1267:11 #5 main bin/named/main.c:1558:2 SUMMARY: ThreadSanitizer: lock-order-inversion (potential deadlock) in __interceptor_pthread_mutex_lock	2021-04-19 22:29:14 +02:00
Ondřej Surý	3f0c8189e7	Merge branch 'ondrej/cleanup-ISCAPI-remnants' into 'main' Cleanup the public vs private ISCAPI remnants See merge request isc-projects/bind9!4911	2021-04-19 11:48:14 +00:00
Ondřej Surý	16fe0d1f41	Cleanup the public vs private ISCAPI remnants Since all the libraries are internal now, just cleanup the ISCAPI remnants in isc_socket, isc_task and isc_timer APIs. This means, there's one less layer as following changes have been done: * struct isc_socket and struct isc_socketmgr have been removed * struct isc__socket and struct isc__socketmgr have been renamed to struct isc_socket and struct isc_socketmgr * struct isc_task and struct isc_taskmgr have been removed * struct isc__task and struct isc__taskmgr have been renamed to struct isc_task and struct isc_taskmgr * struct isc_timer and struct isc_timermgr have been removed * struct isc__timer and struct isc__timermgr have been renamed to struct isc_timer and struct isc_timermgr * All the associated code that dealt with typing isc_<foo> to isc__<foo> and back has been removed.	2021-04-19 13:18:24 +02:00
Ondřej Surý	5937c1f787	Merge branch 'ondrej/fixup-bin/tests/system/resolve-win32-directories' into 'main' Fixup win32 paths for moved bin/tests/system/resolve See merge request isc-projects/bind9!4916	2021-04-19 11:15:12 +00:00
Ondřej Surý	f14e678624	Fixup win32 paths for moved bin/tests/system/resolve When resolve.c was moved from lib/samples to bin/tests/system, the resolve.vcxproj.in would still contain old paths to the directory root. This commit adds one more ..\ to match the directory depth. Additionally, fixup the path in BINDInstall.vcxproj.in to be bin/tests/system and not bin/tests/samples.	2021-04-19 12:51:33 +02:00
Ondřej Surý	11d3c475bd	Merge branch '2636-timing-race-in-setnsec3param-task' into 'main' Fix task timing race in setnsec3param() Closes #2636 See merge request isc-projects/bind9!4908	2021-04-19 09:22:31 +00:00
Ondřej Surý	0127ba6472	Fix task timing race in setnsec3param() When setnsec3param() is scheduled from zone_postload() there's no guarantee that `zone->db` is not `NULL` yet. Thus when the setnsec3param() is called, we need to check for `zone->db` existence and reschedule the task, because calling `rss_post()` on a zone with empty `.db` ends up as a no-op (the function just returns).	2021-04-19 11:16:51 +02:00
Ondřej Surý	c216ace42a	Merge branch 'ondrej/cleanup-double-createinctx-constructors' into 'main' Cleanup the isc_<*>mgr_createinc() constructors See merge request isc-projects/bind9!4910	2021-04-19 08:26:32 +00:00
Ondřej Surý	3388ef36b3	Cleanup the isc_<*>mgr_createinc() constructors Previously, the taskmgr, timermgr and socketmgr had a constructor variant, that would create the mgr on top of existing appctx. This was no longer true and isc_<*>mgr was just calling isc_<*>mgr_create() directly without any extra code. This commit just cleans up the extra function.	2021-04-19 10:22:56 +02:00
Mark Andrews	eb32677de5	Merge branch 'marka-initialise-resarg-lock' into 'main' properly initialise resarg->lock See merge request isc-projects/bind9!4907	2021-04-19 04:59:39 +00:00
Mark Andrews	eadb829dac	properly initialise resarg->lock	2021-04-19 14:32:40 +10:00
Ondřej Surý	48c703e3d6	Merge branch 'each-cleanup-dns_client' into 'main' clean up dns_client and related samples See merge request isc-projects/bind9!4835	2021-04-16 13:35:44 +00:00
Evan Hunt	07e349de3c	Add CHANGES note for [GL !4835 ]	2021-04-16 14:31:10 +02:00
Evan Hunt	d0ec7d1f33	move samples/resolve.c to bin/tests/system "resolve" is used by the resolver system tests, and I'm not certain whether delv exercises the same code, so rather than remove it, I moved it to bin/tests/system.	2021-04-16 14:29:43 +02:00
Evan Hunt	056afe7bdc	remove sample-async sample code for export libraries is no longer needed and this code is not used for any internal tests. also, sample-gai.c had already been removed but there were some dangling references.	2021-04-16 14:29:43 +02:00
Evan Hunt	568d455c99	rename dns_client_createx() to dns_client_create() there's no longer a need to use an alternate name.	2021-04-16 14:29:43 +02:00
Evan Hunt	1beb05f3e2	remove dns_client_request() and related code continues the cleanup of dns_client started in the previous commit.	2021-04-16 14:29:43 +02:00
Evan Hunt	fb2a352e7c	remove dns_client_update() and related code the libdns client API is no longer being maintained for external use, we can remove the code that isn't being used internally, as well as the related tests.	2021-04-16 14:29:43 +02:00
Michał Kępień	409883adbd	Merge branch '2634-placeholder' into 'main' Add placeholder for GL #2634 See merge request isc-projects/bind9!4901	2021-04-16 12:20:37 +00:00
Michał Kępień	cd91f1c189	Add placeholder for GL #2634	2021-04-16 14:16:57 +02:00
Ondřej Surý	1ab099aa3b	Merge branch '2623-9-16-13-overwrites-master-files-if-old-format-jnl-files-are-present' into 'main' Resolve "9.16.13 overwrites master files if old format .jnl files are present" Closes #2623 See merge request isc-projects/bind9!4880	2021-04-16 11:12:19 +00:00
Mark Andrews	3df637fb83	Add CHANGES and release note for [GL #2623 ]	2021-04-16 12:04:06 +02:00
Ondřej Surý	55b942b4a0	Refactor dns_journal_rollforward() to work over opened journal Too much logic was crammed into the dns_journal_rollforward() that made it harder to follow. The dns_journal_rollforward() was refactored to work over already opened journal and some of the previous logic was moved to new static zone_journal_rollforward() that separates the journal "rollforward" logic from the "zone" logic.	2021-04-16 12:04:06 +02:00
Mark Andrews	ec7a9af381	Fixing a recoverable journal should not result in the zone being written when dns_journal_rollforward returned ISC_R_RECOVERABLE the distinction between 'up to date' and 'success' was lost, as a consequence zone_needdump() was called writing out the zone file when it shouldn't have been. This change restores that distinction. Adjust system test to reflect visible changes.	2021-04-16 11:15:46 +02:00
Artem Boldariev	10ba2a40e2	Merge branch 'artem/move-tls-tests-into-netmgr-tests' into 'main' Merge the tls_test.c into netmgr_test.c and extend the tests suite See merge request isc-projects/bind9!4893	2021-04-15 14:03:14 +00:00
Artem Boldariev	66432dcd65	Handle a situation when SSL shutdown messages were sent and received It fixes a corner case which was causing dig to print annoying messages like: 14-Apr-2021 18:48:37.099 SSL error in BIO: 1 TLS error (errno: 0). Arguments: received_data: (nil), send_data: (nil), finish: false even when all the data was properly processed.	2021-04-15 15:49:36 +03:00
Artem Boldariev	513cdb52ec	TLS: try to close TCP socket descriptor earlier when possible Before this fix underlying TCP sockets could remain opened for longer than it is actually required, causing unit tests to fail with lots of ISC_R_TOOMANYOPENFILES errors. The change also enables graceful SSL shutdown (before that it would happen only in the case when isc_nm_cancelread() were called).	2021-04-15 15:49:36 +03:00
Ondřej Surý	202b1d372d	Merge the tls_test.c into netmgr_test.c and extend the tests suite This commit merges TLS tests into the common Network Manager unit tests suite and extends the unit test framework to include support for additional "ping-pong" style tests where all data could be sent via fewer connections (the behaviour of the old test suite). The tests for TCP and TLS were extended to make use of the new mode, as this mode better translates to how the code is used in DoH. Both TLS and TCP tests now share most of the unit tests' code, as they are expected to function similarly from a user's perspective anyway. Additionally to the above, the TLS test suite was extended to include TLS tests using the connections quota facility.	2021-04-15 15:49:36 +03:00
Matthijs Mekking	2aad659550	Merge branch 'matthijs-keymgr2kasp-rework' into 'main' Update keymgr2kasp system test Closes #2631 See merge request isc-projects/bind9!4894	2021-04-15 12:45:25 +00:00
Matthijs Mekking	0de5a576c5	Fix view-related issues in the "keymgr2kasp" test Due to the lack of "match-clients" clauses in ns4/named2.conf.in, the same view is incorrectly chosen for all queries received by ns4 in the "keymgr2kasp" system test. This causes only one version of the "view-rsasha256.kasp" zone to actually be checked. Add "match-clients" clauses to ns4/named2.conf.in to ensure the test really checks what it claims to. Use identical view names ("ext", "int") in ns4/named.conf.in and ns4/named2.conf.in so that it is easier to quickly identify the differences between these two files. Update tests.sh to account for the above changes. Also fix a copy-paste error in a comment to prevent confusion.	2021-04-15 10:05:52 +02:00
Matthijs Mekking	ee02453a2e	Merge branch '1100-inline-signing-include' into 'main' Resolve "Test inline-signing with $INCLUDE" Closes #1100 See merge request isc-projects/bind9!4825	2021-04-14 09:35:35 +00:00
Matthijs Mekking	bad5c3cf7c	Document relative path of $INCLUDE Add to the ARM from where relative paths are searched.	2021-04-14 10:04:40 +02:00
Matthijs Mekking	96583e7c40	Fix inline test with missing $INCLUDE The test case for a zone with a missing include file was wrong for two reasons: 1. It was loading the wrong file (master5 instead of master6) 2. It did actually not set the $ret variable to 1 if the test failed (it should default to ret=1 and clear the variable if the appropriate log is found).	2021-04-14 10:04:40 +02:00
Matthijs Mekking	6463ee0f40	Add inline-signing with $INCLUDE test Add a test case for inline-signing for a zone with an $INCLUDE statement. There is already a test for a missing include file, this one adds a test for a zone with an include file that does exist. Test if the record in the included file is loaded.	2021-04-14 10:04:40 +02:00
Matthijs Mekking	bb01030252	Merge branch '2347-draft-vandijk-dnsop-nsec-ttl' into 'main' Implement draft-vandijk-dnsop-nsec-ttl Closes #2347 See merge request isc-projects/bind9!4506	2021-04-13 12:14:53 +00:00
Matthijs Mekking	8fcbef2423	Small refactor lib/dns/zone.c Introduce some macros that can be reused in 'zone_load_soa_rr()' and 'zone_get_from_db()' to make those functions more readable.	2021-04-13 11:26:26 +02:00
Matthijs Mekking	032110bd2e	Use designated initializer in dns_zone_create Shorten the code and make it less prone to initialisation errors (it is still easy to forget adding an initializer, but it now defaults to 0).	2021-04-13 11:26:26 +02:00
Matthijs Mekking	3b68dd426f	Add change entry and release note for [#2347 ]	2021-04-13 11:26:26 +02:00
Matthijs Mekking	9af8caa733	Implement draft-vandijk-dnsop-nsec-ttl The draft says that the NSEC(3) TTL must have the same TTL value as the minimum of the SOA MINIMUM field and the SOA TTL. This was always the intended behaviour. Update the zone structure to also track the SOA TTL. Whenever we use the MINIMUM value to determine the NSEC(3) TTL, use the minimum of MINIMUM and SOA TTL instead. There is no specific test for this, however two tests need adjusting because otherwise they failed: They were testing for NSEC3 records including the TTL. Update these checks to use 600 (the SOA TTL), rather than 3600 (the SOA MINIMUM).	2021-04-13 11:26:26 +02:00
Matthijs Mekking	8ffb4b0a13	Merge branch '2289-cache-dump-stale-ttl-weird-values' into 'main' Fix nonsensical stale TTL values in cache dump Closes #2289 See merge request isc-projects/bind9!4799	2021-04-13 08:54:49 +00:00
Matthijs Mekking	a83c8cb0af	Use stale TTL as RRset TTL in dumpdb It is more intuitive to have the countdown 'max-stale-ttl' as the RRset TTL, instead of 0 TTL. This information was already available in a comment "; stale (will be retained for x more seconds", but Support suggested to put it in the TTL field instead.	2021-04-13 09:48:20 +02:00
Matthijs Mekking	debee6157b	Check staleness in bind_rdataset Before binding an RRset, check the time and see if this record is stale (or perhaps even ancient). Marking a header stale or ancient happens only when looking up an RRset in cache, but binding an RRset can also happen on other occasions (for example when dumping the database). Check the time and compare it to the header. If according to the time the entry is stale, but not ancient, set the STALE attribute. If according to the time it is ancient, set the ANCIENT attribute. We could mark the header stale or ancient here, but that requires locking, so that's why we only compare the current time against the rdh_ttl. Adjust the test to check the dump-db before querying for data. In the dumped file the entry should be marked as stale, despite no cache lookup happened since the initial query.	2021-04-13 09:48:20 +02:00
Matthijs Mekking	2a5e0232ed	Fix nonsensical stale TTL values in cache dump When introducing change 5149, "rndc dumpdb" started to print a line above a stale RRset, indicating how long the data will be retained. At that time, I thought it should also be possible to load a cache from file. But if a TTL has a value of 0 (because it is stale), stale entries wouldn't be loaded from file. So, I added the 'max-stale-ttl' to TTL values, and adjusted the $DATE accordingly. Since we actually don't have a "load cache from file" feature, this is premature and is causing confusion among operators. This commit changes the 'max-stale-ttl' adjustments. A check in the serve-stale system test is added for a non-stale RRset (longttl.example) to make sure the TTL in cache is sensible. Also, the comment above stale RRsets could have nonsensical values. A possible reason why this may happen is when the RRset was marked as stale but the 'max-stale-ttl' has passed (and is actually an RRset awaiting cleanup). This would lead to the "will be retained" value to be negative (but since it is stored in a uint32_t, you would get a nonsensical value, e.g. 4294362497). To mitigate against this, we now also check if the header is not ancient. In addition we check if the stale_ttl would be negative, and if so we set it to 0. Most likely this will not happen because the header would already have been marked ancient, but there is a possible race condition where the 'rdh_ttl + serve_stale_ttl' has passed, but the header has not been checked for staleness.	2021-04-13 09:48:20 +02:00
Mark Andrews	1941ce99d4	Merge branch '2622-command-line-option-l-not-shown-with-usage-message' into 'main' Resolve "Command-line option -L not shown with usage message" Closes #2622 See merge request isc-projects/bind9!4881	2021-04-13 01:33:28 +00:00
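
The NSEC(3) TTL rule described in commit 9af8caa733 (use the minimum of the SOA MINIMUM field and the SOA TTL) can be sketched as below. This is only an illustration of the rule as stated in the commit message, not the code from lib/dns/zone.c; the function name `nsec_ttl` and its parameters are hypothetical.

```c
#include <stdint.h>

/*
 * Hypothetical helper (not a BIND 9 function): pick the TTL for
 * NSEC/NSEC3 records as the smaller of the SOA record's own TTL
 * and the MINIMUM field carried inside the SOA rdata.
 */
static uint32_t
nsec_ttl(uint32_t soa_ttl, uint32_t soa_minimum) {
	return (soa_ttl < soa_minimum) ? soa_ttl : soa_minimum;
}
```

With the values mentioned in that commit (SOA TTL 600, SOA MINIMUM 3600), the sketch yields 600, which matches the adjusted test expectation.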

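The unsigned wrap-around described in commit 2a5e0232ed can be illustrated with a small sketch. It assumes, as the commit message suggests, that the remaining retention time is computed from `rdh_ttl + serve_stale_ttl` relative to the current time; the function names and the three-argument form are hypothetical and are not taken from the BIND 9 source.

```c
#include <stdint.h>

/*
 * Naive version: plain uint32_t arithmetic. Once the retention
 * window has passed, the subtraction wraps around and produces a
 * very large bogus value instead of a negative number.
 */
static uint32_t
naive_stale_ttl(uint32_t rdh_ttl, uint32_t serve_stale_ttl, uint32_t now) {
	return rdh_ttl + serve_stale_ttl - now;
}

/*
 * Clamped version (hypothetical sketch): do the arithmetic in 64 bits
 * and return 0 when the retention window has already passed.
 */
static uint32_t
clamped_stale_ttl(uint32_t rdh_ttl, uint32_t serve_stale_ttl, uint32_t now) {
	uint64_t expiry = (uint64_t)rdh_ttl + serve_stale_ttl;

	if (expiry <= now) {
		return 0; /* window has passed; nothing left to retain */
	}
	if (expiry - now > UINT32_MAX) {
		return UINT32_MAX; /* keep the result representable */
	}
	return (uint32_t)(expiry - now);
}
```

For example, with `rdh_ttl = 1000`, `serve_stale_ttl = 3600` and `now = 5000`, the naive version wraps to 4294966896 while the clamped version returns 0.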

33673 Commits