bind9

Author	SHA1	Message	Date
Witold Kręcicki	38b78f59a0	Add DoT support to bind Parse the configuration of tls objects into SSL_CTX* objects. Listen on DoT if 'tls' option is setup in listen-on directive. Use DoT/DoH ports for DoT/DoH.	2020-11-10 14:16:55 +01:00
Evan Hunt	8ed005f924	add parser support for TLS configuration options This commit adds stub parser support and tests for: - "tls" statement, specifying key and cert. - an optional "tls" keyvalue in listen-on statements for DoT configuration. Documentation for these options has also been added to the ARM, but needs further work.	2020-11-10 14:16:49 +01:00
Evan Hunt	e011521ef1	address some possible shutdown races in xfrin there were two failures during observed in testing, both occurring when 'rndc halt' was run rather than 'rndc stop' - the latter dumps zone contents to disk and presumably introduced enough delay to prevent the races: - a failure when the zone was shut down and called dns_xfrin_detach() before the xfrin had finished connecting; the connect timeout terminated without detaching its handle - a failure when the tcpdns socket timer fired after the outerhandle had already been cleared. this commit incidentally addresses a failure observed in mutexatomic due to a variable having been initialized incorrectly.	2020-11-09 12:33:37 -08:00
Ondřej Surý	127ba7e930	Add libssl libraries to Windows build This commit extends the perl Configure script to also check for libssl in addition to libcrypto and change the vcxproj source files to link with both libcrypto and libssl.	2020-11-09 16:00:28 +01:00
Evan Hunt	49d53a4aa9	use netmgr for xfrin Use isc_nm_tcpdnsconnect() in xfrin.c for zone transfers.	2020-11-09 13:45:43 +01:00
Ondřej Surý	38f34c266d	Fix possible NULL dereference in cd->dlz_destroy() If the call to cd->dlz_create() in dlopen_dlz_create() fails, cd->dbdata may be NULL when dlopen_dlz_destroy() gets called in the cleanup path and passing NULL to the cd->dlz_destroy() callback may cause a NULL dereference. Ensure that does not happen by checking whether cd->dbdata is non-NULL before calling the cd->dlz_destroy() callback.	2020-10-28 15:48:58 +01:00
Ondřej Surý	37b9511ce1	Use libuv's shared library handling capabilities While libltdl is a feature-rich library, BIND 9 code only uses its basic capabilities, which are also provided by libuv and which BIND 9 already uses for other purposes. As libuv's cross-platform shared library handling interface is modeled after the POSIX dlopen() interface, converting code using the latter to the former is simple. Replace libltdl function calls with their libuv counterparts, refactoring the code as necessary. Remove all use of libltdl from the BIND 9 source tree.	2020-10-28 15:48:58 +01:00
Ondřej Surý	e2436159ab	Refactor the cleanup code in lt_dl code The cleanup code that would clean the object after plugin/dlz/dyndb loading has failed was duplicating the destructor for the object, so instead of the extra code, we just use the destructor instead.	2020-10-28 15:48:58 +01:00
Ondřej Surý	4e9a58a3e6	Unify lt_dlopen() error handling Make sure an error gets logged when any lt_dlopen() call in the source tree fails. Also make sure that NULL values returned by lt_dlerror() are replaced with a generic error message to prevent passing NULL as an argument for the %s format specifier.	2020-10-28 15:48:58 +01:00
Ondřej Surý	0f49b02fc5	Remove redundant lt_dlerror() calls The redundant lt_dlerror() calls were taken from the examples to clean any previous errors from lt_dl...() calls. However upon code inspection, it was discovered there are no such paths that could cause the lt_dlerror() to return spurious error messages.	2020-10-28 15:48:58 +01:00
Ondřej Surý	64e56a9704	Postpone the isc_app_shutdown() after rndc response has been sent When `rndc stop` is received, the isc_app_shutdown() was being called before response to the rndc client has been sent; as the isc_app_shutdown() also tears down the netmgr, the message was never sent and rndc would complain about connection being interrupted in the middle of the transaction. We now postpone the shutdown after the rndc response has been sent.	2020-10-22 11:46:58 -07:00
Ondřej Surý	f7c82e406e	Fix the isc_nm_closedown() to actually close the pending connections 1. The isc__nm_tcp_send() and isc__nm_tcp_read() was not checking whether the socket was still alive and scheduling reads/sends on closed socket. 2. The isc_nm_read(), isc_nm_send() and isc_nm_resumeread() have been changed to always return the error conditions via the callbacks, so they always succeed. This applies to all protocols (UDP, TCP and TCPDNS).	2020-10-22 11:37:16 -07:00
Michal Nowak	1f6f7ccad6	Drop unused portlist code	2020-10-22 13:11:16 +02:00
Mark Andrews	402ac79833	Fix the data race on shutdown/reconfig in control channel The controllistener could be freed before the event posted by isc_nm_stoplistening() has been processed. This commit adds a reference counter to the controllistener to determine when to free the listener.	2020-10-07 18:24:25 +11:00
Ondřej Surý	bb990030d3	Simplify the EDNS buffer size logic for DNS Flag Day 2020 The DNS Flag Day 2020 aims to remove the IP fragmentation problem from the UDP DNS communication. In this commit, we implement the required changes and simplify the logic for picking the EDNS Buffer Size. 1. The defaults for `edns-udp-size`, `max-udp-size` and `nocookie-udp-size` have been changed to `1232` (the value picked by DNS Flag Day 2020). 2. The probing heuristics that would try 512->4096->1432->1232 buffer sizes has been removed and the resolver will always use just the `edns-udp-size` value. 3. Instead of just disabling the PMTUD mechanism on the UDP sockets, we now set IP_DONTFRAG (IPV6_DONTFRAG) flag. That means that the UDP packets won't get ever fragmented. If the ICMP packets are lost the UDP will just timeout and eventually be retried over TCP.	2020-10-05 16:21:21 +02:00
Matthijs Mekking	70d1ec432f	Use explicit result codes for 'rndc dnssec' cmd It is better to add new result codes than to overload existing codes.	2020-10-05 10:53:46 +02:00
Matthijs Mekking	e826facadb	Add rndc dnssec -rollover command This command is similar in arguments as -checkds so refactor the 'named_server_dnssec' function accordingly. The only difference are that: - It does not take a "publish" or "withdrawn" argument. - It requires the key id to be set (add a check to make sure). Add tests that will trigger rollover immediately and one that schedules a test in the future.	2020-10-05 10:53:45 +02:00
Mark Andrews	bea1326cdc	Lock access to listener->connections as it is accessed from multiple threads with libuv. WARNING: ThreadSanitizer: data race Write of size 8 at 0x000000000001 by thread T1: #0 conn_reset bin/named/controlconf.c:574 #1 isc_nmhandle_detach netmgr/netmgr.c:1257 #2 isc__nm_uvreq_put netmgr/netmgr.c:1389 #3 tcp_send_cb netmgr/tcp.c:1030 #4 <null> <null> #5 <null> <null> Previous read of size 8 at 0x000000000001 by thread T2: #0 conn_reset bin/named/controlconf.c:574 #1 isc_nmhandle_detach netmgr/netmgr.c:1257 #2 control_recvmessage bin/named/controlconf.c:556 #3 recv_data lib/isccc/ccmsg.c:110 #4 isc__nm_tcp_shutdown netmgr/tcp.c:1161 #5 shutdown_walk_cb netmgr/netmgr.c:1511 #6 uv_walk <null> #7 process_queue netmgr/netmgr.c:656 #8 process_normal_queue netmgr/netmgr.c:582 #9 process_queues netmgr/netmgr.c:590 #10 async_cb netmgr/netmgr.c:548 #11 <null> <null> #12 <null> <null> Location is heap block of size 265 at 0x000000000017 allocated by thread T3: #0 malloc <null> #1 default_memalloc lib/isc/mem.c:713 #2 mem_get lib/isc/mem.c:622 #3 isc___mem_get lib/isc/mem.c:1044 #4 isc__mem_get lib/isc/mem.c:2432 #5 add_listener bin/named/controlconf.c:1127 #6 named_controls_configure bin/named/controlconf.c:1324 #7 load_configuration bin/named/server.c:9181 #8 run_server bin/named/server.c:9819 #9 dispatch lib/isc/task.c:1152 #10 run lib/isc/task.c:1344 #11 <null> <null> Thread T1 (running) created by main thread at: #0 pthread_create <null> #1 isc_thread_create pthreads/thread.c:73 #2 isc_nm_start netmgr/netmgr.c:232 #3 create_managers bin/named/main.c:909 #4 setup bin/named/main.c:1223 #5 main bin/named/main.c:1523 Thread T2 (running) created by main thread at: #0 pthread_create <null> #1 isc_thread_create pthreads/thread.c:73 #2 isc_nm_start netmgr/netmgr.c:232 #3 create_managers bin/named/main.c:909 #4 setup bin/named/main.c:1223 #5 main bin/named/main.c:1523 Thread T3 (running) created by main thread at: #0 pthread_create <null> #1 isc_thread_create pthreads/thread.c:73 #2 isc_taskmgr_create lib/isc/task.c:1434 #3 create_managers bin/named/main.c:915 #4 setup bin/named/main.c:1223 #5 main bin/named/main.c:1523 SUMMARY: ThreadSanitizer: data race bin/named/controlconf.c:574 in conn_reset	2020-10-01 15:18:59 +10:00
Mark Andrews	b00ba7ac94	make (named_server_t).reload_status atomic WARNING: ThreadSanitizer: data race Write of size 4 at 0x000000000001 by thread T1: #0 view_loaded bin/named/server.c:9678:25 #1 call_loaddone lib/dns/zt.c:308:3 #2 doneloading lib/dns/zt.c:582:3 #3 zone_asyncload lib/dns/zone.c:2322:3 #4 dispatch lib/isc/task.c:1152:7 #5 run lib/isc/task.c:1344:2 Previous read of size 4 at 0x000000000001 by thread T2: #0 named_server_status bin/named/server.c:11903:14 #1 named_control_docommand bin/named/control.c:272:12 #2 control_command bin/named/controlconf.c:390:17 #3 dispatch lib/isc/task.c:1152:7 #4 run lib/isc/task.c:1344:2 Location is heap block of size 409 at 0x000000000011 allocated by main thread: #0 malloc <null> #1 default_memalloc lib/isc/mem.c:713:8 #2 mem_get lib/isc/mem.c:622:8 #3 mem_allocateunlocked lib/isc/mem.c:1268:8 #4 isc___mem_allocate lib/isc/mem.c:1288:7 #5 isc__mem_allocate lib/isc/mem.c:2453:10 #6 isc___mem_get lib/isc/mem.c:1037:11 #7 isc__mem_get lib/isc/mem.c:2432:10 #8 named_server_create bin/named/server.c:9978:27 #9 setup bin/named/main.c:1256:2 #10 main bin/named/main.c:1523:2 Thread T1 (running) created by main thread at: #0 pthread_create <null> #1 isc_thread_create lib/isc/pthreads/thread.c:73:8 #2 isc_taskmgr_create lib/isc/task.c:1434:3 #3 create_managers bin/named/main.c:915:11 #4 setup bin/named/main.c:1223:11 #5 main bin/named/main.c:1523:2 Thread T2 (running) created by main thread at: #0 pthread_create <null> #1 isc_thread_create lib/isc/pthreads/thread.c:73:8 #2 isc_taskmgr_create lib/isc/task.c:1434:3 #3 create_managers bin/named/main.c:915:11 #4 setup bin/named/main.c:1223:11 #5 main bin/named/main.c:1523:2 SUMMARY: ThreadSanitizer: data race bin/named/server.c:9678:25 in view_loaded	2020-09-30 14:19:09 +00:00
Matthijs Mekking	8beda7d2ea	Add -expired flag to rndc dumpdb command This flag is the same as -cache, but will use a different style format that will also print expired entries (awaiting cleanup) from the cache.	2020-09-23 16:08:29 +02:00
Mark Andrews	3c4b68af7c	Break lock order loop by sending TAT in an event The dotat() function has been changed to send the TAT query asynchronously, so there's no lock order loop because we initialize the data first and then we schedule the TAT send to happen asynchronously. This breaks following lock-order loops: zone->lock (dns_zone_setviewcommit) while holding view->lock (dns_view_setviewcommit) keytable->lock (dns_keytable_find) while holding zone->lock (zone_asyncload) view->lock (dns_view_findzonecut) while holding keytable->lock (dns_keytable_forall)	2020-09-22 12:33:58 +00:00
Ondřej Surý	ee40b96327	Remove .listener member of controlistener struct In the new netmgr code, the .listener member was mostly functionally only duplicating the .exiting member and was unneeded. This also resolves following ThreadSanitizer (harmless) warning: WARNING: ThreadSanitizer: data race Write of size 1 at 0x000000000001 by thread T1: #0 control_senddone bin/named/controlconf.c:257:22 #1 tcp_send_cb lib/isc/netmgr/tcp.c:1027:2 #2 uv__write_callbacks /home/ondrej/Projects/tsan/libuv/src/unix/stream.c:953:7 #3 uv__stream_io /home/ondrej/Projects/tsan/libuv/src/unix/stream.c:1330:5 #4 uv__run_pending /home/ondrej/Projects/tsan/libuv/src/unix/core.c:812:5 #5 uv_run /home/ondrej/Projects/tsan/libuv/src/unix/core.c:377:19 #6 nm_thread lib/isc/netmgr/netmgr.c:500:11 Previous write of size 1 at 0x000000000001 by thread T2: #0 control_senddone bin/named/controlconf.c:257:22 #1 tcp_send_cb lib/isc/netmgr/tcp.c:1027:2 #2 uv__write_callbacks /home/ondrej/Projects/tsan/libuv/src/unix/stream.c:953:7 #3 uv__stream_io /home/ondrej/Projects/tsan/libuv/src/unix/stream.c:1330:5 #4 uv__run_pending /home/ondrej/Projects/tsan/libuv/src/unix/core.c:812:5 #5 uv_run /home/ondrej/Projects/tsan/libuv/src/unix/core.c:377:19 #6 nm_thread lib/isc/netmgr/netmgr.c:500:11 Location is heap block of size 265 at 0x000000000009 allocated by thread T3: #0 malloc <null> #1 default_memalloc lib/isc/mem.c:713:8 #2 mem_get lib/isc/mem.c:622:8 #3 isc___mem_get lib/isc/mem.c:1044:9 #4 isc__mem_get lib/isc/mem.c:2432:10 #5 add_listener bin/named/controlconf.c:1133:13 #6 named_controls_configure bin/named/controlconf.c:1331:6 #7 load_configuration bin/named/server.c:9133:2 #8 run_server bin/named/server.c:9771:2 #9 dispatch lib/isc/task.c:1152:7 #10 run lib/isc/task.c:1344:2 Thread T2 (running) created by main thread at: #0 pthread_create <null> #1 isc_thread_create lib/isc/pthreads/thread.c:73:8 #2 isc_nm_start lib/isc/netmgr/netmgr.c:223:3 #3 create_managers bin/named/main.c:909:15 #4 setup bin/named/main.c:1223:11 #5 main bin/named/main.c:1523:2 Thread T2 (running) created by main thread at: #0 pthread_create <null> #1 isc_thread_create lib/isc/pthreads/thread.c:73:8 #2 isc_nm_start lib/isc/netmgr/netmgr.c:223:3 #3 create_managers bin/named/main.c:909:15 #4 setup bin/named/main.c:1223:11 #5 main bin/named/main.c:1523:2 Thread T3 (running) created by main thread at: #0 pthread_create <null> #1 isc_thread_create lib/isc/pthreads/thread.c:73:8 #2 isc_taskmgr_create lib/isc/task.c:1434:3 #3 create_managers bin/named/main.c:915:11 #4 setup bin/named/main.c:1223:11 #5 main bin/named/main.c:1523:2 SUMMARY: ThreadSanitizer: data race bin/named/controlconf.c:257:22 in control_senddone	2020-09-21 10:21:04 +02:00
Mark Andrews	631617d4ec	make controls->shuttingdown an atomic_bool	2020-09-21 12:48:10 +10:00
Mark Andrews	0450acc1b6	Lock access to control->symtab to prevent data race WARNING: ThreadSanitizer: data race Read of size 8 at 0x000000000001 by thread T1: #0 isccc_symtab_foreach lib/isccc/symtab.c:277:14 #1 isccc_cc_cleansymtab lib/isccc/cc.c:954:2 #2 control_recvmessage bin/named/controlconf.c:477:2 #3 recv_data lib/isccc/ccmsg.c:110:2 #4 read_cb lib/isc/netmgr/tcp.c:769:4 #5 <null> <null> Previous write of size 8 at 0x000000000001 by thread T2: #0 isccc_symtab_define lib/isccc/symtab.c:242:2 #1 isccc_cc_checkdup lib/isccc/cc.c:1026:11 #2 control_recvmessage bin/named/controlconf.c:478:11 #3 recv_data lib/isccc/ccmsg.c:110:2 #4 read_cb lib/isc/netmgr/tcp.c:769:4 #5 <null> <null> Location is heap block of size 190352 at 0x000000000011 allocated by main thread: #0 malloc <null> #1 isccc_symtab_create lib/isccc/symtab.c:76:18 #2 isccc_cc_createsymtab lib/isccc/cc.c:948:10 #3 named_controls_create bin/named/controlconf.c:1483:11 #4 named_server_create bin/named/server.c:10057:2 #5 setup bin/named/main.c:1256:2 #6 main bin/named/main.c:1523:2 Thread T1 (running) created by main thread at: #0 pthread_create <null> #1 isc_thread_create lib/isc/pthreads/thread.c:73:8 #2 isc_nm_start lib/isc/netmgr/netmgr.c:215:3 #3 create_managers bin/named/main.c:909:15 #4 setup bin/named/main.c:1223:11 #5 main bin/named/main.c:1523:2 Thread T2 (running) created by main thread at: #0 pthread_create <null> #1 isc_thread_create lib/isc/pthreads/thread.c:73:8 #2 isc_nm_start lib/isc/netmgr/netmgr.c:215:3 #3 create_managers bin/named/main.c:909:15 #4 setup bin/named/main.c:1223:11 #5 main bin/named/main.c:1523:2 SUMMARY: ThreadSanitizer: data race lib/isccc/symtab.c:277:14 in isccc_symtab_foreach	2020-09-17 18:51:42 +10:00
Mark Andrews	d988383b4a	control_respond fails to detach from cmdhandle on error	2020-09-17 06:32:41 +00:00
Mark Andrews	c5c2a4820b	Cleanup connection before detaching	2020-09-17 15:18:27 +10:00
Michał Kępień	5ae33351f2	Deprecate the "glue-cache" option No issues with the glue cache feature have been reported since its introduction in BIND 9.12. As the rationale for introducing the "glue-cache" option was to have a safety switch readily available in case the glue cache turns out to cause problems, it is time to deprecate the option. Glue cache will be permanently enabled in a future release, at which point the "glue-cache" option will be made obsolete.	2020-09-16 11:18:07 +02:00
Evan Hunt	dcee985b7f	update all copyright headers to eliminate the typo	2020-09-14 16:20:40 -07:00
Evan Hunt	57b4dde974	change from isc_nmhandle_ref/unref to isc_nmhandle attach/detach Attaching and detaching handle pointers will make it easier to determine where and why reference counting errors have occurred. A handle needs to be referenced more than once when multiple asynchronous operations are in flight, so callers must now maintain multiple handle pointers for each pending operation. For example, ns_client objects now contain: - reqhandle: held while waiting for a request callback (query, notify, update) - sendhandle: held while waiting for a send callback - fetchhandle: held while waiting for a recursive fetch to complete - updatehandle: held while waiting for an update-forwarding task to complete control channel connection objects now contain: - readhandle: held while waiting for a read callback - sendhandle: held while waiting for a send callback - cmdhandle: held while an rndc command is running httpd connections contain: - readhandle: held while waiting for a read callback - sendhandle: held while waiting for a send callback	2020-09-11 12:17:57 -07:00
Mark Andrews	9b445f33e2	Defer read of zl->server and zl->reconfig until the reference counter has gone to zero and there is no longer a possibility of changes in other threads.	2020-09-09 13:58:31 +10:00
Michał Kępień	9ac1f6a9bc	Add "-T maxcachesize=..." command line option An implicit default of "max-cache-size 90%;" may cause memory use issues on hosts which run numerous named instances in parallel (e.g. GitLab CI runners) due to the cache RBT hash table now being pre-allocated [1] at startup. Add a new command line option, "-T maxcachesize=...", to allow the default value of "max-cache-size" to be overridden at runtime. When this new option is in effect, it overrides any other "max-cache-size" setting in the configuration, either implicit or explicit. This approach was chosen because it is arguably the simplest one to implement. The following alternative approaches to solving this problem were considered and ultimately rejected (after it was decided they were not worth the extra code complexity): - adding the same command line option, but making explicit configuration statements have priority over it, - adding a build-time option that allows the implicit default of "max-cache-size 90%;" to be overridden. [1] see commit `e24bc324b4`	2020-08-31 13:15:33 +02:00
Evan Hunt	d7362ff16d	Merge tag 'v9_17_4' into main BIND 9.17.4	2020-08-20 12:05:01 -07:00
Matthijs Mekking	46fcd927e7	rndc dnssec -checkds set algorithm In the rare case that you have multiple keys acting as KSK and that have the same keytag, you can now set the algorithm when calling '-checkds'.	2020-08-07 11:26:09 +02:00
Matthijs Mekking	a25f49f153	Make 'parent-registration-delay' obsolete With the introduction of 'checkds', the 'parent-registration-delay' option becomes obsolete.	2020-08-07 11:26:09 +02:00
Matthijs Mekking	04d8fc0143	Implement 'rndc dnssec -checkds' Add a new 'rndc' command 'dnssec -checkds' that allows the user to signal named that a new DS record has been seen published in the parent, or that an existing DS record has been withdrawn from the parent. Upon the 'checkds' request, 'named' will write out the new state for the key, updating the 'DSPublish' or 'DSRemoved' timing metadata. This replaces the "parent-registration-delay" configuration option, this was unreliable because it was purely time based (if the user did not actually submit the new DS to the parent for example, this could result in an invalid DNSSEC state). Because we cannot rely on the parent registration delay for state transition, we need to replace it with a different guard. Instead, if a key wants its DS state to be moved to RUMOURED, the "DSPublish" time must be set and must not be in the future. If a key wants its DS state to be moved to UNRETENTIVE, the "DSRemoved" time must be set and must not be in the future. By default, with '-checkds' you set the time that the DS has been published or withdrawn to now, but you can set a different time with '-when'. If there is only one KSK for the zone, that key has its DS state moved to RUMOURED. If there are multiple keys for the zone, specify the right key with '-key'.	2020-08-07 11:26:09 +02:00
Mark Andrews	952955aa4c	Update-policy 'subdomain' was incorrectly treated as 'zonesub' resulting in names outside the specified subdomain having the wrong restrictions for the given key.	2020-08-05 15:54:50 +02:00
Ondřej Surý	dd62275152	Add CHANGES and release notes for GL #1712 and GL #1829	2020-08-04 10:51:09 +02:00
Ondřej Surý	ce53db34d6	Add stale-cache-enable option and disable serve-stable by default The current serve-stale implementation in BIND 9 stores all received records in the cache for a max-stale-ttl interval (default 12 hours). This allows DNS operators to turn the serve-stale answers in an event of large authoritative DNS outage. The caching of the stale answers needs to be enabled before the outage happens or the feature would be otherwise useless. The negative consequence of the default setting is the inevitable cache-bloat that happens for every and each DNS operator running named. In this MR, a new configuration option `stale-cache-enable` is introduced that allows the operators to selectively enable or disable the serve-stale feature of BIND 9 based on their decision. The newly introduced option has been disabled by default, e.g. serve-stale is disabled in the default configuration and has to be enabled if required.	2020-08-04 10:50:31 +02:00
Mark Andrews	bde5c7632a	Always check the return from isc_refcount_decrement. Created isc_refcount_decrement_expect macro to test conditionally the return value to ensure it is in expected range. Converted unchecked isc_refcount_decrement to use isc_refcount_decrement_expect. Converted INSIST(isc_refcount_decrement()...) to isc_refcount_decrement_expect.	2020-07-31 10:15:44 +10:00
Evan Hunt	1036338a10	report libuv version string in `named -V`	2020-07-28 02:41:39 +00:00
Petr Menšík	c5e7152cf0	Prevent crash on dst initialization failure server might be created, but not yet fully initialized, when fatal function is called. Check both server and task before attaching exclusive task.	2020-07-23 00:31:52 +00:00
Evan Hunt	69c1ee1ce9	rewrite statschannel to use netmgr modify isc_httpd to use the network manager instead of the isc_socket API. also cleaned up bin/named/statschannel.c to use CHECK.	2020-07-15 22:35:07 -07:00
Tony Finch	030674b2a3	Fix re-signing when `sig-validity-interval` has two arguments Since October 2019 I have had complaints from `dnssec-cds` reporting that the signatures on some of my test zones had expired. These were zones signed by BIND 9.15 or 9.17, with a DNSKEY TTL of 24h and `sig-validity-interval 10 8`. This is the same setup we have used for our production zones since 2015, which is intended to re-sign the zones every 2 days, keeping at least 8 days signature validity. The SOA expire interval is 7 days, so even in the presence of zone transfer problems, no-one should ever see expired signatures. (These timers are a bit too tight to be completely correct, because I should have increased the expiry timers when I increased the DNSKEY TTLs from 1h to 24h. But that should only matter when zone transfers are broken, which was not the case for the error reports that led to this patch.) For example, this morning my test zone contained: dev.dns.cam.ac.uk. 86400 IN RRSIG DNSKEY 13 5 86400 ( 20200701221418 20200621213022 ...) But one of my resolvers had cached: dev.dns.cam.ac.uk. 21424 IN RRSIG DNSKEY 13 5 86400 ( 20200622063022 20200612061136 ...) This TTL was captured at 20200622105807 so the resolver cached the RRset 64976 seconds previously (18h02m56s), at 20200621165511 only about 12h before expiry. The other symptom of this error was incorrect `resign` times in the output from `rndc zonestatus`. For example, I have configured a test zone zone fast.dotat.at { file "../u/z/fast.dotat.at"; type primary; auto-dnssec maintain; sig-validity-interval 500 499; }; The zone is reset to a minimal zone containing only SOA and NS records, and when `named` starts it loads and signs the zone. After that, `rndc zonestatus` reports: next resign node: fast.dotat.at/NS next resign time: Fri, 28 May 2021 12:48:47 GMT The resign time should be within the next 24h, but instead it is near the signature expiry time, which the RRSIG(NS) says is 20210618074847. (Note 499 hours is a bit more than 20 days.) May/June 2021 is less than 500 days from now because expiry time jitter is applied to the NS records. Using this test I bisected this bug to `09990672d` which contained a mistake leading to the resigning interval always being calculated in hours, when days are expected. This bug only occurs for configurations that use the two-argument form of `sig-validity-interval`.	2020-07-14 10:57:43 +10:00
Evan Hunt	29dcdeba1b	purge pending command events when shutting down When we're shutting the system down via "rndc stop" or "rndc halt", or reconfiguring the control channel, there are potential shutdown races between the server task and network manager. These are adressed by: - purging any pending command tasks when shutting down the control channel - adding an extra handle reference before the command handler to ensure the handle can't be deleted out from under us before calling command_respond()	2020-07-13 13:17:08 -07:00
Evan Hunt	45ab0603eb	use an isc_task to execute rndc commands - using an isc_task to execute all rndc functions makes it relatively simple for them to acquire task exclusive mode when needed - control_recvmessage() has been separated into two functions, control_recvmessage() and control_respond(). the respond function can be called immediately from control_recvmessage() when processing a nonce, or it can be called after returning from the task event that ran the rndc command function.	2020-07-13 13:16:53 -07:00
Evan Hunt	3551d3ffd2	convert rndc and control channel to use netmgr - updated libisccc to use netmgr events - updated rndc to use isc_nm_tcpconnect() to establish connections - updated control channel to use isc_nm_listentcp() open issues: - the control channel timeout was previously 60 seconds, but it is now overridden by the TCP idle timeout setting, which defaults to 30 seconds. we should add a function that sets the timeout value for a specific listener socket, instead of always using the global value set in the netmgr. (for the moment, since 30 seconds is a reasonable timeout for the control channel, I'm not prioritizing this.) - the netmgr currently has no support for UNIX-domain sockets; until this is addressed, it will not be possible to configure rndc to use them. we will need to either fix this or document the change in behavior.	2020-07-13 13:16:53 -07:00
Evan Hunt	002c328437	don't use exclusive mode for rndc commands that don't need it "showzone" and "tsig-list" both used exclusive mode unnecessarily; changing this will simplify future refactoring a bit.	2020-07-13 13:12:33 -07:00
Evan Hunt	0580d9cd8c	style cleanup clean up style in rndc and the control channel in preparation for changing them to use the new network manager.	2020-07-13 12:41:04 -07:00
Evan Hunt	ed37c63e2b	make sure new_zone_lock is locked before unlocking it it was possible for the count_newzones() function to try to unlock view->new_zone_lock on return before locking it, which caused a crash on shutdown.	2020-07-13 12:06:26 -07:00
Mark Andrews	d02a14c795	Fallback to built in trust-anchors, managed-keys, or trusted-keys if the bind.keys file cannot be parsed.	2020-07-13 14:12:14 +10:00

1 2 3 4 5 ...

3666 Commits