bind9

Author	SHA1	Message	Date
Michał Kępień	3f25b8e608	Add "-T maxcachesize=..." command line option An implicit default of "max-cache-size 90%;" may cause memory use issues on hosts which run numerous named instances in parallel (e.g. GitLab CI runners) due to the cache RBT hash table now being pre-allocated [1] at startup. Add a new command line option, "-T maxcachesize=...", to allow the default value of "max-cache-size" to be overridden at runtime. When this new option is in effect, it overrides any other "max-cache-size" setting in the configuration, either implicit or explicit. This approach was chosen because it is arguably the simplest one to implement. The following alternative approaches to solving this problem were considered and ultimately rejected (after it was decided they were not worth the extra code complexity): - adding the same command line option, but making explicit configuration statements have priority over it, - adding a build-time option that allows the implicit default of "max-cache-size 90%;" to be overridden. [1] see commit `aa72c31422` (cherry picked from commit `9ac1f6a9bc`)	2020-08-31 23:41:24 +02:00
Evan Hunt	1c7e3c8515	Merge tag 'v9_16_6' into v9_16 BIND 9.16.6	2020-08-20 12:08:57 -07:00
Matthijs Mekking	624f1b9531	rndc dnssec -checkds set algorithm In the rare case that you have multiple keys acting as KSK and that have the same keytag, you can now set the algorithm when calling '-checkds'. (cherry picked from commit `46fcd927e7`)	2020-08-07 13:34:10 +02:00
Matthijs Mekking	4892006a92	Make 'parent-registration-delay' obsolete With the introduction of 'checkds', the 'parent-registration-delay' option becomes obsolete. (cherry picked from commit `a25f49f153`)	2020-08-07 13:30:50 +02:00
Matthijs Mekking	81d0c63ecb	Implement 'rndc dnssec -checkds' Add a new 'rndc' command 'dnssec -checkds' that allows the user to signal named that a new DS record has been seen published in the parent, or that an existing DS record has been withdrawn from the parent. Upon the 'checkds' request, 'named' will write out the new state for the key, updating the 'DSPublish' or 'DSRemoved' timing metadata. This replaces the "parent-registration-delay" configuration option, this was unreliable because it was purely time based (if the user did not actually submit the new DS to the parent for example, this could result in an invalid DNSSEC state). Because we cannot rely on the parent registration delay for state transition, we need to replace it with a different guard. Instead, if a key wants its DS state to be moved to RUMOURED, the "DSPublish" time must be set and must not be in the future. If a key wants its DS state to be moved to UNRETENTIVE, the "DSRemoved" time must be set and must not be in the future. By default, with '-checkds' you set the time that the DS has been published or withdrawn to now, but you can set a different time with '-when'. If there is only one KSK for the zone, that key has its DS state moved to RUMOURED. If there are multiple keys for the zone, specify the right key with '-key'. (cherry picked from commit `04d8fc0143`)	2020-08-07 13:30:19 +02:00
Mark Andrews	7630a64141	Update-policy 'subdomain' was incorrectly treated as 'zonesub' resulting in names outside the specified subdomain having the wrong restrictions for the given key.	2020-08-05 15:55:06 +02:00
Matthijs Mekking	c92de6cb44	stale-cache-enable is enabled by default Because this is a backport, the option should default to keep the serve-stale caching enabled.	2020-08-05 09:09:16 +02:00
Ondřej Surý	f3a7ee87ef	Add CHANGES and release notes for GL #1712 and GL #1829 (cherry picked from commit `dd62275152`)	2020-08-05 09:09:16 +02:00
Ondřej Surý	b48e9ab201	Add stale-cache-enable option and disable serve-stable by default The current serve-stale implementation in BIND 9 stores all received records in the cache for a max-stale-ttl interval (default 12 hours). This allows DNS operators to turn the serve-stale answers in an event of large authoritative DNS outage. The caching of the stale answers needs to be enabled before the outage happens or the feature would be otherwise useless. The negative consequence of the default setting is the inevitable cache-bloat that happens for every and each DNS operator running named. In this MR, a new configuration option `stale-cache-enable` is introduced that allows the operators to selectively enable or disable the serve-stale feature of BIND 9 based on their decision. The newly introduced option has been disabled by default, e.g. serve-stale is disabled in the default configuration and has to be enabled if required. (cherry picked from commit `ce53db34d6`)	2020-08-05 09:09:16 +02:00
Mark Andrews	14fe6e77a7	Always check the return from isc_refcount_decrement. Created isc_refcount_decrement_expect macro to test conditionally the return value to ensure it is in expected range. Converted unchecked isc_refcount_decrement to use isc_refcount_decrement_expect. Converted INSIST(isc_refcount_decrement()...) to isc_refcount_decrement_expect. (cherry picked from commit `bde5c7632a`)	2020-07-31 12:54:47 +10:00
Evan Hunt	bbc739b09b	report libuv version string in `named -V` (cherry picked from commit `1036338a10`)	2020-07-27 19:55:22 -07:00
Petr Menšík	fade143531	Prevent crash on dst initialization failure server might be created, but not yet fully initialized, when fatal function is called. Check both server and task before attaching exclusive task. (cherry picked from commit `c5e7152cf0`)	2020-07-23 11:28:11 +10:00
Tony Finch	31005d61ae	Fix re-signing when `sig-validity-interval` has two arguments Since October 2019 I have had complaints from `dnssec-cds` reporting that the signatures on some of my test zones had expired. These were zones signed by BIND 9.15 or 9.17, with a DNSKEY TTL of 24h and `sig-validity-interval 10 8`. This is the same setup we have used for our production zones since 2015, which is intended to re-sign the zones every 2 days, keeping at least 8 days signature validity. The SOA expire interval is 7 days, so even in the presence of zone transfer problems, no-one should ever see expired signatures. (These timers are a bit too tight to be completely correct, because I should have increased the expiry timers when I increased the DNSKEY TTLs from 1h to 24h. But that should only matter when zone transfers are broken, which was not the case for the error reports that led to this patch.) For example, this morning my test zone contained: dev.dns.cam.ac.uk. 86400 IN RRSIG DNSKEY 13 5 86400 ( 20200701221418 20200621213022 ...) But one of my resolvers had cached: dev.dns.cam.ac.uk. 21424 IN RRSIG DNSKEY 13 5 86400 ( 20200622063022 20200612061136 ...) This TTL was captured at 20200622105807 so the resolver cached the RRset 64976 seconds previously (18h02m56s), at 20200621165511 only about 12h before expiry. The other symptom of this error was incorrect `resign` times in the output from `rndc zonestatus`. For example, I have configured a test zone zone fast.dotat.at { file "../u/z/fast.dotat.at"; type primary; auto-dnssec maintain; sig-validity-interval 500 499; }; The zone is reset to a minimal zone containing only SOA and NS records, and when `named` starts it loads and signs the zone. After that, `rndc zonestatus` reports: next resign node: fast.dotat.at/NS next resign time: Fri, 28 May 2021 12:48:47 GMT The resign time should be within the next 24h, but instead it is near the signature expiry time, which the RRSIG(NS) says is 20210618074847. (Note 499 hours is a bit more than 20 days.) May/June 2021 is less than 500 days from now because expiry time jitter is applied to the NS records. Using this test I bisected this bug to `09990672d` which contained a mistake leading to the resigning interval always being calculated in hours, when days are expected. This bug only occurs for configurations that use the two-argument form of `sig-validity-interval`. (cherry picked from commit `030674b2a3`)	2020-07-14 12:11:42 +10:00
Evan Hunt	fc73dbdc7d	make sure new_zone_lock is locked before unlocking it it was possible for the count_newzones() function to try to unlock view->new_zone_lock on return before locking it, which caused a crash on shutdown. (cherry picked from commit `ed37c63e2b`)	2020-07-13 23:53:14 +00:00
Mark Andrews	0265bd17d5	Fallback to built in trust-anchors, managed-keys, or trusted-keys if the bind.keys file cannot be parsed. (cherry picked from commit `d02a14c795`)	2020-07-13 15:13:50 +10:00
Mark Andrews	86464e6e4b	Remove redundant check for listener being non-NULL (cherry picked from commit `c91dc92410`)	2020-07-13 10:28:34 +10:00
Michał Kępień	0bc4d6cc7a	Fix locking for LMDB 0.9.26 When "rndc reconfig" is run, named first configures a fresh set of views and then tears down the old views. Consider what happens for a single view with LMDB enabled; "envA" is the pointer to the LMDB environment used by the original/old version of the view, "envB" is the pointer to the same LMDB environment used by the new version of that view: 1. mdb_env_open(envA) is called when the view is first created. 2. "rndc reconfig" is called. 3. mdb_env_open(envB) is called for the new instance of the view. 4. mdb_env_close(envA) is called for the old instance of the view. This seems to have worked so far. However, an upstream change [1] in LMDB which will be part of its 0.9.26 release prevents the above sequence of calls from working as intended because the locktable mutexes will now get destroyed by the mdb_env_close() call in step 4 above, causing any subsequent mdb_txn_begin() calls to fail (because all of the above steps are happening within a single named process). Preventing the above scenario from happening would require either redesigning the way we use LMDB in BIND, which is not something we can easily backport, or redesigning the way BIND carries out its reconfiguration process, which would be an even more severe change. To work around the problem, set MDB_NOLOCK when calling mdb_env_open() to stop LMDB from controlling concurrent access to the database and do the necessary locking in named instead. Reuse the view->new_zone_lock mutex for this purpose to prevent the need for modifying struct dns_view (which would necessitate library API version bumps). Drop use of MDB_NOTLS as it is made redundant by MDB_NOLOCK: MDB_NOTLS only affects where LMDB reader locktable slots are stored while MDB_NOLOCK prevents the reader locktable from being used altogether. [1] `2fd44e3251` (cherry picked from commit `53120279b5`)	2020-07-10 11:30:31 +02:00
Matthijs Mekking	293d52341d	Increase "rndc dnssec -status" output size BUFSIZ (512 bytes on Windows) may not be enough to fit the status of a DNSSEC policy and three DNSSEC keys. Set the size of the relevant buffer to a hardcoded value of 4096 bytes, which should be enough for most scenarios. (cherry picked from commit `9347e7db7e`)	2020-07-03 15:13:50 +02:00
Ondřej Surý	7c0fb5e492	Don't continue opening a new rndc connection if we are shutting down Due to lack of synchronization, whenever named was being requested to stop using rndc, controlconf.c module could be trying to access an already released pointer through named_g_server->interfacemgr in a separate thread. The race could only be triggered if named was being shutdown and more rndc connections were ocurring at the same time. This fix correctly checks if the server is shutting down before opening a new rndc connection. (cherry picked from commit `be6cc53ec2`)	2020-07-01 12:52:51 +02:00
Matthijs Mekking	f1b3686cd2	Output rndc dnssec -status Implement the 'rndc dnssec -status' command that will output some information about the key states, such as which policy is used for the zone, what keys are in use, and when rollover is scheduled. Add loose testing in the kasp system test, the actual times are already tested via key file inspection. (cherry picked from commit `19ce9ec1d4`)	2020-07-01 09:57:44 +02:00
Matthijs Mekking	34a9c3f6c9	Implement dummy 'rndc dnssec -status' command Add the code and documentation required to provide DNSSEC signing status through rndc. This does not yet show any useful information, just provide the command that will output some dummy string. (cherry picked from commit `e1ba1bea7c`)	2020-07-01 09:57:44 +02:00
Mark Andrews	4885f0813e	Resize unamebuf[] to avoid warnings about snprintf() not having enough buffer space. Also change named_os_uname() prototype so that it is now returning (const char ) rather than (char ). If uname() is not supported on a UNIX build prepopulate unamebuf[] with "unknown architecture". (cherry picked from commit `4bc3de070f`)	2020-06-25 09:26:22 +10:00
Evan Hunt	dca3658720	"check-names primary" and "check-names secondary" were ignored these keywords were added to the parser as synonyms for "master" and "slave" but were never hooked in to the configuration of named, so they were ignored. this has been fixed and the option is now checked for correctness. (cherry picked from commit `ba31b189b4`)	2020-06-22 14:30:14 +02:00
Mark Andrews	70c27df941	The dsset returned by dns_keynode_dsset needs to be thread safe. - clone keynode->dsset rather than return a pointer so that thread use is independent of each other. - hold a reference to the dsset (keynode) so it can't be deleted while in use. - create a new keynode when removing DS records so that dangling pointers to the deleted records will not occur. - use a rwlock when accessing the rdatalist to prevent instabilities when DS records are added. (cherry picked from commit `e5b2eca1d3`)	2020-06-11 16:09:43 +10:00
Michal Nowak	e39685b579	Fix "array subscript is of type 'char'"	2020-06-04 16:27:43 +02:00
Ondřej Surý	d85b936898	Reduce the default value for max-stale-ttl from 1 week to 12 hours Originally, the default value for max-stale-ttl was 1 week, which could and in some scenarios lead to cache exhaustion on a busy resolvers. Picking the default value will always be juggling between value that's useful (e.g. keeping the already cached records after they have already expired and the upstream name servers are down) and not bloating the cache too much (e.g. keeping everything for a very long time). The new default reflects what we think is a reasonable to time to react on both sides (upstream authoritative and downstream recursive). (cherry picked from commit `13fd3ecfab`)	2020-06-03 10:45:09 +00:00
Evan Hunt	a40d6ec091	generate grammar text correctly in RST format	2020-06-01 09:58:29 +02:00
Evan Hunt	b580eb2fb3	add support for building sphinx documentation	2020-06-01 09:54:46 +02:00
Ondřej Surý	58e20fa746	Convert the documentation to Sphinx documentation format The ARM and the manpages have been converted into Sphinx documentation format. Sphinx uses reStructuredText as its markup language, and many of its strengths come from the power and straightforwardness of reStructuredText and its parsing and translating suite, the Docutils. (cherry picked from commit `9fb6d11abb`)	2020-06-01 09:54:45 +02:00
Witold Kręcicki	4ceddeee78	Add missing isc_mutex_destroy and isc_conditional_destroy calls. While harmless on Linux, missing isc_{mutex,conditional}_destroy causes a memory leak on *BSD. Missing calls were added. (cherry picked from commit `a8807d9a7b`)	2020-05-30 07:50:30 +02:00
Evan Hunt	c19a593d3e	change "expr == false" to "!expr" in conditionals (cherry picked from commit `57e54c46e4`)	2020-05-25 17:04:04 -07:00
Evan Hunt	00c816778d	change 'expr == true' to 'expr' in conditionals (cherry picked from commit `68a1c9d679`)	2020-05-25 17:03:59 -07:00
Mark Andrews	63238a6c7a	Add DBC checks to named_checknames_get (cherry picked from commit `20c07da49b`)	2020-05-25 11:52:17 +10:00
Mark Andrews	d02f1ed5b8	Set obj to NULL so INSIST makes sense (cherry picked from commit `ac9e266749`)	2020-05-25 11:52:17 +10:00
Tinderbox User	046f56f5ab	prep 9.16.3	2020-05-19 13:57:36 +02:00
Paul Frieden	838329bcb7	Add the zone timers to the XSL (cherry picked from commit `72ffa194e2`)	2020-05-12 14:12:03 +02:00
Paul Frieden	794c645987	Export zone timers via stats channels (cherry picked from commit `543bab699d`)	2020-05-12 14:12:03 +02:00
Ondřej Surý	af1b56240f	Resolve the overlinking of the system libraries Originally, every library and binaries got linked to everything, which creates unnecessary overlinking. This wasn't as straightforward as it should be as we still support configuration without libtool for 9.16. Couple of smaller issues related to include headers and an issue where sanitizer overload dlopen and dlclose symbols, so we were getting false negatives in the autoconf test.	2020-05-11 09:49:54 +02:00
Michał Kępień	299954d006	Make dnstap work reliably with netmgr The introduction of netmgr doubled the number of threads from which dnstap data may be logged: previously, it could only happen from within taskmgr worker threads; with netmgr, it can happen both from taskmgr worker threads and from network threads. Since the argument passed to fstrm_iothr_options_set_num_input_queues() was not updated to reflect this change, some calls to fstrm_iothr_get_input_queue() can now return NULL, effectively preventing some dnstap data from being logged. Whether this bug is triggered or not depends on thread scheduling order and packet distribution between network threads, but will almost certainly be triggered on any recursive resolver sooner or later. Fix by requesting the correct number of dnstap input queues to be allocated. (cherry picked from commit `77dc091855`)	2020-05-01 16:29:18 +02:00
Mark Andrews	1254db80bb	Propagate first_time to named_os_openfile in generate_session_key. named_os_openfile was being called with switch_user set to true unconditionally leading to log messages about being unable to switch user identity from named when regenerating the key. (cherry picked from commit `071bc29962`)	2020-04-28 17:29:35 +02:00
Ondřej Surý	fe0eca7c20	Don't change effective uid when we already dropped privileges When running on Linux and system capabilities are available, named will drop the extra capabilities before loading the configuration. This led to spurious warnings from `seteuid()` because named already dropped CAP_SETUID and CAP_GETUID capabilities. The fix removes setting the effective uid/gid when capabilities are available, and adds a check that we are running under the user we were requested to run. (cherry picked from commit `6c82e2af92`)	2020-04-28 17:29:35 +02:00
Tinderbox User	f4e6b61b0f	regen v9_16	2020-04-16 23:07:41 +02:00
Matthijs Mekking	6e3654c434	dnssec-policy: to sign inline or not When dnssec-policy was introduced, it implicitly set inline-signing. But DNSSEC maintenance required either inline-signing to be enabled, or a dynamic zone. In other words, not in all cases you want to DNSSEC maintain your zone with inline-signing. Change the behavior and determine whether inline-signing is required: if the zone is dynamic, don't use inline-signing, otherwise implicitly set it. You can also explicitly set inline-signing to yes with dnssec-policy, the restriction that both inline-signing and dnssec-policy cannot be set at the same time is now lifted. However, 'inline-signing no;' on a non-dynamic zone with a dnssec-policy is not possible. (cherry picked from commit `644f0d958a`)	2020-04-16 16:04:28 +02:00
Matthijs Mekking	728112d70c	Replace leftover DNSSEC-KEYS with TRUST-ANCHORS Change 5332 renamed "dnssec-keys" configuration statement to the more descriptive "trust-anchors". Not all occurrences in the documentation had been updated. (cherry picked from commit 7c6dde024155585008e9bfd09c03722d69211d02)	2020-04-16 14:08:04 +02:00
Ondřej Surý	8b84fb4f42	Disable MSB8028 warning All our MSVS Project files share the same intermediate directory. We know that this doesn't cause any problems, so we can just disable the detection in the project files. Example of the warning: warning MSB8028: The intermediate directory (.\Release\) contains files shared from another project (dnssectool.vcxproj). This can lead to incorrect clean and rebuild behavior. (cherry picked from commit `b6c2012d93`)	2020-04-16 07:56:38 +02:00
Ondřej Surý	820b9ba38a	Set WarningLevel to Level1 for Release, treat warnings as errors Our vcxproj files set the WarningLevel to Level3, which is too verbose for a code that needs to be portable. That basically leads to ignoring all the errors that MSVC produces. This commits downgrades the WarningLevel to Level1 and enables treating warnings as errors for Release builds. For the Debug builds the WarningLevel got upgraded to Level4, and treating warnings as errors is explicitly disabled. We should eventually make the code clean of all MSVC warnings, but it's a long way to go for Level4, so it's more reasonable to start at Level1. For reference[1], these are the warning levels as described by MSVC documentation: * /W0 suppresses all warnings. It's equivalent to /w. * /W1 displays level 1 (severe) warnings. /W1 is the default setting in the command-line compiler. * /W2 displays level 1 and level 2 (significant) warnings. * /W3 displays level 1, level 2, and level 3 (production quality) warnings. /W3 is the default setting in the IDE. * /W4 displays level 1, level 2, and level 3 warnings, and all level 4 (informational) warnings that aren't off by default. We recommend that you use this option to provide lint-like warnings. For a new project, it may be best to use /W4 in all compilations. This option helps ensure the fewest possible hard-to-find code defects. * /Wall displays all warnings displayed by /W4 and all other warnings that /W4 doesn't include — for example, warnings that are off by default. * /WX treats all compiler warnings as errors. For a new project, it may be best to use /WX in all compilations; resolving all warnings ensures the fewest possible hard-to-find code defects. 1. https://docs.microsoft.com/en-us/cpp/build/reference/compiler-option-warning-level?view=vs-2019 (cherry picked from commit `789d253e3d`)	2020-04-16 07:55:40 +02:00
Matthijs Mekking	f46187bcaa	Merge if blocks in statschannel.c (cherry picked from commit `1596d3b498`)	2020-04-03 10:04:16 +02:00
Matthijs Mekking	ae19d0f60a	Replace sign operation bool with enum (cherry picked from commit `44b49955e1`)	2020-04-03 10:04:07 +02:00
Matthijs Mekking	f59f446122	Redesign dnssec sign statistics The first attempt to add DNSSEC sign statistics was naive: for each zone we allocated 64K counters, twice. In reality each zone has at most four keys, so the new approach only has room for four keys per zone. If after a rollover more keys have signed the zone, existing keys are rotated out. The DNSSEC sign statistics has three counters per key, so twelve counters per zone. First counter is actually a key id, so it is clear what key contributed to the metrics. The second counter tracks the number of generated signatures, and the third tracks how many of those are refreshes. This means that in the zone structure we no longer need two separate references to DNSSEC sign metrics: both the resign and refresh stats are kept in a single dns_stats structure. Incrementing dnssecsignstats: Whenever a dnssecsignstat is incremented, we look up the key id to see if we already are counting metrics for this key. If so, we update the corresponding operation counter (resign or refresh). If the key is new, store the value in a new counter and increment corresponding counter. If all slots are full, we rotate the keys and overwrite the last slot with the new key. Dumping dnssecsignstats: Dumping dnssecsignstats is no longer a simple wrapper around isc_stats_dump, but uses the same principle. The difference is that rather than dumping the index (key tag) and counter, we have to look up the corresponding counter. (cherry picked from commit `705810d577`)	2020-04-03 10:03:30 +02:00
Ondřej Surý	0fdc09efb6	Fix 'Dereference of null pointer' from scan-build-10 These are mostly false positives, the clang-analyzer FAQ[1] specifies why and how to fix it: > The reason the analyzer often thinks that a pointer can be null is > because the preceding code checked compared it against null. So if you > are absolutely sure that it cannot be null, remove the preceding check > and, preferably, add an assertion as well. The 4 warnings reported are: dnssec-cds.c:781:4: warning: Access to field 'base' results in a dereference of a null pointer (loaded from variable 'buf') isc_buffer_availableregion(buf, &r); ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /builds/isc-projects/bind9/lib/isc/include/isc/buffer.h:996:36: note: expanded from macro 'isc_buffer_availableregion' ^ /builds/isc-projects/bind9/lib/isc/include/isc/buffer.h:821:16: note: expanded from macro 'ISC__BUFFER_AVAILABLEREGION' (_r)->base = isc_buffer_used(_b); \ ^~~~~~~~~~~~~~~~~~~ /builds/isc-projects/bind9/lib/isc/include/isc/buffer.h:152:29: note: expanded from macro 'isc_buffer_used' ((void )((unsigned char )(b)->base + (b)->used)) /d/ ^~~~~~~~~ 1 warning generated. -- byname_test.c:308:34: warning: Access to field 'fwdtable' results in a dereference of a null pointer (loaded from variable 'view') RUNTIME_CHECK(dns_fwdtable_add(view->fwdtable, dns_rootname, ^~~~~~~~~~~~~~ /builds/isc-projects/bind9/lib/isc/include/isc/util.h:318:52: note: expanded from macro 'RUNTIME_CHECK' ^~~~ /builds/isc-projects/bind9/lib/isc/include/isc/error.h:50:21: note: expanded from macro 'ISC_ERROR_RUNTIMECHECK' ((void)(ISC_LIKELY(cond) \|\| \ ^~~~ /builds/isc-projects/bind9/lib/isc/include/isc/likely.h:23:43: note: expanded from macro 'ISC_LIKELY' ^ 1 warning generated. -- ./rndc.c:255:6: warning: Dereference of null pointer (loaded from variable 'host') if (*host == '/') { ^~~~~ 1 warning generated. -- ./main.c:1254:9: warning: Access to field 'sctx' results in a dereference of a null pointer (loaded from variable 'named_g_server') sctx = named_g_server->sctx; ^~~~~~~~~~~~~~~~~~~~ 1 warning generated. References: 1. https://clang-analyzer.llvm.org/faq.html#null_pointer (cherry picked from commit `ddd0d356e5`)	2020-03-25 18:06:29 +01:00

1 2 3 4 5 ...

3610 Commits