bind9

Author	SHA1	Message	Date
Ondřej Surý	f9711481ad	Expire the 0 TTL RRSet quickly rather than using it for serve-stale When a received RRSet has TTL 0, it would be preserved for serve-stale (default `max-stale-cache` is 12 hours) rather than being expired quickly from the cache database. This commit makes sure the RRSet didn't have TTL 0 before marking the entry in the database as "stale". (cherry picked from commit `6ffa2ddae0`)	2020-08-05 09:09:16 +02:00
Mark Andrews	14fe6e77a7	Always check the return from isc_refcount_decrement. Created isc_refcount_decrement_expect macro to test conditionally the return value to ensure it is in expected range. Converted unchecked isc_refcount_decrement to use isc_refcount_decrement_expect. Converted INSIST(isc_refcount_decrement()...) to isc_refcount_decrement_expect. (cherry picked from commit `bde5c7632a`)	2020-07-31 12:54:47 +10:00
Ondřej Surý	0fff3008ac	Change the dns_name hashing to use 32-bit values Change the dns_hash_name() and dns_hash_fullname() functions to use isc_hash32() as the maximum hashtable size in rbt is 0..UINT32_MAX large. (cherry picked from commit `a9182c89a6`)	2020-07-30 11:57:24 +02:00
Ondřej Surý	aa72c31422	Fix the rbt hashtable and grow it when setting max-cache-size There were several problems with rbt hashtable implementation: 1. Our internal hashing function returns uint64_t value, but it was silently truncated to unsigned int in dns_name_hash() and dns_name_fullhash() functions. As the SipHash 2-4 higher bits are more random, we need to use the upper half of the return value. 2. The hashtable implementation in rbt.c was using modulo to pick the slot number for the hash table. This has several problems because modulo is: a) slow, b) oblivious to patterns in the input data. This could lead to very uneven distribution of the hashed data in the hashtable. Combined with the single-linked lists we use, it could really bog down the lookup and removal of the nodes from the rbt tree[a]. The Fibonacci Hashing is much better fit for the hashtable function here. For longer description, read "Fibonacci Hashing: The Optimization that the World Forgot"[b] or just look at the Linux kernel. Also this will make Diego very happy :). 3. The hashtable would rehash every time the number of nodes in the rbt tree would exceed 3 * (hashtable size). The overcommit will make the uneven distribution in the hashtable even worse, but the main problem lies in the rehashing - every time the database grows beyond the limit, each subsequent rehashing will be much slower. The mitigation here is letting the rbt know how big the cache can grow and pre-allocate the hashtable to be big enough to actually never need to rehash. This will consume more memory at the start, but since the size of the hashtable is capped to `1 << 32` (i.e. about 4 billion entries), it will only consume maximum of 32GB of memory for hashtable in the worst case (and max-cache-size would need to be set to more than 4TB). 
Calling the dns_db_adjusthashsize() will also cap the maximum size of the hashtable to the pre-computed number of bits, so it won't try to consume more gigabytes of memory than available for the database. FIXME: What is the average size of the rbt node that gets hashed? I chose the pagesize (4k) as initial value to precompute the size of the hashtable, but the value is based on feeling and not any real data. For future work, there are more places where we use result of the hash value modulo some small number and that would benefit from Fibonacci Hashing to get better distribution. Notes: a. A doubly linked list should be used here to speedup the removal of the entries from the hashtable. b. https://probablydance.com/2018/06/16/fibonacci-hashing-the-optimization-that-the-world-forgot-or-a-better-alternative-to-integer-modulo/ (cherry picked from commit `e24bc324b4`)	2020-07-30 11:57:24 +02:00
Michał Kępień	b6c33087b0	Fix idle timeout for connected TCP sockets When named acting as a resolver connects to an authoritative server over TCP, it sets the idle timeout for that connection to 20 seconds. This fixed timeout was picked back when the default processing timeout for each client query was hardcoded to 30 seconds. Commit `000a8970f8` made this processing timeout configurable through "resolver-query-timeout" and decreased its default value to 10 seconds, but the idle TCP timeout was not adjusted to reflect that change. As a result, with the current defaults in effect, a single hung TCP connection will consistently cause the resolution process for a given query to time out. Set the idle timeout for connected TCP sockets to half of the client query processing timeout configured for a resolver. This allows named to handle hung TCP connections more robustly and prevents the timeout mismatch issue from resurfacing in the future if the default is ever changed again. (cherry picked from commit `953d704bd2`)	2020-07-30 11:16:09 +02:00
Diego Fronza	a8ce7b461c	Fix rpz wildcard name matching Whenever an exact match is found by dns_rbt_findnode(), the highest level node in the chain will not be put into chain->levels[] array, but instead the chain->end pointer will be adjusted to point to that node. Suppose we have the following entries in a rpz zone: example.com CNAME rpz-passthru. *.example.com CNAME rpz-passthru. A query for www.example.com would result in the following chain object returned by dns_rbt_findnode(): chain->level_count = 2 chain->level_matches = 2 chain->levels[0] = . chain->levels[1] = example.com chain->levels[2] = NULL chain->end = www Since exact matches only care for testing rpz set bits, we need to test for rpz wild bits through iterating the nodechain, and that includes testing the rpz wild bits in the highest level node found. In the case of an exact match, chain->levels[chain->level_matches] will be NULL, to address that we must use chain->end as the start point, then iterate over the remaining levels in the chain.	2020-07-27 17:02:16 -03:00
Mark Andrews	b0942c2442	Check walking the hip rendezvous servers. Also fixes extraneous white space at end of record when there are no rendezvous servers. (cherry picked from commit `78db46d746`)	2020-07-24 15:24:49 +10:00
Tinderbox User	b03a635f68	prep 9.16.5	2020-07-15 23:10:55 +02:00
Matthijs Mekking	4dabb688db	Check return value of dst_key_getbool() Fix Coverity CHECKED_RETURN reports for dst_key_getbool(). In most cases we do not really care about its return value, but it is prudent to check it. In one case, where a dst_key_getbool() error should be treated identically as success, cast the return value to void and add a relevant comment. (cherry picked from commit `e645d2ef1e`)	2020-07-14 17:48:21 +02:00
Mark Andrews	e67b7a62d0	Assert tsigout is non-NULL (cherry picked from commit `827746e89b`)	2020-07-13 13:21:12 +10:00
Mark Andrews	12fac1ce70	check returns from inet_pton() (cherry picked from commit `9499adeb5e`)	2020-07-13 11:44:58 +10:00
Michał Kępień	0bc4d6cc7a	Fix locking for LMDB 0.9.26 When "rndc reconfig" is run, named first configures a fresh set of views and then tears down the old views. Consider what happens for a single view with LMDB enabled; "envA" is the pointer to the LMDB environment used by the original/old version of the view, "envB" is the pointer to the same LMDB environment used by the new version of that view: 1. mdb_env_open(envA) is called when the view is first created. 2. "rndc reconfig" is called. 3. mdb_env_open(envB) is called for the new instance of the view. 4. mdb_env_close(envA) is called for the old instance of the view. This seems to have worked so far. However, an upstream change [1] in LMDB which will be part of its 0.9.26 release prevents the above sequence of calls from working as intended because the locktable mutexes will now get destroyed by the mdb_env_close() call in step 4 above, causing any subsequent mdb_txn_begin() calls to fail (because all of the above steps are happening within a single named process). Preventing the above scenario from happening would require either redesigning the way we use LMDB in BIND, which is not something we can easily backport, or redesigning the way BIND carries out its reconfiguration process, which would be an even more severe change. To work around the problem, set MDB_NOLOCK when calling mdb_env_open() to stop LMDB from controlling concurrent access to the database and do the necessary locking in named instead. Reuse the view->new_zone_lock mutex for this purpose to prevent the need for modifying struct dns_view (which would necessitate library API version bumps). Drop use of MDB_NOTLS as it is made redundant by MDB_NOLOCK: MDB_NOTLS only affects where LMDB reader locktable slots are stored while MDB_NOLOCK prevents the reader locktable from being used altogether. [1] `2fd44e3251` (cherry picked from commit `53120279b5`)	2020-07-10 11:30:31 +02:00
Mark Andrews	86681ca6f1	Adjust range limit of unknown meta types (cherry picked from commit `092a159dcd`)	2020-07-08 13:44:47 +10:00
Ondřej Surý	0279cc76a7	Update STALE and ANCIENT header attributes atomically The ThreadSanitizer found a data race when updating the stale header. Instead of trying to acquire the write lock and failing occasionally which would skew the statistics, the dns_rdatasetheader_t.attributes field has been promoted to use stdatomics. Updating the attributes in the mark_header_ancient() and mark_header_stale() now uses the cmpxchg to update the attributes forfeiting the need to hold the write lock on the tree. Please note that mark_header_ancient() still needs to hold the lock because .dirty is being updated in the same go. (cherry picked from commit `81d4230e60`)	2020-07-08 12:01:46 +10:00
Mark Andrews	244ebdfb8c	remove redundant rctx != NULL check (cherry picked from commit `2fa2dbd5fb`)	2020-07-06 10:30:25 +10:00
Witold Kręcicki	000c7d1340	rbtdb: cleanup_dead_nodes should ignore alive nodes on the deadlist (cherry picked from commit `c8f2d55acf`)	2020-07-01 15:35:21 +02:00
Witold Kręcicki	03e583ffa8	Fix assertion failure during startup when the server is under load. When we're coming back from recursion fetch_callback does not accept DNS_R_NXDOMAIN as an rcode - query_gotanswer calls query_nxdomain in which an assertion fails on qctx->is_zone. Yet, under some circumstances, qname minimization will return a DNS_R_NXDOMAIN - when root zone mirror is not yet loaded. The fix changes the DNS_R_NXDOMAIN answer to DNS_R_SERVFAIL.	2020-07-01 12:55:12 +02:00
Matthijs Mekking	9f5a43808f	Fix linking problem for #1612 When a library is examined, an object file within it can be left out of the link if it does not provide symbols that the symbol table needs. Introducing `isc_stdtime_tostring` caused a build failure for `update_test` because it now requires `libisc.a(stdtime.o)` and that also exports the `isc_stdtime_get` symbol, meaning we have a multiple definition error. Add a local version of `isc_stdtime_tostring`, so that the linker will not search for it in available object files.	2020-07-01 10:55:30 +02:00
Matthijs Mekking	f1b3686cd2	Output rndc dnssec -status Implement the 'rndc dnssec -status' command that will output some information about the key states, such as which policy is used for the zone, what keys are in use, and when rollover is scheduled. Add loose testing in the kasp system test, the actual times are already tested via key file inspection. (cherry picked from commit `19ce9ec1d4`)	2020-07-01 09:57:44 +02:00
Matthijs Mekking	7915327aac	Move dst key printtime in separate function I'd like to use the same functionality (pretty print the datetime of keytime metadata) in the 'rndc dnssec -status' command. So it is better that this logic is done in a separate function. Since the stdtime.c code has different files for unix and win32, I think the "#ifdef WIN32" define can be dropped. (cherry picked from commit `9e03f8e8fe`)	2020-07-01 09:57:44 +02:00
Tony Finch	b7f7b8128e	Fix `rndc dnstap -roll N` The `rndc` argument was always overridden by the static configuration, because the logic for handling the number of dnstap files to retain was both backwards and a bit redundant. (cherry picked from commit `7c07129a51`)	2020-06-29 22:30:01 +00:00
Michał Kępień	be35b872fd	Address compilation warnings on FreeBSD 11.4 With Clang 10.0.0 on FreeBSD 11.4, compiling lib/dns/spnego.c triggers the following warnings: spnego.c:361:11: error: converting the result of '<<' to a boolean always evaluates to true [-Werror,-Wtautological-constant-compare] return (GSS_S_DEFECTIVE_TOKEN); ^ /usr/include/gssapi/gssapi.h:423:41: note: expanded from macro 'GSS_S_DEFECTIVE_TOKEN' #define GSS_S_DEFECTIVE_TOKEN (9ul << GSS_C_ROUTINE_ERROR_OFFSET) ^ spnego.c:366:11: error: converting the result of '<<' to a boolean always evaluates to true [-Werror,-Wtautological-constant-compare] return (GSS_S_DEFECTIVE_TOKEN); ^ /usr/include/gssapi/gssapi.h:423:41: note: expanded from macro 'GSS_S_DEFECTIVE_TOKEN' #define GSS_S_DEFECTIVE_TOKEN (9ul << GSS_C_ROUTINE_ERROR_OFFSET) ^ spnego.c:371:12: error: converting the result of '<<' to a boolean always evaluates to true [-Werror,-Wtautological-constant-compare] return (GSS_S_DEFECTIVE_TOKEN); ^ /usr/include/gssapi/gssapi.h:423:41: note: expanded from macro 'GSS_S_DEFECTIVE_TOKEN' #define GSS_S_DEFECTIVE_TOKEN (9ul << GSS_C_ROUTINE_ERROR_OFFSET) ^ spnego.c:376:11: error: converting the result of '<<' to a boolean always evaluates to true [-Werror,-Wtautological-constant-compare] return (GSS_S_DEFECTIVE_TOKEN); ^ /usr/include/gssapi/gssapi.h:423:41: note: expanded from macro 'GSS_S_DEFECTIVE_TOKEN' #define GSS_S_DEFECTIVE_TOKEN (9ul << GSS_C_ROUTINE_ERROR_OFFSET) ^ spnego.c:380:11: error: converting the result of '<<' to a boolean always evaluates to true [-Werror,-Wtautological-constant-compare] return (GSS_S_DEFECTIVE_TOKEN); ^ /usr/include/gssapi/gssapi.h:423:41: note: expanded from macro 'GSS_S_DEFECTIVE_TOKEN' #define GSS_S_DEFECTIVE_TOKEN (9ul << GSS_C_ROUTINE_ERROR_OFFSET) ^ 5 errors generated. Address by replacing all instances of the GSS_S_DEFECTIVE_TOKEN constant with a boolean value. 
Invert the values returned by cmp_gss_type() so that its only call site reads more naturally in the context of the comment preceding it.	2020-06-29 12:03:01 +02:00
Matthijs Mekking	7eed00502f	kasp tests: fix wait for reconfig done The wait until zones are signed after rndc reconfig is broken because the zones are already signed before the reconfig. Fix by having a different way to ensure the signing of the zone is complete. This does require a call to the "wait_for_done_signing" function after each "check_keys" call after the ns6 reconfig. The "wait_for_done_signing" looks for a (newly added) debug log message that named will output if it is done signing with a certain key. (cherry picked from commit `a47192ed5b`)	2020-06-29 08:09:40 +02:00
Evan Hunt	f171017570	append "0" to IPv6 addresses ending in "::" when printing YAML such addresses broke some YAML parsers. (cherry picked from commit `a8baf79e33`)	2020-06-25 18:57:06 -07:00
Mark Andrews	3612f662da	The validator could fail when select_signing_key/get_dst_key failed to select the signing key because the algorithm was not supported and the loop was prematurely aborted. (cherry picked from commit `d475f3aeed`)	2020-06-25 22:42:43 +10:00
Mark Andrews	34a5ad82d6	Address race between zone_maintenance and dns_zone_setview_helper There was a possible NULL dereference due to data race between accessing zone->view and zone->view->adb. (cherry picked from commit `67c8f7329d`)	2020-06-22 12:27:11 +02:00
Mark Andrews	41e38c216d	Add missing #pragma once to <dns/lmdb.h>	2020-06-19 12:12:45 +10:00
Tinderbox User	adab85b815	prep 9.16.4	2020-06-18 10:25:50 +02:00
Mark Andrews	6964a21fa6	Remove INSIST from new_reference RBTDB node can now appear on the deadnodes lists following the changes to decrement_reference in `176b23b6cd` to defer checking of node->down when the tree write lock is not held. The node should be unlinked instead. (cherry picked from commit 569cc155b8680d8ed12db1fabbe20947db24a0f9)	2020-06-18 10:18:42 +02:00
Mark Andrews	06cebcb6b7	val->keynode is no longer needed (cherry picked from commit `ff4fc3f8dc`)	2020-06-11 16:09:43 +10:00
Mark Andrews	70c27df941	The dsset returned by dns_keynode_dsset needs to be thread safe. - clone keynode->dsset rather than return a pointer so that thread use is independent of each other. - hold a reference to the dsset (keynode) so it can't be deleted while in use. - create a new keynode when removing DS records so that dangling pointers to the deleted records will not occur. - use a rwlock when accessing the rdatalist to prevent instabilities when DS records are added. (cherry picked from commit `e5b2eca1d3`)	2020-06-11 16:09:43 +10:00
Witold Kręcicki	c3dcab5f13	Fix a data access race in resolver We were passing client address to dns_resolver_createfetch as a pointer and it was saved as a pointer. The client (with its address) could be gone before the fetch is finished, and in a very odd scenario log_formerr would call isc_sockaddr_format() which first checks if the address family is valid (and at this point it still is), then the sockaddr is cleared, and then isc_netaddr_fromsockaddr is called which fails an assertion as the address family is now invalid. (cherry picked from commit `175c4d9055`)	2020-06-05 18:58:13 -07:00
Mark Andrews	28a940fe69	Add +yaml support for EDE (cherry picked from commit `0ec77c2b92`)	2020-06-05 10:50:58 +10:00
Mark Andrews	b17f6eba6a	Reject primary zones with a DS record at the zone apex. DS records only belong at delegation points and if present at the zone apex are invariably the result of administrative errors. Additionally they can't be queried for with modern resolvers as the parent servers will be queried. (cherry picked from commit `35a58d30c9`)	2020-06-04 16:06:07 +02:00
Matthijs Mekking	a17dcccf7d	Retire predecessor when creating successor When creating the successor, the current active key (predecessor) should change its goal state to HIDDEN. Also add two useful debug logs in the keymgr_key_rollover function. (cherry picked from commit `e71d60299f`)	2020-06-02 14:54:41 +02:00
Matthijs Mekking	ef1a4a416a	If prepub > retire, prepub now Catch the case where the prepublication time of the successor key is later than the retire time of the predecessor. If that is the case we should prepublish as soon as possible, a.k.a. now. (cherry picked from commit `c08d0f7dd6`)	2020-06-02 14:54:32 +02:00
Matthijs Mekking	b0737b8c61	Put new key rollover logic in separate function The `dns_keymgr_run()` function became quite long, put the logic that looks if a new key needs to be created (start a key rollover) in a separate function. (cherry picked from commit `bcf8192438`)	2020-06-02 14:54:20 +02:00
Matthijs Mekking	168d362b54	Fix bug in keymgr_key_has_successor The logic in `keymgr_key_has_successor(key, keyring)` is flawed, it returns true if there is any key in the keyring that has a successor, while what we really want here is to make sure that the given key has a successor in the given keyring. Rather than relying on `keymgr_key_exists_with_state`, walk the list of keys in the keyring and check if the key is a successor of the given predecessor key. (cherry picked from commit `0d578097ef`)	2020-06-02 14:54:08 +02:00
Matthijs Mekking	f026332f88	Test keytimes on CSK rollover This improves keytime testing on CSK rollover. It now tests for specific times, and also tests for SyncPublish and Removed keytimes. Since an "active key" for ZSK and KSK means something different, this makes it tricky to decide when a CSK is active. An "active key" intuitively means the key is signing so we say a CSK is active when it is creating zone signatures. This change means a lot of timings for the CSK rollover tests need to be adjusted. The keymgr code needs a slight change on calculating the prepublication time: For a KSK we need to include the parent registration delay, but for CSK we look at the zone signing property and stick with the ZSK prepublication calculation. (cherry picked from commit `e233433772`)	2020-06-02 11:35:52 +02:00
Matthijs Mekking	437ec25c0c	kasp: registration delay adjustments Registration delay is not part of the Iret retire interval, thus removed from the calculation when setting the Delete time metadata. Include the registration delay in prepublication time, because we need to prepublish the key sooner than just the Ipub publication interval. (cherry picked from commit `50bbbb76a8`)	2020-06-02 11:35:32 +02:00
Matthijs Mekking	cad5ae1682	Set SyncPublish on keys Set the SyncPublish metadata on keys that don't have them yet. (cherry picked from commit `30cb5c97c2`)	2020-06-02 11:34:57 +02:00
Matthijs Mekking	6879cdca77	Set keytimes appropriately when using kasp While kasp relies on key states to determine when a key needs to be published or be used for signing, the keytimes are used by operators to get some expectation of key publication and usage. Update the code such that these keytimes are set appropriately. That means: - Print "PublishCDS" and "DeleteCDS" times in the state files. - The keymgr sets the "Removed" and "PublishCDS" times and derives those from the dnssec-policy. - Tweak setting of the "Retired" time, when retiring keys, only update the time to now when the retire time is not yet set, or is in the future. This also fixes a bug in "keymgr_transition_time" where we may wait too long before zone signatures become omnipresent or hidden. Not only can we skip waiting the sign delay Dsgn if there is no predecessor, we can also skip it if there is no successor. Finally, this commit moves setting the lifetime, reducing two calls to one. (cherry picked from commit `18dc27afd3`)	2020-06-02 11:34:04 +02:00
Mark Andrews	69c43a03d0	Ensure tree lock is always held when dns_rbt_fullnamefromnode is called (cherry picked from commit `eded3efb79`)	2020-05-29 15:02:09 -07:00
Evan Hunt	8b154d0f9f	pass the nodename to add32() instead of calling dns_rbt_fullnamefromnode() in addition to being more efficient, this prevents a possible crash by looking up the node name before the tree structure can be changed when cleaning up dead nodes in addrdataset(). (cherry picked from commit `db9d10e3c1`)	2020-05-29 15:02:09 -07:00
Evan Hunt	c19a593d3e	change "expr == false" to "!expr" in conditionals (cherry picked from commit `57e54c46e4`)	2020-05-25 17:04:04 -07:00
Evan Hunt	00c816778d	change 'expr == true' to 'expr' in conditionals (cherry picked from commit `68a1c9d679`)	2020-05-25 17:03:59 -07:00
Mark Andrews	4df013f0ea	Extend loop count by 1 to account for non-exact timing in usleep(). (cherry picked from commit `3012e1ad9b`)	2020-05-21 14:09:42 +10:00
Michał Kępień	e67d247ee7	Adjust library API versions	2020-05-19 13:57:37 +02:00
Tinderbox User	046f56f5ab	prep 9.16.3	2020-05-19 13:57:36 +02:00
Mark Andrews	39bb741927	Count queries to the root and TLD servers as well	2020-05-19 13:57:07 +02:00
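
Note on commit `aa72c31422`: the message argues for Fibonacci hashing instead of modulo when picking a hashtable slot. The sketch below illustrates the idea only and is not the code from rbt.c. The function name `fib_hash32` is hypothetical, and the multiplier is the 32-bit golden-ratio constant used by the Linux kernel, which the message points to. The hash is multiplied by the constant and the top `bits` bits are kept, so a table of `1 << bits` slots needs no division.

```c
#include <assert.h>
#include <stdint.h>

/* 32-bit golden-ratio multiplier, as used by the Linux kernel's hash_32(). */
#define GOLDEN_RATIO_32 0x61C88647u

/*
 * Map a 32-bit hash value to a slot in a table of (1 << bits) entries.
 * The multiplication wraps modulo 2^32; the shift keeps the highest
 * `bits` bits of the product, which are the best mixed ones.
 * (Hypothetical helper, for illustration only.)
 */
static inline uint32_t
fib_hash32(uint32_t val, unsigned int bits) {
	assert(bits >= 1 && bits <= 32);
	return ((uint32_t)(val * GOLDEN_RATIO_32) >> (32 - bits));
}
```

With `bits = 8`, for example, every result falls in the range 0..255, and consecutive inputs land in widely separated slots rather than adjacent ones.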

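Note on commit `0279cc76a7`: the message describes updating header attributes with a compare-and-exchange instead of taking the tree write lock. A minimal sketch of that pattern with C11 `<stdatomic.h>` follows. The flag values and the `attr_set` helper are assumptions made for illustration; they are not taken from the BIND sources, and the real `dns_rdatasetheader_t.attributes` field may use a different type.

```c
#include <stdatomic.h>
#include <stdbool.h>

/* Hypothetical flag values, chosen only for this example. */
#define ATTR_STALE   0x0001u
#define ATTR_ANCIENT 0x0002u

/*
 * Atomically set `flag` in `*attrs` without holding a lock.
 * Returns true if this call set the flag, false if it was already set.
 * On a failed exchange, `old` is refreshed with the current value and
 * the loop retries, so concurrent updates to other bits are preserved.
 */
static bool
attr_set(atomic_uint *attrs, unsigned int flag) {
	unsigned int old = atomic_load(attrs);
	do {
		if ((old & flag) != 0) {
			return (false);
		}
	} while (!atomic_compare_exchange_weak(attrs, &old, old | flag));
	return (true);
}
```

Returning whether the flag was newly set lets a caller bump a statistics counter exactly once per header, which is one way to avoid the skew the message mentions.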

7851 Commits