bind9

Author	SHA1	Message	Date
Mark Andrews	bde5c7632a	Always check the return from isc_refcount_decrement. Created isc_refcount_decrement_expect macro to test conditionally the return value to ensure it is in expected range. Converted unchecked isc_refcount_decrement to use isc_refcount_decrement_expect. Converted INSIST(isc_refcount_decrement()...) to isc_refcount_decrement_expect.	2020-07-31 10:15:44 +10:00
Mark Andrews	aca18b8b5b	Refactor the code that counts the last log version to keep When silencing the Coverity warning in remove_old_tsversions(), the code was refactored to reduce the indentation levels and break down the long code into individual functions. This improve fix for [GL #1989].	2020-07-31 09:30:12 +10:00
Michał Kępień	953d704bd2	Fix idle timeout for connected TCP sockets When named acting as a resolver connects to an authoritative server over TCP, it sets the idle timeout for that connection to 20 seconds. This fixed timeout was picked back when the default processing timeout for each client query was hardcoded to 30 seconds. Commit `000a8970f8` made this processing timeout configurable through "resolver-query-timeout" and decreased its default value to 10 seconds, but the idle TCP timeout was not adjusted to reflect that change. As a result, with the current defaults in effect, a single hung TCP connection will consistently cause the resolution process for a given query to time out. Set the idle timeout for connected TCP sockets to half of the client query processing timeout configured for a resolver. This allows named to handle hung TCP connections more robustly and prevents the timeout mismatch issue from resurfacing in the future if the default is ever changed again.	2020-07-30 10:58:39 +02:00
Evan Hunt	881b635141	initialize, rather than invalidating, new http buffers when building without ISC_BUFFER_USEINLINE (which is the default on Windows) an assertion failure could occur when setting up a new isc_httpd_t object for the statistics channel.	2020-07-27 14:29:37 -07:00
Diego Fronza	c2928c2ed4	Fix rpz wildcard name matching Whenever an exact match is found by dns_rbt_findnode(), the highest level node in the chain will not be put into chain->levels[] array, but instead the chain->end pointer will be adjusted to point to that node. Suppose we have the following entries in a rpz zone: example.com CNAME rpz-passthru. *.example.com CNAME rpz-passthru. A query for www.example.com would result in the following chain object returned by dns_rbt_findnode(): chain->level_count = 2 chain->level_matches = 2 chain->levels[0] = . chain->levels[1] = example.com chain->levels[2] = NULL chain->end = www Since exact matches only care for testing rpz set bits, we need to test for rpz wild bits through iterating the nodechain, and that includes testing the rpz wild bits in the highest level node found. In the case of an exact match, chain->levels[chain->level_matches] will be NULL, to address that we must use chain->end as the start point, then iterate over the remaining levels in the chain.	2020-07-24 11:34:40 -07:00
Mark Andrews	78db46d746	Check walking the hip rendezvous servers. Also fixes extraneous white space at end of record when there are no rendezvous servers.	2020-07-24 04:15:56 +00:00
Mark Andrews	70c060120f	Add fallthrough and braces	2020-07-24 13:49:56 +10:00
Petr Menšík	72d81c4768	Remove few lines in unix socket handling Reuse the same checks two times, make difference minimal.	2020-07-24 12:59:38 +10:00
Ondřej Surý	a9182c89a6	Change the dns_name hashing to use 32-bit values Change the dns_hash_name() and dns_hash_fullname() functions to use isc_hash32() as the maximum hashtable size in rbt is 0..UINT32_MAX large.	2020-07-21 08:44:26 +02:00
Ondřej Surý	f59fd49fd8	Add isc_hash32() and rename isc_hash_function() to isc_hash64() As the names suggest the original isc_hash64 function returns 64-bit long hash values and the isc_hash32() returns 32-bit values.	2020-07-21 08:44:26 +02:00
Ondřej Surý	344d66aaff	Add HalfSipHash 2-4 reference implementation The HalfSipHash implementation has 32-bit keys and returns 32-bit value.	2020-07-21 08:44:26 +02:00
Ondřej Surý	21d751dfc7	Remove OpenSSL based SipHash 2-4 implementation Creation of EVP_MD_CTX and EVP_PKEY is quite expensive, so until we fix the code to reuse the OpenSSL contexts and keys we'll use our own implementation of siphash instead of trying to integrate with OpenSSL.	2020-07-21 08:44:26 +02:00
Ondřej Surý	e24bc324b4	Fix the rbt hashtable and grow it when setting max-cache-size There were several problems with rbt hashtable implementation: 1. Our internal hashing function returns uint64_t value, but it was silently truncated to unsigned int in dns_name_hash() and dns_name_fullhash() functions. As the SipHash 2-4 higher bits are more random, we need to use the upper half of the return value. 2. The hashtable implementation in rbt.c was using modulo to pick the slot number for the hash table. This has several problems because modulo is: a) slow, b) oblivious to patterns in the input data. This could lead to very uneven distribution of the hashed data in the hashtable. Combined with the single-linked lists we use, it could really hog-down the lookup and removal of the nodes from the rbt tree[a]. The Fibonacci Hashing is much better fit for the hashtable function here. For longer description, read "Fibonacci Hashing: The Optimization that the World Forgot"[b] or just look at the Linux kernel. Also this will make Diego very happy :). 3. The hashtable would rehash every time the number of nodes in the rbt tree would exceed 3 * (hashtable size). The overcommit will make the uneven distribution in the hashtable even worse, but the main problem lies in the rehashing - every time the database grows beyond the limit, each subsequent rehashing will be much slower. The mitigation here is letting the rbt know how big the cache can grown and pre-allocate the hashtable to be big enough to actually never need to rehash. This will consume more memory at the start, but since the size of the hashtable is capped to `1 << 32` (e.g. 4 mio entries), it will only consume maximum of 32GB of memory for hashtable in the worst case (and max-cache-size would need to be set to more than 4TB). Calling the dns_db_adjusthashsize() will also cap the maximum size of the hashtable to the pre-computed number of bits, so it won't try to consume more gigabytes of memory than available for the database. FIXME: What is the average size of the rbt node that gets hashed? I chose the pagesize (4k) as initial value to precompute the size of the hashtable, but the value is based on feeling and not any real data. For future work, there are more places where we use result of the hash value modulo some small number and that would benefit from Fibonacci Hashing to get better distribution. Notes: a. A doubly linked list should be used here to speedup the removal of the entries from the hashtable. b. https://probablydance.com/2018/06/16/fibonacci-hashing-the-optimization-that-the-world-forgot-or-a-better-alternative-to-integer-modulo/	2020-07-21 08:44:26 +02:00
Evan Hunt	69c1ee1ce9	rewrite statschannel to use netmgr modify isc_httpd to use the network manager instead of the isc_socket API. also cleaned up bin/named/statschannel.c to use CHECK.	2020-07-15 22:35:07 -07:00
Michał Kępień	97a2733ef9	Update library API versions	2020-07-15 22:54:13 +02:00
Matthijs Mekking	e645d2ef1e	Check return value of dst_key_getbool() Fix Coverity CHECKED_RETURN reports for dst_key_getbool(). In most cases we do not really care about its return value, but it is prudent to check it. In one case, where a dst_key_getbool() error should be treated identically as success, cast the return value to void and add a relevant comment.	2020-07-14 12:53:54 +00:00
Mark Andrews	e7662c4c63	Mark 'addr' as unused if HAVE_IF_NAMETOINDEX is not defined Also 'zone' should be initialised to zero.	2020-07-14 00:13:40 +00:00
Mark Andrews	488eef63ca	Only call gsskrb5_register_acceptor_identity if we have gssapi_krb5.h.	2020-07-14 08:55:13 +10:00
Mark Andrews	18eef20241	Handle namespace clash over 'SEC' on illumos.	2020-07-14 07:46:10 +10:00
Mark Andrews	cc0089c66b	Address potential double unlock in process_fd	2020-07-14 07:07:14 +10:00
Witold Kręcicki	ae5d316f64	isccc: merge recv_message and recv_nonce into one function - make isccc message receiving code clearer by merging recv_nonce and recv_message into a single recv_data function and adding a boolean state field.	2020-07-13 13:17:08 -07:00
Evan Hunt	55896df79d	use handles for isc_nm_pauseread() and isc_nm_resumeread() by having these functions act on netmgr handles instead of socket objects, they can be used in callback functions outside the netgmr.	2020-07-13 13:17:08 -07:00
Evan Hunt	45ab0603eb	use an isc_task to execute rndc commands - using an isc_task to execute all rndc functions makes it relatively simple for them to acquire task exclusive mode when needed - control_recvmessage() has been separated into two functions, control_recvmessage() and control_respond(). the respond function can be called immediately from control_recvmessage() when processing a nonce, or it can be called after returning from the task event that ran the rndc command function.	2020-07-13 13:16:53 -07:00
Evan Hunt	3551d3ffd2	convert rndc and control channel to use netmgr - updated libisccc to use netmgr events - updated rndc to use isc_nm_tcpconnect() to establish connections - updated control channel to use isc_nm_listentcp() open issues: - the control channel timeout was previously 60 seconds, but it is now overridden by the TCP idle timeout setting, which defaults to 30 seconds. we should add a function that sets the timeout value for a specific listener socket, instead of always using the global value set in the netmgr. (for the moment, since 30 seconds is a reasonable timeout for the control channel, I'm not prioritizing this.) - the netmgr currently has no support for UNIX-domain sockets; until this is addressed, it will not be possible to configure rndc to use them. we will need to either fix this or document the change in behavior.	2020-07-13 13:16:53 -07:00
Evan Hunt	0580d9cd8c	style cleanup clean up style in rndc and the control channel in preparation for changing them to use the new network manager.	2020-07-13 12:41:04 -07:00
Diego Fronza	aab691d512	Fix ns_statscounter_recursclients underflow The basic scenario for the problem was that in the process of resolving a query, if any rrset was eligible for prefetching, then it would trigger a call to query_prefetch(), this call would run in parallel to the normal query processing. The problem arises due to the fact that both query_prefetch(), and, in the original thread, a call to ns_query_recurse(), try to attach to the recursionquota, but recursing client stats counter is only incremented if ns_query_recurse() attachs to it first. Conversely, if fetch_callback() is called before prefetch_done(), it would not only detach from recursionquota, but also decrement the stats counter, if query_prefetch() attached to te quota first that would result in a decrement not matched by an increment, as expected. To solve this issue an atomic bool was added, it is set once in ns_query_recurse(), allowing fetch_callback() to check for it and decrement stats accordingly. For a more compreensive explanation check the thread comment below: https://gitlab.isc.org/isc-projects/bind9/-/issues/1719#note_145857	2020-07-13 11:46:18 -03:00
Mark Andrews	42b2290c3a	Add changes for [GL #1989 ]	2020-07-13 13:10:45 +10:00
Mark Andrews	6ca78bc57d	Address overrun in remove_old_tsversions If too many versions of log / dnstap files to be saved where requests the memory after to_keep could be overwritten. Force the number of versions to be saved to a save level. Additionally the memmove length was incorrect.	2020-07-13 13:10:45 +10:00
Mark Andrews	827746e89b	Assert tsigout is non-NULL	2020-07-13 02:26:06 +00:00
Mark Andrews	9499adeb5e	check returns from inet_pton()	2020-07-13 00:31:29 +00:00
Michał Kępień	53120279b5	Fix locking for LMDB 0.9.26 When "rndc reconfig" is run, named first configures a fresh set of views and then tears down the old views. Consider what happens for a single view with LMDB enabled; "envA" is the pointer to the LMDB environment used by the original/old version of the view, "envB" is the pointer to the same LMDB environment used by the new version of that view: 1. mdb_env_open(envA) is called when the view is first created. 2. "rndc reconfig" is called. 3. mdb_env_open(envB) is called for the new instance of the view. 4. mdb_env_close(envA) is called for the old instance of the view. This seems to have worked so far. However, an upstream change [1] in LMDB which will be part of its 0.9.26 release prevents the above sequence of calls from working as intended because the locktable mutexes will now get destroyed by the mdb_env_close() call in step 4 above, causing any subsequent mdb_txn_begin() calls to fail (because all of the above steps are happening within a single named process). Preventing the above scenario from happening would require either redesigning the way we use LMDB in BIND, which is not something we can easily backport, or redesigning the way BIND carries out its reconfiguration process, which would be an even more severe change. To work around the problem, set MDB_NOLOCK when calling mdb_env_open() to stop LMDB from controlling concurrent access to the database and do the necessary locking in named instead. Reuse the view->new_zone_lock mutex for this purpose to prevent the need for modifying struct dns_view (which would necessitate library API version bumps). Drop use of MDB_NOTLS as it is made redundant by MDB_NOLOCK: MDB_NOTLS only affects where LMDB reader locktable slots are stored while MDB_NOLOCK prevents the reader locktable from being used altogether. [1] `2fd44e3251`	2020-07-10 11:29:18 +02:00
Mark Andrews	092a159dcd	Adjust range limit of unknown meta types	2020-07-08 02:04:16 +00:00
Ondřej Surý	81d4230e60	Update STALE and ANCIENT header attributes atomically The ThreadSanitizer found a data race when updating the stale header. Instead of trying to acquire the write lock and failing occasionally which would skew the statistics, the dns_rdatasetheader_t.attributes field has been promoted to use stdatomics. Updating the attributes in the mark_header_ancient() and mark_header_stale() now uses the cmpxchg to update the attributes forfeiting the need to hold the write lock on the tree. Please note that mark_header_ancient() still needs to hold the lock because .dirty is being updated in the same go.	2020-07-08 10:50:52 +10:00
Mark Andrews	bccea5862d	Make the stdatomic shim and mutexatomic type complete The stdatomic shims for non-C11 compilers (Windows, old gcc, ...) and mutexatomic implemented only and minimal subset of the atomic types. This commit adds 16-bit operations for Windows and all atomic types as defined in standard.	2020-07-08 09:39:02 +10:00
Mark Andrews	2fa2dbd5fb	remove redundant rctx != NULL check	2020-07-05 23:52:19 +00:00
Evan Hunt	f619708bbf	prevent "primaries" lists from having duplicate names it is now an error to have two primaries lists with the same name. this is true regardless of whether the "primaries" or "masters" keywords were used to define them.	2020-07-01 11:11:34 -07:00
Evan Hunt	424a3cf3cc	add "primary-only" as a synonym for "master-only" update the "notify" option to use RFC 8499 terminology as well.	2020-07-01 11:11:34 -07:00
Evan Hunt	16e14353b1	add "primaries" as a synonym for "masters" in named.conf as "type primary" is preferred over "type master" now, it makes sense to make "primaries" available as a synonym too. added a correctness check to ensure "primaries" and "masters" cannot both be used in the same zone.	2020-07-01 11:11:34 -07:00
Evan Hunt	233f134a4f	Don't destroy a non-closed socket, wait for all the callbacks. We erroneously tried to destroy a socket after issuing isc__nm_tcp{,dns}_close. Under some (race) circumstances we could get nm_socket_cleanup to be called twice for the same socket, causing an access to a dead memory.	2020-07-01 17:35:10 +02:00
Witold Kręcicki	896db0f419	Fix possible race in isc__nm_tcpconnect. There's a possibility of race in isc__nm_tcpconnect if the asynchronous connect operation finishes with all the callbacks before we exit the isc__nm_tcpconnect itself we might access an already freed memory. Fix it by creating an additional reference to the socket freed at the end of isc__nm_tcpconnect.	2020-07-01 13:52:12 +00:00
Witold Kręcicki	25f84ffc68	Add missing libisc.def definitions, netmgr version of isc_sockettype_t.	2020-07-01 13:52:12 +00:00
Witold Kręcicki	c8f2d55acf	rbtdb: cleanup_dead_nodes should ignore alive nodes on the deadlist	2020-07-01 15:11:07 +02:00
Witold Kręcicki	b4f3fafcff	Fix assertion failure during startup when the server is under load. When we're coming back from recursion fetch_callback does not accept DNS_R_NXDOMAIN as an rcode - query_gotanswer calls query_nxdomain in which an assertion fails on qctx->is_zone. Yet, under some circumstances, qname minimization will return an DNS_R_NXDOMAIN - when root zone mirror is not yet loaded. The fix changes the DNS_R_NXDOMAIN answer to DNS_R_SERVFAIL.	2020-07-01 12:25:36 +02:00
Evan Hunt	23c7373d68	restore "blackhole" functionality the blackhole ACL was accidentally disabled with respect to client queries during the netmgr conversion. in order to make this work for TCP, it was necessary to add a return code to the accept callback functions passed to isc_nm_listentcp() and isc_nm_listentcpdns().	2020-06-30 17:29:09 -07:00
Matthijs Mekking	19ce9ec1d4	Output rndc dnssec -status Implement the 'rndc dnssec -status' command that will output some information about the key states, such as which policy is used for the zone, what keys are in use, and when rollover is scheduled. Add loose testing in the kasp system test, the actual times are already tested via key file inspection.	2020-06-30 09:51:04 +02:00
Matthijs Mekking	9e03f8e8fe	Move dst key printtime in separate function I'd like to use the same functionality (pretty print the datetime of keytime metadata) in the 'rndc dnssec -status' command. So it is better that this logic is done in a separate function. Since the stdtime.c code have differernt files for unix and win32, I think the "#ifdef WIN32" define can be dropped.	2020-06-30 09:51:04 +02:00
Matthijs Mekking	a47192ed5b	kasp tests: fix wait for reconfig done The wait until zones are signed after rndc reconfig is broken because the zones are already signed before the reconfig. Fix by having a different way to ensure the signing of the zone is complete. This does require a call to the "wait_for_done_signing" function after each "check_keys" call after the ns6 reconfig. The "wait_for_done_signing" looks for a (newly added) debug log message that named will output if it is done signing with a certain key.	2020-06-26 08:43:45 +00:00
Evan Hunt	591b79b597	Make netmgr tcpdns send calls asynchronous isc__nm_tcpdns_send() was not asynchronous and accessed socket internal fields in an unsafe manner, which could lead to a race condition and subsequent crash. Fix it by moving tcpdns processing to a proper netmgr thread.	2020-06-26 00:19:42 -07:00
Witold Kręcicki	1cf65cd882	Fix a shutdown race in netmgr udp We need to mark the socket as inactive early (and synchronously) in the stoplistening process; otherwise we might destroy the callback argument before we actually stop listening, and call the callback on bad memory.	2020-06-26 00:19:42 -07:00
Evan Hunt	3704c4fff2	clean up outerhandle when a tcpdns socket is disconnected this prevents a crash when some non-netmgr thread, such as a recursive lookup, times out after the TCP socket is already disconnected.	2020-06-26 00:19:42 -07:00

1 2 3 4 5 ...

12692 Commits