bind9

Author	SHA1	Message	Date
Mark Andrews	14bd113b8f	Fix dual-stack-servers Named was stopping nameserver address resolution attempts too soon when dual stack servers are configured. Dual stack servers are used when there are not addresses for the server in a particular address family so find->status == DNS_ADB_NOMOREADDRESSES is not a sufficient stopping condition when dual stack servers are available. Call fctx_try to see if the alternate servers can be used. (cherry picked from commit `f98a8331aa`)	2025-02-26 01:04:59 +00:00
Evan Hunt	4f1f958d6d	prevent a reference leak from the ns_query_done hooks if the NS_QUERY_DONE_BEGIN or NS_QUERY_DONE_SEND hook is used in a plugin and returns NS_HOOK_RETURN, some of the cleanup in ns_query_done() can be skipped over, leading to reference leaks that can cause named to hang on shut down. this has been addressed by adding more housekeeping code after the cleanup: tag in ns_query_done(). (cherry picked from commit `c2e4358267`)	2025-02-26 00:55:51 +00:00
Mark Andrews	a0dae15cd1	Relax private DNSKEY and RRSIG constraints DNSKEY, KEY, RRSIG and SIG constraints have been relaxed to allow empty key and signature material after the algorithm identifier for PRIVATEOID and PRIVATEDNS. It is arguable whether this falls within the expected use of these types as no key material is shared and the signatures are ineffective but these are private algorithms and they can be totally insecure. (cherry picked from commit `b048190e23`)	2025-02-25 23:40:38 +00:00
Ondřej Surý	7682d63bd4	Destroy the hashmap iterator inside the rwlock Previously, the hashmap iterator for fetches-per-zone was destroy outside the rwlock. This could lead to an assertion failure due to a timing race with the internal rehashing of the hashmap table as the rehashing process requires no iterators to be running when rehashing the hashmap table. This has been fixed by moving the destruction of the iterator inside the read locked section. (cherry picked from commit `1e4fb53c61`)	2025-02-25 15:41:30 +00:00
Evan Hunt	16a80f401a	Fix a logic error in cache_name() A change in `6aba56ae8` (checking whether a rejected RRset was identical to the data it would have replaced, so that we could still cache a signature) inadvertently introduced cases where processing of a response would continue when previously it would have been skipped. (cherry picked from commit `d0fd9cbe3b`)	2025-02-24 23:42:25 +00:00
Ondřej Surý	37e95cb4dd	Dump the fetches from dns_resolver_dumpfetches() Previously, the dns_resolver_dumpfetches() would go over the fetch counters. Alas, because of the earlier optimization, the fetch counters would be increased only when fetches-per-zone was not 0, otherwise the whole counting was skipped for performance reasons. Instead of using the auxiliary fetch counters hash table, use the real hash table that stores the fetch contexts to dump the ongoing fetches to the recursing file. Additionally print more information about the fetch context like start and expiry times, number of fetch responses, number of queries and count of allowed and dropped fetches. (cherry picked from commit `c6b0368b21`)	2025-02-21 22:05:24 +00:00
Ondřej Surý	eec7b79ee0	Fix the fetch context hash table lock ordering The order of the fetch context hash table rwlock and the individual fetch context was reversed when calling the release_fctx() function. This was causing a problem when iterating the hash table, and thus the ordering has been corrected in a way that the hash table rwlock is now always locked on the outside and the fctx lock is the interior lock. (cherry picked from commit `cf078fadeb`)	2025-02-21 22:27:34 +01:00
Ondřej Surý	ace7c879a8	Add isc_timer_running() function to check status of timer In the next commit, we need to know whether the timer has been started or stopped. Add isc_timer_running() function that returns true if the timer has been started. (cherry picked from commit `b9e3cd5d2a`)	2025-02-21 22:27:25 +01:00
Aram Sargsyan	0add37862e	Fix RPZ bug when resuming a query during a reconfiguration After a reconfiguration the old view can be left without a valid 'rpzs' member, because when the RPZ is not changed during the named reconfiguration 'rpzs' "migrate" from the old view into the new view, so when a query resumes it can find that 'qctx->view->rpzs' is NULL which query_resume() currently doesn't expect to happen if it's recursing and 'qctx->rpz_st' is not NULL. Fix the issue by adding a NULL-check. In order to not split the log message to two different log messages depending on whether 'qctx->view->rpzs' is NULL or not, change the message to not log the RPZ policy's "version" which is just a runtime counter and is most likely not very useful for the users. (cherry picked from commit `3ea2fbc238`)	2025-02-21 11:45:45 +00:00
Mark Andrews	db364baa83	Remove check for missing RRSIG records from getsection Checking whether the authority section is properly signed should be left to the validator. Checking in getsection (dns_message_parse) was way too early and resulted in resolution failures of lookups that should have otherwise succeeded. (cherry picked from commit `83159d0a54`)	2025-02-21 03:00:29 +00:00
Aram Sargsyan	5d69aab92d	Implement sig0key-checks-limit and sig0message-checks-limit Previously a hard-coded limitation of maximum two key or message verification checks were introduced when checking the message's SIG(0) signature. It was done in order to protect against possible DoS attacks. The logic behind choosing the number two was that more than one key should only be required only during key rotations, and in that case two keys are enough. But later it became apparent that there are other use cases too where even more keys are required, see issue number #5050 in GitLab. This change introduces two new configuration options for the views, sig0key-checks-limit and sig0message-checks-limit, which define how many keys are allowed to be checked to find a matching key, and how many message verifications are allowed to take place once a matching key has been found. The latter protects against expensive cryptographic operations when there are keys with colliding tags and algorithm numbers, with default being 2, and the former protects against a bit less expensive key parsing operations and defaults to 16. (cherry picked from commit `716b936045`)	2025-02-20 14:48:01 +00:00
Aram Sargsyan	18fbc3f735	Fix isc_quota bug Running jobs which were entered into the isc_quota queue is the responsibility of the isc_quota_release() function, which, when releasing a previously acquired quota, checks whether the queue is empty, and if it's not, it runs a job from the queue without touching the 'quota->used' counter. This mechanism is susceptible to a possible hangup of a newly queued job in case when between the time a decision has been made to queue it (because used >= max) and the time it was actually queued, the last quota was released. Since there is no more quotas to be released (unless arriving in the future), the newly entered job will be stuck in the queue. Fix the wrong memory ordering for 'quota->used', as the relaxed ordering doesn't ensure that data modifications made by one thread are visible in other threads. Add checks in both isc_quota_release() and isc_quota_acquire_cb() to make sure that the described hangup does not happen. Also see code comments. (cherry picked from commit `c6529891bb`)	2025-02-20 12:20:25 +00:00
Aram Sargsyan	0bd251a496	Expose the incoming transfers' rates in the statistics channel Expose the average transfer rate (in bytes-per-second) during the last full 'min-transfer-rate-in <bytes> <minutes>' minutes interval. If no such interval has passed yet, then the overall average rate is reported instead. (cherry picked from commit `c701b590e4`)	2025-02-20 11:05:09 +00:00
Aram Sargsyan	e6b14365ad	Implement the min-transfer-rate-in configuration option This new option sets a minimum amount of transfer rate for an incoming zone transfer that will abort a transfer, which for some network related reasons run very slowly. (cherry picked from commit `91ea156203`)	2025-02-20 11:05:09 +00:00
Evan Hunt	fad9b3771f	Check whether a rejected rrset is different Add a new dns_rdataset_equals() function to check whether two rdatasets are equal in DNSSEC terms. When an rdataset being cached is rejected because its trust level is lower than the existing rdataset, we now check to see whether the rejected data was identical to the existing data. This allows us to cache a potentially useful RRSIG when handling CD=1 queries, while still rejecting RRSIGs that would definitely have resulted in a validation failure. (cherry picked from commit `6aba56ae89`)	2025-02-19 18:29:34 -08:00
Artem Boldariev	788e925261	DoH: http_send_outgoing() return value is not used The value returned by http_send_outgoing() is not used anywhere, so we make it not return anything (void). Probably it is an omission from older times. (cherry picked from commit `2adabe835a`)	2025-02-19 20:34:29 +02:00
Artem Boldariev	47e9b47742	DoH: Fix missing send callback calls When handling outgoing data, there were a couple of rarely executed code paths that would not take into account that the callback MUST be called. It could lead to potential memory leaks and consequent shutdown hangs. (cherry picked from commit `8b8f4d500d`)	2025-02-19 20:34:29 +02:00
Artem Boldariev	6b9387e2ee	DoH: change how the active streams number is calculated This commit changes the way how the number of active HTTP streams is calculated and allows it to scale with the values of the maximum amount of streams per connection, instead of effectively capping at STREAM_CLIENTS_PER_CONN. The original limit, which is intended to define the pipelining limit for TCP/DoT. However, it appeared to be too restrictive for DoH, as it works quite differently and implements pipelining at protocol level by the means of multiplexing multiple streams. That renders each stream to be effectively a separate connection from the point of view of the rest of the codebase. (cherry picked from commit `a22bc2d7d4`)	2025-02-19 20:34:29 +02:00
Artem Boldariev	96e8ea1245	DoH: Track the amount of in flight outgoing data Previously we would limit the amount of incoming data to process based solely on the presence of not completed send requests. That worked, however, it was found to severely degrade performance in certain cases, as was revealed during extended testing. Now we switch to keeping track of how much data is in flight (or ready to be in flight) and limit the amount of processed incoming data when the amount of in flight data surpasses the given threshold, similarly to like we do in other transports. (cherry picked from commit `05e8a50818`)	2025-02-19 20:34:29 +02:00
Evan Hunt	e35e701c2c	when committing a new qpzone version, delete dead nodes if all data has been deleted from a node in the qpzone database, delete the node too. (cherry picked from commit `e58ce19cf2`)	2025-02-18 22:55:20 +00:00
Artem Boldariev	7b0a5596d6	Fix wrong logging severity in do_nsfetch() ISC_LOG_WARNING was used while ISC_LOG_DEBUG(3) was implied. (cherry picked from commit `fd3beaba2e`)	2025-02-18 10:30:18 +02:00
Evan Hunt	a21168a221	fix dns_qp_insert() checks in qpzone in some places there were checks for failures of dns_qp_insert() after dns_qp_getname(). such failures could only happen if another thread inserted a node between the two calls, and that can't happen because the calls are serialized with dns_qpmulti_write(). we can simplify the code and just add an INSIST. (cherry picked from commit `fffa150df3`)	2025-02-18 05:55:02 +00:00
Mark Andrews	89122c3fde	Re-fetch pending records that failed validation If a deferred validation on data that was originally queried with CD=1 fails, we now repeat the query, since the zone data may have changed in the meantime. (cherry picked from commit `04b1484ed8`)	2025-02-17 11:04:19 +11:00
Mark Andrews	de8893733f	Complete the deferred validation if there are no RRSIGs When a query is made with CD=1, we store the result in the cache marked pending so that it can be validated later, at which time it will either be accepted as an answer or removed from the cache as invalid. Deferred validation was not attempted when there were no cached RRSIGs for DNSKEY and DS. We now complete the deferred validation in this scenario. (cherry picked from commit `8b900d1808`)	2025-02-17 11:04:17 +11:00
Mark Andrews	ae3e67717c	Fix "CNAME and other data" detection prio_type was being used in the wrong place to optimize cname_and_other. We have to first exclude and accepted types and we also have to determine that the record exists before we can check if we are at a point where a later CNAME cannot appear. (cherry picked from commit `5e49a9e4ae`)	2025-02-14 13:41:11 +11:00
Ondřej Surý	db2bce1c6f	Switch the locknum generation for qpznode to random Instead of using on hash of the name modulo number of the buckets, assign the locknum randomly with isc_random_uniform(). This makes the locknum assignment aligned with qpcache and allows the bucket number to be non-prime in the future. (cherry picked from commit `732fc338a9`)	2025-02-04 23:28:53 +01:00
Ondřej Surý	d4e8a92977	Rely on call_rcu() to destroy the qpzone outside of locks Reduce the number of qpzone_ref() and qpzone_unref() calls in qpzone_detachnode() by relying on the call_rcu to delay the destruction of the lock buckets. (cherry picked from commit `1fa5219fdf`)	2025-02-04 23:28:53 +01:00
Ondřej Surý	c6c03a6b11	Reduce false sharing in dns_qpzone Instead of having many node_lock_count * sizeof(<member>) arrays, pack all the members into a qpzone_bucket_t that is cacheline aligned and have a single array of those. (cherry picked from commit `6dcc398726`)	2025-02-04 23:28:50 +01:00
Ondřej Surý	a9f4e3369a	Reduce false sharing in dns_qpcache Instead of having many node_lock_count * sizeof(<member>) arrays, pack all the members into a qpcache_bucket_t struct that is cacheline aligned and have a single array of those. Additionaly, make both the head and the tail of isc_queue_t padded, not just the head, to prevent false sharing of the lock-free structure with the lock that follows it. (cherry picked from commit `c602d76c1f`)	2025-02-04 23:27:28 +01:00
Ondřej Surý	8229d9cdfa	Print the expiration time of the stale records (not ancient) In #1870, the expiration time of ANCIENT records were printed, but actually the ancient records are very short lived, and the information carries a little value. Instead of printing the expiration of ANCIENT records, print the expiration time of STALE records. (cherry picked from commit `355fc48472`)	2025-02-04 18:07:59 +01:00
Ondřej Surý	302aca809d	Expand the usage of mark_ancient() helper functions When the mark_ancient() helper function was introduced, couple of places with duplicate (or almost duplicate) code was missed. Move the mark_ancient() function closer to the top of the file, and correctly use it in places that mark the header as ANCIENT. (cherry picked from commit `58179e6a19`)	2025-02-03 15:53:34 +01:00
Ondřej Surý	4b114838de	Add better ZEROTTL handling in bindrdataset() If we know that the header has ZEROTTL set, the server should never send stale records for it and the TTL should never be anything else than 0. The comment was already there, but the code was not matching the comment. (cherry picked from commit `cfee6aa565`)	2025-02-03 15:53:34 +01:00
Ondřej Surý	b32512a232	In cache, set rdataset TTL to 0 when the header is not active When the header has been marked as ANCIENT, but the ttl hasn't been reset (this happens in couple of places), the rdataset TTL would be set to the header timestamp instead to a reasonable TTL value. Since this header has been already expired (ANCIENT is set), set the rdataset TTL to 0 and don't reuse this field to print the expiration time when dumping the cache. Instead of printing the time, we now just print 'expired (awaiting cleanup'. (cherry picked from commit `1bbb57f81b`)	2025-02-03 15:53:34 +01:00
Evan Hunt	1e818d368f	fix the cache findzonecut implementation the search for the deepest known zone cut in the cache could improperly reject a node containing stale data, even if the NS rdataset wasn't the data that was stale. this change also improves the efficiency of the search by stopping it when both NS and RRSIG(NS) have been found. (cherry picked from commit `1f095b902c`)	2025-02-02 20:01:52 +01:00
Ondřej Surý	857225aeb6	Clarify reference counting in RBTDB database Change the names of the node reference counting functions and add comments to make the mechanism easier to understand: - dns__rbtdb_newref() and dns__rbtdb_decref() are now called dns__rbtnode_acquire() and dns__rbtnode_release() respectively; this reflects the fact that they modify both the internal and external reference counters for a node. - rbtnode_newref() and rbtnode_decref are now called rbtnode_erefs_increment() and rbtnode_erefs_decrement(), to reflect that they only increase and decrease the node's external reference counters, not internal.	2025-01-31 06:07:48 +01:00
Ondřej Surý	9c45de9473	Refactor node reference counting in rbtdb.c Refactor the pattern in the newref() and decref() functions in rbtdb.c following the pattern, so it follows the similar pattern we already have for QPDB.	2025-01-31 05:52:13 +01:00
Evan Hunt	5300eebc9e	Clarify reference counting in QP databases Change the names of the node reference counting functions and add comments to make the mechanism easier to understand: - newref() and decref() are now called qpcnode_acquire()/ qpznode_acquire() and qpcnode_release()/qpznode_release() respectively; this reflects the fact that they modify both the internal and external reference counters for a node. - qpcnode_newref() and qpznode_newref() are now called qpcnode_erefs_increment() and qpznode_erefs_increment(), and qpcnode_decref() and qpznode_decref() are now called qpcnode_erefs_decrement() and qpznode_erefs_decrement(), to reflect that they only increase and decrease the node's external reference counters, not internal. (cherry picked from commit `d4f791793e`)	2025-01-31 05:52:13 +01:00
Ondřej Surý	7dab6cdfbc	Remove db_nodelock_t in favor of reference counted qpdb This removes the db_nodelock_t structure and changes the node_locks array to be composed only of isc_rwlock_t pointers. The .reference member has been moved to qpdb->references in addition to common.references that's external to dns_db API users. The .exiting members has been completely removed as it has no use when the reference counting is used correctly. (cherry picked from commit `431513d8b3`)	2025-01-31 05:49:36 +01:00
Ondřej Surý	082a54cc5d	Remove origin_node from qpcache The origin_node in qpcache was always NULL, so we can remove the getoriginode() function and origin_node pointer as the dns_db_getoriginnode() correctly returns ISC_R_NOTFOUND when the function is not implemented. (cherry picked from commit `36a26bfa1a`)	2025-01-31 05:49:23 +01:00
Ondřej Surý	d1d444d2ab	Refactor decref() in both qpcache.c and qpzone.c Cleanup the pattern in the decref() functions in both qpcache.c and qpzone.c, so it follows the similar patter as we already have in newref() function. (cherry picked from commit `814b87da64`)	2025-01-31 05:49:12 +01:00
Colin Vidal	3aff00dc7b	fix EDE 22 time out detection Extended DNS error 22 (No reachable authority) was previously detected when `fctx_expired` fired. It turns out this function is used as a "safety net" and the timeout detection should be caught earlier. It was working though, because of another issue fixed by !9927. Since this change, the recursive request timed out detection occurs before `fctx_expired` so EDE 22 is not added to the response message anymore. The fix of the problem is to add the EDE 22 code in two situations: - When the dispatch code timed out (rctx_timedout) the resolver code checks various properties to figure out if it needs to make another fetch attempt. One of the paramters if the fetch expiration time. If it expires, the whole recursion is canceled, so it now adds the EDE 22 code. - If the fetch expiration time doesn't expires in the case above (and other parameters allows it) a new fetch attempt is made (fctx_query). But before the new request is actually made, the fetch expiration time is re-checked. It might then has elapsed, and the whole recursion is canceled. So it now also adds the EDE 22 code here as well. (cherry picked from commit `78274ec2b1`)	2025-01-30 14:43:25 +00:00
Colin Vidal	7b04c80183	manually add dns_lctx to isc_log_write in ede.c Because the new introduced code in main doesn't use the log context anymore, manually add the log context for isc_log_write usages in the new ede.c file.	2025-01-30 12:37:55 +00:00
Colin Vidal	ccafa27b44	Use DNS_EDE_OTHER instead of its literal value (cherry picked from commit `7c5678bb03`)	2025-01-30 12:37:55 +00:00
Colin Vidal	e5fc9f5fcb	detect dup EDE with bitmap and store next pos In order to avoid to loop to find the next position to store an EDE in a dns_edectx_t, add a "nextede" state which holds the next available position. Also, in order ot avoid to loop to find if an EDE is already existing in a dns_edectx_t, and avoid a duplicate, use a bitmap to immediately know if the EDE is there or not. Those both changes applies for adding or copying EDE. Also make the direction of dns_ede_copy more explicit/avoid errors by making "edectx_from" a const pointer. (cherry picked from commit `9021f9d802`)	2025-01-30 12:37:55 +00:00
Colin Vidal	f390108f8c	add lib/dns/ede.c documentation Add documentation usage of EDE compilation unit as well as centralize all EDE-related macros in the same lib/dns/include/dns/ede.h header. (cherry picked from commit `7b01cbfb04`)	2025-01-30 12:37:55 +00:00
Colin Vidal	7e3a650ae2	Refactor test covering dns_ede API Migrate tests cases in client_test code which were exclusively testing code which is now all wrapped inside ede compilation unit. Those are testing maximum number of EDE, duplicate EDE as well as truncation of text of an EDE. Also add coverage for the copy of EDE from an edectx to another one, as well as checking the assertion of the maximum EDE info code which can be used. (cherry picked from commit `f9f41190b3`)	2025-01-30 12:37:55 +00:00
Ondřej Surý	1ffb67a135	Split and simplify the use of EDE list implementation Instead of mixing the dns_resolver and dns_validator units directly with the EDE code, split-out the dns_ede functionality into own separate compilation unit and hide the implementation details behind abstraction. Additionally, the EDE codes are directly copied into the ns_client buffers by passing the EDE context to dns_resolver_createfetch(). This makes the dns_ede implementation simpler to use, although sligtly more complicated on the inside. Co-authored-by: Colin Vidal <colin@isc.org> Co-authored-by: Ondřej Surý <ondrej@isc.org> (cherry picked from commit `2f8e0edf3b`)	2025-01-30 12:37:55 +00:00
Andoni Duarte Pintado	2d0323e006	Merge tag 'v9.20.5' into bind-9.20	2025-01-29 17:21:44 +01:00
Michal Nowak	2134b35557	Use archived version of draft-icann-dnssec-keymgmt-01.txt The iana.org link is gone. (cherry picked from commit `5dbc87730e`)	2025-01-28 13:41:05 +00:00
Colin Vidal	6c65d70ce5	add support for EDE code 1 and 2 Add support for EDE codes 1 (Unsupported DNSKEY Algorithm) and 2 (Unsupported DS Digest Type) which might occurs during DNSSEC validation in case of unsupported DNSKEY algorithm or DS digest type. Because DNSSEC internally kicks off various fetches, we need to copy all encountered extended errors from fetch responses to the fetch context. Upon an event, the errors from the fetch context are copied to the client response. (cherry picked from commit `46a58acdf5`)	2025-01-24 14:27:16 +01:00

1 2 3 4 5 ...

15654 Commits