Acquire the database reference in detachnode() to prevent the last
reference from being released while the NODE_LOCK is still held. The
NODE_LOCK is locked/unlocked inside the RCU critical section, so this
most probably should not pose a problem because the database uses
call_rcu memory reclamation, but it is still safer to acquire the
reference before releasing the node.
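A minimal sketch of the safer ordering; the function and type names
below (qpzonedb_ref(), qpznode_release(), etc.) are illustrative
assumptions, not necessarily the real identifiers:

    /*
     * Sketch only: hold an extra database reference across the node
     * release so that dropping the node's last reference cannot tear
     * down the database (and the NODE_LOCK it owns) while the lock is
     * still in use.
     */
    static void
    detachnode(dns_db_t *db, dns_dbnode_t **nodep) {
            qpzonedb_t *qpdb = (qpzonedb_t *)db;
            qpznode_t *node = (qpznode_t *)*nodep;
            *nodep = NULL;

            qpzonedb_ref(qpdb);          /* acquire first */
            qpznode_release(qpdb, node); /* may drop the node's last ref */
            qpzonedb_unref(qpdb);        /* drop the extra reference last */
    }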
(cherry picked from commit d1ef6a93c1)
When dns_remote_done() is true, calling dns_remote_curraddr() asserts.
Add a dns_remote_done() check before calling dns_remote_curraddr().
(cherry picked from commit 6cd9e4f67c)
ZONEMD needs to be able to digest SIG and RRSIG records. The signer
field can be compressed in SIG so we need to call dns_name_digest().
While the signer field is not compressed in RRSIG records, the
canonical form has the signer field downcased (RFC 4034, section 6.2).
This also implies that compare_rrsig() needs to downcase the signer
field during comparison.
(cherry picked from commit 006c5990ce)
The iterated hash implementation needs to be initialised
on the worker thread. Also clean it up after we are done.
(cherry picked from commit 988dc57c8c)
When all the addresses have already been iterated over, the
dns_remote_curraddr() function asserts. So before calling it,
dns_zone_getprimaryaddr() now checks the address list using the
dns_remote_done() function. This also means that instead of
returning 'isc_sockaddr_t' it now returns 'isc_result_t' and
writes the primary's address into the provided pointer only when
returning success.
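A minimal sketch of the changed interface; the 'primaries' member name
and the locking (omitted here) are assumptions:

    /*
     * Sketch only: return a result code and write the address through a
     * pointer instead of returning an isc_sockaddr_t directly.
     */
    isc_result_t
    dns_zone_getprimaryaddr(dns_zone_t *zone, isc_sockaddr_t *addrp) {
            REQUIRE(addrp != NULL);

            if (dns_remote_done(&zone->primaries)) {
                    return (ISC_R_NOMORE); /* no current address */
            }

            *addrp = dns_remote_curraddr(&zone->primaries);
            return (ISC_R_SUCCESS);
    }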
(cherry picked from commit 7293cb0612)
This commit bumps the number of active streams (i.e. opened streams
for which a request has been received but the response is not yet
ready) to 60% of the total streams limit.
The previous limit turned out to be too tight as revealed by
longer (≥1h) runs of "stress:long:rpz:doh+udp:linux:*" tests.
(cherry picked from commit eaad0aefe6)
The check, while not active by default, has not been valid since
commit 8b8f4d500d.
See the 'if (total == 0) { ...' branch below to understand why.
(cherry picked from commit 217a1ebd79)
Previously, the code would try to avoid sending any data, regardless
of what it is, unless:
a) The flush limit is reached;
b) There are no sends in flight.
This strategy was used to avoid issuing numerous send requests carrying
small amounts of data. However, it has proven to be too aggressive and,
in fact, harms performance in some cases (e.g. on longer (≥1h) runs
of "stress:long:rpz:doh+udp:linux:*").
Now, in addition to the listed cases, we also:
c) Flush the buffer and perform a send operation when there is an
outgoing DNS message passed to the code (which is indicated by the
presence of a send callback).
That helps improve performance for "stress:long:rpz:doh+udp:linux:*"
tests.
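A self-contained sketch of the resulting decision; the helper and its
parameters are illustrative names, not the actual BIND 9 identifiers:

    #include <stdbool.h>
    #include <stddef.h>

    /* Illustrative callback type standing in for the real send callback. */
    typedef void (*send_cb_t)(void *arg);

    /*
     * Send now if a) the flush limit has been reached, b) nothing is
     * currently in flight, or c) a complete outgoing DNS message is
     * ready, which is signalled by a non-NULL send callback.
     */
    static bool
    should_flush_and_send(size_t buffered, size_t flush_limit,
                          size_t sends_in_flight, send_cb_t cb) {
            return (buffered >= flush_limit ||  /* a) */
                    sends_in_flight == 0 ||     /* b) */
                    cb != NULL);                /* c) */
    }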
(cherry picked from commit c5f7968856)
Previously, a function for continuing IO processing on the next UV
tick was introduced (http_do_bio_async()). The intention behind this
function was to ensure that http_do_bio() is eventually called at
least once in the future. However, the current implementation allows
queueing multiple such delayed requests needlessly. There is currently
no need for these excessive requests as http_do_bio() can requeue them
if needed. At the same time, each such request can lead to a memory
allocation, particularly in BIND 9.18.
This commit ensures that the number of enqueued delayed IO processing
requests never exceeds one, to avoid needlessly bombarding IO threads
with delayed requests.
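A minimal sketch of the "at most one pending request" pattern,
assuming a per-session boolean flag; the names are illustrative:

    #include <stdbool.h>

    /* Illustrative session state; the real object carries much more. */
    typedef struct http_session {
            bool bio_async_pending; /* delayed http_do_bio() queued? */
    } http_session_t;

    /* Queue a delayed IO processing request only if none is pending. */
    static void
    http_do_bio_async(http_session_t *session) {
            if (session->bio_async_pending) {
                    return; /* one queued request is enough */
            }
            session->bio_async_pending = true;
            /* ... schedule http_do_bio() for the next loop tick ... */
    }

    /*
     * The queued callback clears the flag before doing its work, so the
     * processing itself can requeue another request if needed.
     */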
(cherry picked from commit 0e1b02868a)
This commit significantly simplifies the code flow in the
http_do_bio() function, which is responsible for processing incoming
and outgoing HTTP/2 data. It seems that the way it was structured
before was indirectly caused by the missing-callback-calls bug fixed
in 8b8f4d500d.
The change introduced by this commit is known to remove a bottleneck
and allows reproducible and measurable performance improvement for
long runs (>= 1h) of "stress:long:rpz:doh+udp:linux:*" tests.
Additionally, it fixes a similar issue with potentially missing send
callback calls and hardens the code against use-after-free errors
related to the session object, which could potentially occur.
(cherry picked from commit 0956fb9b9e)
Answers to an "ANY" query which are processed by the RPZ "passthru"
policy have the response-policy's 'max-policy-ttl' value unexpectedly
applied. Do not change the records' TTL when RPZ uses a policy which
does not alter the answer.
(cherry picked from commit 5633dc90d3)
Named was stopping nameserver address resolution attempts too soon
when dual stack servers are configured. Dual stack servers are
used when there are *no* addresses for the server in a particular
address family, so find->status == DNS_ADB_NOMOREADDRESSES is not a
sufficient stopping condition when dual stack servers are available.
Call fctx_try() to see if the alternate servers can be used.
(cherry picked from commit f98a8331aa)
If the NS_QUERY_DONE_BEGIN or NS_QUERY_DONE_SEND hook is
used in a plugin and returns NS_HOOK_RETURN, some of the
cleanup in ns_query_done() can be skipped over, leading
to reference leaks that can cause named to hang on shutdown.
This has been addressed by adding more housekeeping
code after the cleanup: tag in ns_query_done().
(cherry picked from commit c2e4358267)
DNSKEY, KEY, RRSIG and SIG constraints have been relaxed to allow
empty key and signature material after the algorithm identifier for
PRIVATEOID and PRIVATEDNS. It is arguable whether this falls within
the expected use of these types, as no key material is shared and
the signatures are ineffective, but these are private algorithms and
they can be totally insecure.
(cherry picked from commit b048190e23)
Previously, the hashmap iterator for fetches-per-zone was destroyed
outside the rwlock. This could lead to an assertion failure due to a
timing race with the internal rehashing of the hashmap table, as
rehashing requires that no iterators be running. This has been fixed
by moving the destruction of the iterator inside the read-locked
section.
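A rough sketch of the corrected ordering; the lock macros, member
names, and hashmap iterator calls are written from memory and should
be treated as assumptions:

    RWLOCK(&resolver->fctxs_lock, isc_rwlocktype_read);

    isc_hashmap_iter_create(resolver->fctxs, &it);
    /* ... walk the fetch contexts ... */
    isc_hashmap_iter_destroy(&it); /* now inside the locked section */

    RWUNLOCK(&resolver->fctxs_lock, isc_rwlocktype_read);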
(cherry picked from commit 1e4fb53c61)
A change in 6aba56ae8 (checking whether a rejected RRset was identical
to the data it would have replaced, so that we could still cache a
signature) inadvertently introduced cases where processing of a
response would continue when previously it would have been skipped.
(cherry picked from commit d0fd9cbe3b)
Previously, dns_resolver_dumpfetches() would iterate over the fetch
counters. Alas, because of an earlier optimization, the fetch counters
were only increased when fetches-per-zone was not 0; otherwise the
whole counting was skipped for performance reasons.
Instead of using the auxiliary fetch-counter hash table, use the real
hash table that stores the fetch contexts to dump the ongoing fetches
to the recursing file.
Additionally, print more information about each fetch context, such as
the start and expiry times, the number of fetch responses, the number
of queries, and the counts of allowed and dropped fetches.
(cherry picked from commit c6b0368b21)
The order in which the fetch context hash table rwlock and the
individual fetch context lock were taken was reversed when calling the
release_fctx() function. This was causing a problem when iterating the
hash table, so the ordering has been corrected: the hash table rwlock
is now always the outer lock and the fctx lock is the inner lock.
(cherry picked from commit cf078fadeb)
In the next commit, we need to know whether the timer has been started
or stopped. Add an isc_timer_running() function that returns true if
the timer has been started.
(cherry picked from commit b9e3cd5d2a)
After a reconfiguration, the old view can be left without a valid
'rpzs' member, because when the RPZ configuration is unchanged during
the named reconfiguration the 'rpzs' "migrates" from the old view into
the new view. A resuming query can then find that 'qctx->view->rpzs'
is NULL, which query_resume() currently doesn't expect to happen when
it's recursing and 'qctx->rpz_st' is not NULL.
Fix the issue by adding a NULL check. To avoid splitting the log
message into two different messages depending on whether
'qctx->view->rpzs' is NULL or not, change the message to not log
the RPZ policy's "version", which is just a runtime counter and is
most likely not very useful to users.
(cherry picked from commit 3ea2fbc238)
Checking whether the authority section is properly signed should
be left to the validator. Checking in getsection() (dns_message_parse)
was far too early and resulted in resolution failures for lookups
that should otherwise have succeeded.
(cherry picked from commit 83159d0a54)
Previously, a hard-coded limit of at most two key or message
verification checks was introduced when checking a message's SIG(0)
signature, in order to protect against possible DoS attacks. The
logic behind choosing the number two was that more than one key
should only be required during key rotations, and in that case two
keys are enough. But it later became apparent that there are other
use cases where even more keys are required; see GitLab issue #5050.
This change introduces two new view configuration options,
sig0key-checks-limit and sig0message-checks-limit, which define how
many keys are allowed to be checked to find a matching key, and how
many message verifications are allowed to take place once a matching
key has been found. The latter protects against expensive cryptographic
operations when there are keys with colliding tags and algorithm
numbers, and defaults to 2; the former protects against somewhat less
expensive key parsing operations and defaults to 16.
(cherry picked from commit 716b936045)
Running jobs that were entered into the isc_quota queue is the
responsibility of the isc_quota_release() function, which, when
releasing a previously acquired quota, checks whether the queue is
empty and, if it is not, runs a job from the queue without touching
the 'quota->used' counter. This mechanism is susceptible to a hangup
of a newly queued job when, between the time the decision is made to
queue it (because used >= max) and the time it is actually queued,
the last quota is released. Since there are no more quotas to be
released (unless one arrives in the future), the newly queued job is
stuck in the queue.
Fix the wrong memory ordering for 'quota->used', as relaxed ordering
does not ensure that data modifications made by one thread are
visible to other threads.
Add checks in both isc_quota_release() and isc_quota_acquire_cb()
to make sure that the described hangup cannot happen. Also see the
code comments.
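A much-simplified sketch of the recheck pattern; this is not the
actual isc_quota code, and the helper functions are invented for
illustration only:

    #include <stdatomic.h>
    #include <stdbool.h>
    #include <stdint.h>

    typedef struct quota {
            atomic_uint_fast32_t used;
            uint_fast32_t max;
            /* a separately protected queue of waiters lives here */
    } quota_t;

    /* Hypothetical stand-ins for the real queue operations. */
    static void enqueue_waiter(quota_t *q, void *waiter) { (void)q; (void)waiter; }
    static void run_one_queued_waiter(quota_t *q) { (void)q; }

    /*
     * If the quota looks full, enqueue the waiter and then re-check the
     * counter: a release may have happened in between, and that release
     * could not have seen the freshly queued waiter, so the new waiter
     * would otherwise be stuck in the queue forever.
     */
    static bool
    quota_acquire_or_queue(quota_t *q, void *waiter) {
            uint_fast32_t used = atomic_fetch_add_explicit(
                    &q->used, 1, memory_order_acq_rel);
            if (used < q->max) {
                    return true; /* got a slot right away */
            }
            atomic_fetch_sub_explicit(&q->used, 1, memory_order_acq_rel);
            enqueue_waiter(q, waiter);
            if (atomic_load_explicit(&q->used, memory_order_acquire) <
                q->max) {
                    run_one_queued_waiter(q); /* close the race window */
            }
            return false;
    }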
(cherry picked from commit c6529891bb)
Expose the average transfer rate (in bytes-per-second) during the
last full 'min-transfer-rate-in <bytes> <minutes>' minutes interval.
If no such interval has passed yet, then the overall average rate is
reported instead.
(cherry picked from commit c701b590e4)
This new option sets a minimum transfer rate for an incoming zone
transfer; if the transfer runs slower than that (for example, for
network-related reasons), it is aborted.
(cherry picked from commit 91ea156203)
Add a new dns_rdataset_equals() function to check whether two
rdatasets are equal in DNSSEC terms.
When an rdataset being cached is rejected because its trust
level is lower than the existing rdataset, we now check to see
whether the rejected data was identical to the existing data.
This allows us to cache a potentially useful RRSIG when handling
CD=1 queries, while still rejecting RRSIGs that would definitely
have resulted in a validation failure.
(cherry picked from commit 6aba56ae89)
The value returned by http_send_outgoing() is not used anywhere, so
make it return nothing (void). The return value is probably a leftover
from older times.
(cherry picked from commit 2adabe835a)
When handling outgoing data, there were a couple of rarely executed
code paths that did not take into account that the callback MUST be
called.
This could lead to memory leaks and consequent shutdown hangs.
(cherry picked from commit 8b8f4d500d)
This commit changes the way how the number of active HTTP streams is
calculated and allows it to scale with the values of the maximum
amount of streams per connection, instead of effectively capping at
STREAM_CLIENTS_PER_CONN.
The original limit is intended to define the pipelining limit for
TCP/DoT. However, it turned out to be too restrictive for DoH, which
works quite differently and implements pipelining at the protocol
level by means of multiplexing multiple streams. That effectively
makes each stream a separate connection from the point of view of the
rest of the codebase.
(cherry picked from commit a22bc2d7d4)
Previously, we would limit the amount of incoming data to process
based solely on the presence of uncompleted send requests. That
worked; however, it was found to severely degrade performance in
certain cases, as was revealed during extended testing.
Now we switch to keeping track of how much data is in flight (or
ready to be in flight) and limit the amount of processed incoming
data when the amount of in-flight data surpasses a given threshold,
similarly to what we do in other transports.
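A self-contained sketch of the new limiting condition; the struct,
helpers, and threshold value are illustrative, not the actual BIND 9
code:

    #include <stdbool.h>
    #include <stddef.h>

    /* Illustrative threshold; the real value is transport-specific. */
    #define INFLIGHT_LIMIT (64 * 1024)

    typedef struct conn {
            size_t bytes_in_flight; /* sent or queued, not yet completed */
    } conn_t;

    static void
    send_started(conn_t *c, size_t len) {
            c->bytes_in_flight += len;
    }

    static void
    send_completed(conn_t *c, size_t len) {
            c->bytes_in_flight -= len;
    }

    /*
     * Pause reading only once the amount of in-flight data passes the
     * threshold, instead of whenever any send is outstanding.
     */
    static bool
    should_pause_reading(const conn_t *c) {
            return c->bytes_in_flight >= INFLIGHT_LIMIT;
    }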
(cherry picked from commit 05e8a50818)
In some places there were checks for failures of dns_qp_insert()
after dns_qp_getname(). Such failures could only happen if another
thread inserted a node between the two calls, and that can't happen
because the calls are serialized with dns_qpmulti_write(). We can
simplify the code and just add an INSIST.
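A minimal sketch of the resulting pattern; the variable names, helper,
and exact argument lists are assumptions:

    /*
     * Inside a dns_qpmulti_write() transaction no other writer can race
     * with us, so an insert after a failed lookup cannot fail.
     */
    result = dns_qp_getname(qp, name, (void **)&node, NULL);
    if (result != ISC_R_SUCCESS) {
            node = new_qpnode(name); /* hypothetical helper */
            result = dns_qp_insert(qp, node, 0);
            INSIST(result == ISC_R_SUCCESS);
    }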
(cherry picked from commit fffa150df3)
If a deferred validation on data that was originally queried with
CD=1 fails, we now repeat the query, since the zone data may have
changed in the meantime.
(cherry picked from commit 04b1484ed8)
When a query is made with CD=1, we store the result in the
cache marked pending so that it can be validated later, at
which time it will either be accepted as an answer or removed
from the cache as invalid. Deferred validation was not
attempted when there were no cached RRSIGs for DNSKEY and
DS. We now complete the deferred validation in this scenario.
(cherry picked from commit 8b900d1808)
prio_type was being used in the wrong place to optimize cname_and_other.
We have to first exclude the accepted types, and we also have to
determine that the record exists, before we can check whether we are
at a point where a later CNAME cannot appear.
(cherry picked from commit 5e49a9e4ae)
Instead of using a hash of the name modulo the number of buckets,
assign the locknum randomly with isc_random_uniform(). This makes
the locknum assignment aligned with qpcache and allows the bucket
number to be non-prime in the future.
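A minimal sketch of the assignment; the member names are assumptions,
while isc_random_uniform() is the existing ISC helper that returns a
uniformly distributed value below its argument:

    #include <isc/random.h>

    /* Pick the node's lock bucket at random instead of hashing the
     * owner name modulo the number of buckets. */
    node->locknum = (uint16_t)isc_random_uniform(qpdb->buckets_count);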
(cherry picked from commit 732fc338a9)
Reduce the number of qpzone_ref() and qpzone_unref() calls in
qpzone_detachnode() by relying on call_rcu to delay the destruction
of the lock buckets.
(cherry picked from commit 1fa5219fdf)
Instead of having many node_lock_count * sizeof(<member>) arrays, pack
all the members into a qpzone_bucket_t that is cacheline aligned and have
a single array of those.
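A sketch of the consolidated layout; the member names are illustrative
and the alignment uses C11 alignas:

    #include <stdalign.h>
    #include <isc/rwlock.h>

    /*
     * One cacheline-aligned bucket holding everything that used to live
     * in separate node_lock_count-sized parallel arrays.
     */
    typedef struct qpzone_bucket {
            alignas(64) isc_rwlock_t lock; /* per-bucket node lock */
            /* ... other per-bucket members packed alongside ... */
    } qpzone_bucket_t;

    /* A single heap-allocated array of these, one element per locknum,
     * replaces the many parallel arrays. */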
(cherry picked from commit 6dcc398726)
Instead of having many node_lock_count * sizeof(<member>) arrays, pack
all the members into a qpcache_bucket_t struct that is cacheline aligned
and have a single array of those.
Additionally, make both the head and the tail of isc_queue_t padded,
not just the head, to prevent false sharing of the lock-free structure
with the lock that follows it.
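A sketch of the padding idea; the types are illustrative and much
simpler than the real isc_queue_t:

    #include <stdalign.h>
    #include <stdatomic.h>

    /*
     * Pad *both* ends of the lock-free queue onto their own cachelines,
     * so the member that follows the queue in an enclosing struct (e.g.
     * a lock) never shares a line with the tail.
     */
    typedef struct padded_queue {
            alignas(64) atomic_uintptr_t head; /* own cacheline */
            alignas(64) atomic_uintptr_t tail; /* own cacheline */
    } padded_queue_t;
    /* sizeof(padded_queue_t) is a multiple of 64, so a following member
     * in an enclosing struct starts on a fresh cacheline as well. */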
(cherry picked from commit c602d76c1f)
In #1870, the expiration time of ANCIENT records was printed, but
ancient records are actually very short-lived, and that information
carries little value.
Instead of printing the expiration time of ANCIENT records, print the
expiration time of STALE records.
(cherry picked from commit 355fc48472)
When the mark_ancient() helper function was introduced, a couple of
places with duplicate (or almost duplicate) code were missed. Move the
mark_ancient() function closer to the top of the file, and use it
correctly in the places that mark the header as ANCIENT.
(cherry picked from commit 58179e6a19)
If we know that the header has ZEROTTL set, the server should never
send stale records for it and the TTL should never be anything other
than 0. The comment was already there, but the code did not match the
comment.
(cherry picked from commit cfee6aa565)
When the header has been marked as ANCIENT but the TTL hasn't been
reset (this happens in a couple of places), the rdataset TTL would be
set to the header timestamp instead of a reasonable TTL value.
Since such a header has already expired (ANCIENT is set), set the
rdataset TTL to 0 and don't reuse this field to print the expiration
time when dumping the cache. Instead of printing the time, we now
just print 'expired (awaiting cleanup)'.
(cherry picked from commit 1bbb57f81b)
The search for the deepest known zone cut in the cache could
improperly reject a node containing stale data, even if the
NS rdataset wasn't the data that was stale.
This change also improves the efficiency of the search by
stopping it when both NS and RRSIG(NS) have been found.
(cherry picked from commit 1f095b902c)
Change the names of the node reference counting functions
and add comments to make the mechanism easier to understand:
- dns__rbtdb_newref() and dns__rbtdb_decref() are now called
dns__rbtnode_acquire() and dns__rbtnode_release()
respectively; this reflects the fact that they modify both
the internal and external reference counters for a node.
- rbtnode_newref() and rbtnode_decref() are now called
rbtnode_erefs_increment() and rbtnode_erefs_decrement(),
to reflect that they only increase and decrease the node's
external reference counters, not internal.
Refactor the newref() and decref() functions in rbtdb.c so that they
follow the similar pattern we already have for QPDB.
Change the names of the node reference counting functions
and add comments to make the mechanism easier to understand:
- newref() and decref() are now called qpcnode_acquire()/
qpznode_acquire() and qpcnode_release()/qpznode_release()
respectively; this reflects the fact that they modify both
the internal and external reference counters for a node.
- qpcnode_newref() and qpznode_newref() are now called
qpcnode_erefs_increment() and qpznode_erefs_increment(), and
qpcnode_decref() and qpznode_decref() are now called
qpcnode_erefs_decrement() and qpznode_erefs_decrement(),
to reflect that they only increase and decrease the node's
external reference counters, not internal.
(cherry picked from commit d4f791793e)
This removes the db_nodelock_t structure and changes the node_locks
array to be composed only of isc_rwlock_t pointers. The .reference
member has been moved to qpdb->references in addition to
common.references that's external to dns_db API users. The .exiting
member has been completely removed, as it has no use when the
reference counting is used correctly.
(cherry picked from commit 431513d8b3)