bind9

Author	SHA1	Message	Date
Ondřej Surý	506138ec0f	Fix the assertion failure when putting 48-bit number to buffer When putting the 48-bit number into a fixed-size buffer that's exactly 6 bytes, the assertion failure would occur as the 48-bit number is internally represented as 64-bit number and the code was checking if there is enough space for `sizeof(val)`. This causes assertion failure when otherwise valid TSIG signature has a bad timing information. Specify the size of the argument explicitly, so the 48-bit number doesn't require 8-byte long buffer. (cherry picked from commit `37dbd57c16`)	2024-08-05 11:11:40 +00:00
Ondřej Surý	36265057f9	Don't open route socket if we don't need it When automatic-interface-scan is disabled, the route socket was still being opened. Add new API to connect / disconnect from the route socket only as needed. Additionally, move the block that disables periodic interface rescans to a place where it actually have access to the configuration values. Previously, the values were being checked before the configuration was loaded. (cherry picked from commit `b26079fdaf`)	2024-08-05 10:17:08 +00:00
Ondřej Surý	6df4fa58fd	Don't skip the counting if fcount_incr() is called with force==true The fcount_incr() was incorrectly skipping the accounting for the fetches-per-zone if the force argument was set to true. We want to skip the accounting only when the fetches-per-zone is completely disabled, but for individual names we need to do the accounting even if we are forcing the result to be success. (cherry picked from commit `a513d4c07f`)	2024-08-05 09:40:25 +00:00
Ondřej Surý	80738e98bd	Fix PTHREAD_MUTEX_ADAPTIVE_NP and PTHREAD_MUTEX_ERRORCHECK_NP usage The PTHREAD_MUTEX_ADAPTIVE_NP and PTHREAD_MUTEX_ERRORCHECK_NP are usually not defines, but enum values, so simple preprocessor check doesn't work. Check for PTHREAD_MUTEX_ADAPTIVE_NP from the autoconf AS_COMPILE_IFELSE block and define HAVE_PTHREAD_MUTEX_ADAPTIVE_NP. This should enable adaptive mutex on Linux and FreeBSD. As PTHREAD_MUTEX_ERRORCHECK actually comes from POSIX and Linux glibc does define it when compatibility macros are being set, we can just use PTHREAD_MUTEX_ERRORCHECK instead of PTHREAD_MUTEX_ERRORCHECK_NP. (cherry picked from commit `cc4f99bc6d`)	2024-08-05 09:13:07 +00:00
Ondřej Surý	5d76ef21f0	Remove ISC_MUTEX_INITIALIZER It's hard to get it right on different platforms and it's unused in BIND 9 anyway. (cherry picked from commit `f158884344`)	2024-08-05 09:13:07 +00:00
Mark Andrews	7b238ba36a	Remove unnecessary operations Decrementing optlen immediately before calling continue is unneccesary and inconsistent with the rest of dns_message_pseudosectiontoyaml and dns_message_pseudosectiontotext. Coverity was also reporting an impossible false positive overflow of optlen (CID 499061). 4176 } else if (optcode == DNS_OPT_CLIENT_TAG) { 4177 uint16_t id; 4178 ADD_STRING(target, "; CLIENT-TAG:"); 4179 if (optlen == 2U) { 4180 id = isc_buffer_getuint16(&optbuf); 4181 snprintf(buf, sizeof(buf), " %u\n", id); 4182 ADD_STRING(target, buf); CID 499061: (#1 of 1): Overflowed constant (INTEGER_OVERFLOW) overflow_const: Expression optlen, which is equal to 65534, underflows the type that receives it, an unsigned integer 16 bits wide. 4183 optlen -= 2; 4184 POST(optlen); 4185 continue; 4186 } 4187 } else if (optcode == DNS_OPT_SERVER_TAG) { (cherry picked from commit `47338c2c87`)	2024-08-02 06:42:59 +00:00
Aram Sargsyan	003cb48f0f	Allow shorter resolver-query-timeout configuration There are use cases for which shorter timeout values make sense. For example if there is a load balancer which sets RD=1 and forwards queries to a BIND resolver which is then configured to talk to backend servers which are not visible in the public NS set. WIth a shorter timeout value the frontend can give back SERVFAIL early when backends are not available and the ultimate client will not penalize the BIND-frontend for non-response. (cherry picked from commit `5f47c2b567`)	2024-08-01 19:23:06 +00:00
Aram Sargsyan	aa6e388591	Rename dns_zone_forcereload() to dns_zone_forcexfr() The new name describes the function more accurately. (cherry picked from commit `63b8a75de9`)	2024-08-01 17:04:27 +00:00
Aram Sargsyan	c76d083d4d	Make dns_xfrin_shutdown() safe to run from a different loop If the current loop is different than the zone transfer's loop then run the shutdown operation asynchronously. (cherry picked from commit `3d1179501a`)	2024-08-01 17:04:27 +00:00
Aram Sargsyan	2886a08c1f	Implement rndc retransfer -force With this new optional argument if there is an ongoing zone transfer it will be aborted before a new zone transfer is scheduled. (cherry picked from commit `402ca316ae`)	2024-08-01 17:04:27 +00:00
Aram Sargsyan	b6721db490	Do not automatically restart a canceled zone transfer If a zone transfer is canceled there is no need to try the next primary or retry with AXFR. (cherry picked from commit `b156531b29`)	2024-08-01 17:04:27 +00:00
Mark Andrews	fc350f311a	Add missing period to generated IPv4 6to4 name The period between the most significant nibble of the IPv4 address and the 2.0.0.2.IP6.ARPA suffix was missing resulting in the wrong name being checked. (cherry picked from commit `bca63437a1`)	2024-08-01 06:41:47 +00:00
Mark Andrews	eea196fc78	Cleanup old clang-format string splitting (cherry picked from commit `6d1c7beb15`)	2024-08-01 05:24:00 +00:00
Mark Andrews	2789554dec	Remove false positive qname minimisation error Don't report qname minimisation NXDOMAIN errors when the result is NXDOMAIN. (cherry picked from commit `f78beca942`)	2024-08-01 05:24:00 +00:00
Mark Andrews	9fe19ffafa	Fix yaml output In yaml mode we emit a string for each question and record. Certain names and data could result in invalid yaml being produced. Use single quote string for all questions and records. This requires that single quotes get converted to two quotes within the string. (cherry picked from commit `393d7fa78e`)	2024-08-01 03:48:08 +00:00
Mark Andrews	b4fdd2f0df	Properly reject zero length ALPN in commatxt_fromtext ALPN are defined as 1*255OCTET in RFC 9460. commatxt_fromtext was not rejecting invalid inputs produces by missing a level of escaping which where later caught be dns_rdata_fromwire on reception. These inputs should have been rejected svcb in svcb 1 1.svcb alpn=\,abc svcb1 in svcb 1 1.svcb alpn=a\,\,abc and generated 00 03 61 62 63 and 01 61 00 02 61 62 63 respectively. The correct inputs to include commas in the alpn requires double escaping. svcb in svcb 1 1.svcb alpn=\\,abc svcb1 in svcb 1 1.svcb alpn=a\\,\\,abc and generate 04 2C 61 62 63 and 06 61 2C 2C 61 62 63 respectively. (cherry picked from commit `b51c9eb797`)	2024-08-01 01:10:48 +00:00
Aram Sargsyan	adddcde263	Replace #define DNS_GETDB_ with struct of bools This makes it easier to pretty-print the attributes in a debugger. (cherry picked from commit `cb5238cc62`)	2024-07-31 16:28:29 +00:00
Aram Sargsyan	2b3ce5e514	Return SERVFAIL for a too long CNAME chain Due to the maximum query restart limitation a long CNAME chain it is cut after 16 queries but named still returns NOERROR. Return SERVFAIL instead and the partial answer. (cherry picked from commit `b621f1d88e`)	2024-07-31 11:55:35 +00:00
Mark Andrews	fbcdfefd2d	Properly compute the physical memory size On a 32 bit machine casting to size_t can still lead to an overflow. Cast to uint64_t. Also detect all possible negative values for pages and pagesize to silence warning about possible negative value. 39#if defined(_SC_PHYS_PAGES) && defined(_SC_PAGESIZE) 1. tainted_data_return: Called function sysconf(_SC_PHYS_PAGES), and a possible return value may be less than zero. 2. assign: Assigning: pages = sysconf(_SC_PHYS_PAGES). 40 long pages = sysconf(_SC_PHYS_PAGES); 41 long pagesize = sysconf(_SC_PAGESIZE); 42 3. Condition pages == -1, taking false branch. 4. Condition pagesize == -1, taking false branch. 43 if (pages == -1 \|\| pagesize == -1) { 44 return (0); 45 } 46 5. overflow: The expression (size_t)pages * pagesize might be negative, but is used in a context that treats it as unsigned. CID 498034: (#1 of 1): Overflowed return value (INTEGER_OVERFLOW) 6. return_overflow: (size_t)pages * pagesize, which might have underflowed, is returned from the function. 47 return ((size_t)pages * pagesize); 48#endif /* if defined(_SC_PHYS_PAGES) && defined(_SC_PAGESIZE) */ (cherry picked from commit `e8dbc5db92`)	2024-07-31 07:30:35 +00:00
Mark Andrews	450aa90309	Do not update find.result_v4 and find.result_v6 These values are supposed to be static for the life of the find and clean_finds_at_name was updating them resulting in TSAN error reports. WARNING: ThreadSanitizer: data race Write of size 4 at 0x000000000001 by thread T1 (mutexes: write M1, write M2): #0 clean_finds_at_name lib/dns/adb.c:1537 #1 fetch_callback lib/dns/adb.c:4009 #2 task_run lib/isc/task.c:815 #3 isc_task_run lib/isc/task.c:896 #4 isc__nm_async_task netmgr/netmgr.c:848 #5 process_netievent netmgr/netmgr.c:920 #6 process_queue netmgr/netmgr.c:1013 #7 process_all_queues netmgr/netmgr.c:767 #8 async_cb netmgr/netmgr.c:796 #9 uv__async_io /usr/src/libuv-v1.44.1/src/unix/async.c:163 #10 isc__trampoline_run lib/isc/trampoline.c:189 Previous read of size 4 at 0x000000000001 by thread T2: #0 findname lib/dns/resolver.c:3749 #1 fctx_getaddresses lib/dns/resolver.c:3993 #2 fctx_try lib/dns/resolver.c:4390 #3 rctx_nextserver lib/dns/resolver.c:10356 #4 rctx_done lib/dns/resolver.c:10503 #5 resquery_response lib/dns/resolver.c:8511 #6 udp_recv lib/dns/dispatch.c:638 #7 isc__nm_async_readcb netmgr/netmgr.c:2885 #8 isc__nm_readcb netmgr/netmgr.c:2858 #9 udp_recv_cb netmgr/udp.c:650 #10 isc__nm_udp_read_cb netmgr/udp.c:1057 #11 uv__udp_recvmsg /usr/src/libuv-v1.44.1/src/unix/udp.c:303 #12 isc__trampoline_run lib/isc/trampoline.c:189 (cherry picked from commit `53a5f50e9d`)	2024-07-31 06:20:45 +00:00
Mark Andrews	45e49640ce	Log key calculation overflows (cherry picked from commit `14a76ae498`)	2024-07-30 10:22:48 +00:00
Mark Andrews	edb1df856e	Check for overflow when adding lifetime (cherry picked from commit `25845a866e`)	2024-07-30 10:22:48 +00:00
Matthijs Mekking	b489e267d4	No longer update key lifetime if key is retired The key lifetime should no longer be adjusted if the key is being retired earlier, for example because a manual rollover was started. This would falsely be seen as a dnssec-policy lifetime reconfiguration, and would adjust the retire/removed time again. This also means we should update the status output, and the next rollover scheduled is now calculated using (retire-active) instead of key lifetime. (cherry picked from commit `129973ebb0`)	2024-07-30 10:22:48 +00:00
Matthijs Mekking	bfb29acc7f	Update key lifetime and metadata after reconfig If dnssec-policy is reconfigured and the key lifetime has changed, update existing keys with the new lifetime and adjust the retire and removed timing metadata accordingly. If the key has no lifetime yet, just initialize the lifetime. It may be that the retire/removed timing metadata has already been set. Skip keys which goal is not set to omnipresent. These keys are already in the progress of retiring, or still unused. (cherry picked from commit `1cec0b0448`)	2024-07-30 10:22:48 +00:00
Artem Boldariev	5781ff3a93	Drop expired but not accepted TCP connections This commit ensures that we are not attempting to accept an expired TCP connection as we are not interested in any data that could have been accumulated in its internal buffers. Now we just drop them for good.	2024-07-03 15:03:02 +03:00
Ondřej Surý	bf9fd2a6ff	Reset the TCP connection on a failed send When sending fails, the ns__client_request() would not reset the connection and continue as nothing is happening. This comes from the model that we don't care about failed UDP sends because datagrams are unreliable anyway, but it greatly affects TCP connections with keep-alive. The worst case scenario is as follows: 1. the 3-way TCP handshake gets completed 2. the libuv calls the "uv_connection_cb" callback 3. the TCP connection gets queue because of the tcp-clients quota 4. the TCP client sends as many DNS messages as the buffers allow 5. the TCP connection gets dropped by the client due to the timeout 6. the TCP connection gets accepted by the server 7. the data already sent by the client gets read 8. all sending fails immediately because the TCP connection is dead 9. we consume all the data in the buffer in a very tight loop As it doesn't make sense to trying to process more data on the TCP connection when the sending is failing, drop the connection immediately on the first sending error.	2024-07-03 09:07:20 +02:00
Ondřej Surý	1c0564d715	Remove ns_query_init() cannot fail, remove the error paths As ns_query_init() cannot fail now, remove the error paths, especially in ns__client_setup() where we now don't have to care what to do with the connection if setting up the client could fail. It couldn't fail even before, but now it's formal.	2024-07-03 09:05:51 +02:00
Ondřej Surý	bc3e713317	Throttle the reading when writes are asynchronous Be more aggressive when throttling the reading - when we can't send the outgoing TCP synchronously with uv_try_write(), we start throttling the reading immediately instead of waiting for the send buffers to fill up. This should not affect behaved clients that read the data from the TCP on the other end.	2024-07-03 08:45:39 +02:00
Ondřej Surý	57cd34441a	Be smarter about refusing to add many RR types to the database Instead of outright refusing to add new RR types to the cache, be a bit smarter: 1. If the new header type is in our priority list, we always add either positive or negative entry at the beginning of the list. 2. If the new header type is negative entry, and we are over the limit, we mark it as ancient immediately, so it gets evicted from the cache as soon as possible. 3. Otherwise add the new header after the priority headers (or at the head of the list). 4. If we are over the limit, evict the last entry on the normal header list.	2024-07-01 12:48:51 +02:00
Ondřej Surý	b27c6bcce8	Expand the list of the priority types and move it to db_p.h Add HTTPS, SVCB, SRV, PTR, NAPTR, DNSKEY and TXT records to the list of the priority types that are put at the beginning of the slabheader list for faster access and to avoid eviction when there are more types than the max-types-per-name limit.	2024-07-01 12:47:30 +02:00
Artem Boldariev	55b1a093ea	Do not un-throttle TCP connections on isc_nm_read() Due to omission it was possible to un-throttle a TCP connection previously throttled due to the peer not reading back data we are sending. In particular, that affected DoH code, but it could also affect other transports (the current or future ones) that pause/resume reading according to its internal state.	2024-06-12 13:44:37 +03:00
Mark Andrews	e52c2a654b	Clear qctx->zversion Clear qctx->zversion when clearing qctx->zrdataset et al in lib/ns/query.c:qctx_freedata. The uncleared pointer could lead to an assertion failure if zone data needed to be re-saved which could happen with stale data support enabled.	2024-06-10 17:45:38 +02:00
Petr Špaček	9370acd3a7	Require local KEYs for SIG(0) verification This is additional hardening. There is no known use-case for KEY RRs from DNS cache and it potentially allows attackers to put weird keys into cache.	2024-06-10 17:36:45 +02:00
Aram Sargsyan	d69fab1530	Mark SIG(0) quota settings as experimantal A different solution in the future might be adopted depending on feedback and other new information, so it makes sense to mark these options as EXPERIMENTAL until we have more data.	2024-06-10 17:36:45 +02:00
Aram Sargsyan	54ddd848fe	Avoid running get_matching_view() asynchronously on an error path Also create a new ns_client_async_reset() static function to decrease code duplication.	2024-06-10 17:35:40 +02:00
Aram Sargsyan	7ca9bd6014	Limit the number of keys for SIG(0) message verification Check at most two KEY RRs agains a SIG(0) signature. This should limit potential abuse and at the same time allow key rollover.	2024-06-10 17:33:11 +02:00
Aram Sargsyan	70ff4a3f85	Run resolver message signature checking asynchronously	2024-06-10 17:33:11 +02:00
Aram Sargsyan	ad489c44df	Remove sig0checks-quota-maxwait-ms support Waiting for a quota to appear complicates things and wastes rosources on timer management. Just answer with REFUSE if there is no quota.	2024-06-10 17:33:11 +02:00
Aram Sargsyan	f0cde05e06	Implement asynchronous view matching for SIG(0)-signed queries View matching on an incoming query checks the query's signature, which can be a CPU-heavy task for a SIG(0)-signed message. Implement an asynchronous mode of the view matching function which uses the offloaded signature checking facilities, and use it for the incoming queries.	2024-06-10 17:33:10 +02:00
Aram Sargsyan	710bf9b938	Implement asynchronous message signature verification Add support for using the offload threadpool to perform message signature verifications. This should allow check SIG(0)-signed messages without affecting the worker threads.	2024-06-10 17:33:10 +02:00
Aram Sargsyan	7f013ad05d	Remove dns_message_rechecksig() This is a tiny helper function which is used only once and can be replaced with two function calls instead. Removing this makes supporting asynchronous signature checking less complicated.	2024-06-10 17:33:10 +02:00
Aram Sargsyan	c7f79a0353	Add a quota for SIG(0) signature checks In order to protect from a malicious DNS client that sends many queries with a SIG(0)-signed message, add a quota of simultaneously running SIG(0) checks. This protection can only help when named is using more than one worker threads. For example, if named is running with the '-n 4' option, and 'sig0checks-quota 2;' is used, then named will make sure to not use more than 2 workers for the SIG(0) signature checks in parallel, thus leaving the other workers to serve the remaining clients which do not use SIG(0)-signed messages. That limitation is going to change when SIG(0) signature checks are offloaded to "slow" threads in a future commit. The 'sig0checks-quota-exempt' ACL option can be used to exempt certain clients from the quota requirements using their IP or network addresses. The 'sig0checks-quota-maxwait-ms' option is used to define a maximum amount of time for named to wait for a quota to appear. If during that time no new quota becomes available, named will answer to the client with DNS_R_REFUSED.	2024-06-10 17:33:08 +02:00
Matthijs Mekking	c1ac8b6ad0	Log rekey failure as error if too many records By default we log a rekey failure on debug level. We should probably change the log level to error. We make an exception for when the zone is not loaded yet, it often happens at startup that a rekey is run before the zone is fully loaded.	2024-06-10 16:55:12 +02:00
Matthijs Mekking	82635e56d8	Log error when update fails The new "too many records" error can make an update fail without the error being logged. This commit fixes that.	2024-06-10 16:55:12 +02:00
Evan Hunt	7dd6b47ace	fix a memory leak that could occur when signing when signatures were not added because of too many types already existing at a node, the diff was not being cleaned up; this led to a memory leak being reported at shutdown.	2024-06-10 16:55:12 +02:00
Ondřej Surý	52b3d86ef0	Add a limit to the number of RR types for single name Previously, the number of RR types for a single owner name was limited only by the maximum number of the types (64k). As the data structure that holds the RR types for the database node is just a linked list, and there are places where we just walk through the whole list (again and again), adding a large number of RR types for a single owner named with would slow down processing of such name (database node). Add a configurable limit to cap the number of the RR types for a single owner. This is enforced at the database (rbtdb, qpzone, qpcache) level and configured with new max-types-per-name configuration option that can be configured globally, per-view and per-zone.	2024-06-10 16:55:09 +02:00
Ondřej Surý	32af7299eb	Add a limit to the number of RRs in RRSets Previously, the number of RRs in the RRSets were internally unlimited. As the data structure that holds the RRs is just a linked list, and there are places where we just walk through all of the RRs, adding an RRSet with huge number of RRs inside would slow down processing of said RRSets. Add a configurable limit to cap the number of the RRs in a single RRSet. This is enforced at the database (rbtdb, qpzone, qpcache) level and configured with new max-records-per-type configuration option that can be configured globally, per-view and per-zone.	2024-06-10 16:55:07 +02:00
Ondřej Surý	e28266bfbc	Remove the extra memory context with own arena for sending The changes in this MR prevent the memory used for sending the outgoing TCP requests to spike so much. That strictly remove the extra need for own memory context, and thus since we generally prefer simplicity, remove the extra memory context with own jemalloc arenas just for the outgoing send buffers.	2024-06-10 16:48:54 +02:00
Ondřej Surý	4c2ac25a95	Limit the number of DNS message processed from a single TCP read The single TCP read can create as much as 64k divided by the minimum size of the DNS message. This can clog the processing thread and trash the memory allocator because we need to do as much as ~20k allocations in a single UV loop tick. Limit the number of the DNS messages processed in a single UV loop tick to just single DNS message and limit the number of the outstanding DNS messages back to 23. This effectively limits the number of pipelined DNS messages to that number (this is the limit we already had before).	2024-06-10 16:48:54 +02:00
Ondřej Surý	452a2e6348	Replace the tcp_buffers memory pool with static per-loop buffer As a single thread can process only one TCP send at the time, we don't really need a memory pool for the TCP buffers, but it's enough to have a single per-loop (client manager) static buffer that's being used to assemble the DNS message and then it gets copied into own sending buffer. In the future, this should get optimized by exposing the uv_try API from the network manager, and first try to send the message directly and allocate the sending buffer only if we need to send the data asynchronously.	2024-06-10 16:48:53 +02:00

1 2 3 4 5 ...

15447 Commits