bind9

Author	SHA1	Message	Date
Michał Kępień	363902ce2c	Free resources when gss_accept_sec_context() fails Even if a call to gss_accept_sec_context() fails, it might still cause a GSS-API response token to be allocated and left for the caller to release. Make sure the token is released before an early return from dst_gssapi_acceptctx(). (cherry picked from commit `d954e152d9`)	2021-04-08 10:41:08 +02:00
Mark Andrews	7b93ff93d6	Rewrite managed-key journal immediately Both managed keys and regular zone journals need to be updated immediately when a recoverable error is discovered. (cherry picked from commit `0fbdf189c7`)	2021-04-07 21:29:07 +02:00
Mark Andrews	511ea2d3f3	Update dns_journal_compact() to handle bad transaction headers Previously, dns_journal_begin_transaction() could reserve the wrong amount of space. We now check that the transaction is internally consistent when upgrading / downgrading a journal and we also handle the bad transaction headers. (cherry picked from commit `83310ffd92`)	2021-04-07 21:29:06 +02:00
Mark Andrews	6da2e05df9	Compute transaction size based on journal/transaction type previously the code assumed that it was a new transaction. (cherry picked from commit `520509ac7e`)	2021-04-07 21:29:06 +02:00
Mark Andrews	d9ad7ccf2d	Use journal_write_xhdr() to write the dummy transaction header Instead of journal_write(), use correct format call journal_write_xhdr() to write the dummy transaction header which looks at j->header_ver1 to determine which transaction header to write instead of always writing a zero filled journal_rawxhdr_t header. (cherry picked from commit `5a6112ec8f`)	2021-04-07 21:29:06 +02:00
Diego Fronza	5d391f07c0	Resolve TSAN data race in zone_maintenance Fix race between zone_maintenance and dns_zone_notifyreceive functions, zone_maintenance was attempting to read a zone flag calling DNS_ZONE_FLAG(zone, flag) while dns_zone_notifyreceive was updating a flag in the same zone calling DNS_ZONE_SETFLAG(zone, ...). The code reading the flag in zone_maintenance was not protected by the zone's lock, to avoid a race the zone's lock is now being acquired before an attempt to read the zone flag is made.	2021-04-07 13:22:36 +00:00
Matthijs Mekking	194a72b3f1	If RPZ config'd, bail stale-answer-client-timeout When we are recursing, RPZ processing is not allowed. But when we are performing a lookup due to "stale-answer-client-timeout", we are still recursing. This effectively means that RPZ processing is disabled on such a lookup. In this case, bail the "stale-answer-client-timeout" lookup and wait for recursion to complete, as we we can't perform the RPZ rewrite rules reliably. (cherry picked from commit `3d3a6415f7`)	2021-04-02 13:29:27 +02:00
Matthijs Mekking	29bcd113ea	Rename "staleonly" The dboption DNS_DBFIND_STALEONLY caused confusion because it implies we are looking for stale data only and ignore any active RRsets in the cache. Rename it to DNS_DBFIND_STALETIMEOUT as it is more clear the option is related to a lookup due to "stale-answer-client-timeout". Rename other usages of "staleonly", instead use "lookup due to...". Also rename related function and variable names. (cherry picked from commit `839df94190`)	2021-04-02 13:29:17 +02:00
Matthijs Mekking	34dd6521b1	Restore the RECURSIONOK attribute after staleonly When doing a staleonly lookup we don't want to fallback to recursion. After all, there are obviously problems with recursion, otherwise we wouldn't do a staleonly lookup. When resuming from recursion however, we should restore the RECURSIONOK flag, allowing future required lookups for this client to recurse. (cherry picked from commit `3f81d79ffb`)	2021-04-02 13:29:09 +02:00
Matthijs Mekking	114dc7888a	Remove result exception on staleonly lookup When implementing "stale-answer-client-timeout", we decided that we should only return positive answers prematurely to clients. A negative response is not useful, and in that case it is better to wait for the recursion to complete. To do so, we check the result and if it is not ISC_R_SUCCESS, we decide that it is not good enough. However, there are more return codes that could lead to a positive answer (e.g. CNAME chains). This commit removes the exception and now uses the same logic that other stale lookups use to determine if we found a useful stale answer (stale_found == true). This means we can simplify two test cases in the serve-stale system test: nodata.example is no longer treated differently than data.example. (cherry picked from commit `aaed7f9d8c`)	2021-04-02 13:28:59 +02:00
Matthijs Mekking	06823aa255	Remove INSIST on NS_QUERYATTR_ANSWERED The NS_QUERYATTR_ANSWERED attribute is to prevent sending a response twice. Without the attribute, this may happen if a staleonly lookup found a useful answer and sends a response to the client, and later recursion ends and also tries to send a response. The attribute was also used to mask adding a duplicate RRset. This is considered harmful. When we created a response to the client with a stale only lookup (regardless if we actually have send the response), we should clear the rdatasets that were added during that lookup. Mark such rdatasets with the a new attribute, DNS_RDATASETATTR_STALE_ADDED. Set a query attribute NS_QUERYATTR_STALEOK if we may have added rdatasets during a stale only lookup. Before creating a response on a normal lookup, check if we can expect rdatasets to have been added during a staleonly lookup. If so, clear the rdatasets from the message with the attribute DNS_RDATASETATTR_STALE_ADDED set. (cherry picked from commit `3d5429f61f`)	2021-04-02 13:28:08 +02:00
Matthijs Mekking	33d61b9651	Simplify when to detach the client With stale-answer-client-timeout, we may send a response to the client, but we may want to hold on to the network manager handle, because recursion is going on in the background, or we need to refresh a stale RRset. Simplify the setting of 'nodetach': * During a staleonly lookup we should not detach the nmhandle, so just set it prior to 'query_lookup()'. * During a staleonly "stalefirst" lookup set the 'nodetach' to true if we are going to refresh the RRset. Now there is no longer the need to clear the 'nodetach' if we go through the "dbfind_stale", "stale_refresh_window", or "stale_only" paths. (cherry picked from commit `48b0dc159b`)	2021-04-02 13:28:01 +02:00
Matthijs Mekking	b1496d19d5	Refactor stale lookups, ignore active RRsets When doing a staleonly lookup, ignore active RRsets from cache. If we don't, we may add a duplicate RRset to the message, and hit an assertion failure in query.c because adding the duplicate RRset to the ANSWER section failed. This can happen on a race condition. When a client query is received, the recursion is started. When 'stale-answer-client-timeout' triggers around the same time the recursion completes, the following sequence of events may happen: 1. Queue the "try stale" fetch_callback() event to the client task. 2. Add the RRsets from the authoritative response to the cache. 3. Queue the "fetch complete" fetch_callback() event to the client task. 4. Execute the "try stale" fetch_callback(), which retrieves the just-inserted RRset from the database. 5. In "ns_query_done()" we are still recursing, but the "staleonly" query attribute has already been cleared. In other words, the query will resume when recursion ends (it already has ended but is still on the task queue). 6. Execute the "fetch complete" fetch_callback(). It finds the answer from recursion in the cache again and tries to add the duplicate to the answer section. This commit changes the logic for finding stale answers in the cache, such that on "stale_only" lookups actually only stale RRsets are considered. It refactors the code so that code paths for "dbfind_stale", "stale_refresh_window", and "stale_only" are more clear. First we call some generic code that applies in all three cases, formatting the domain name for logging purposes, increment the trystale stats, and check if we actually found stale data that we can use. The "dbfind_stale" lookup will return SERVFAIL if we didn't found a usable answer, otherwise we will continue with the lookup (query_gotanswer()). This is no different as before the introduction of "stale-answer-client-timeout" and "stale-refresh-time". The "stale_refresh_window" lookup is similar to the "dbfind_stale" lookup: return SERVFAIL if we didn't found a usable answer, otherwise continue with the lookup (query_gotanswer()). Finally the "stale_only" lookup. If the "stale_only" lookup was triggered because of an actual client timeout (stale-answer-client-timeout > 0), and if database lookup returned a stale usable RRset, trigger a response to the client. Otherwise return and wait until the recursion completes (or the resolver query times out). If the "stale_only" lookup is a "stale-anwer-client-timeout 0" lookup, preferring stale data over a lookup. In this case if there was no stale data, or the data was not a positive answer, retry the lookup with the stale options cleared, a.k.a. a normal lookup. Otherwise, continue with the lookup (query_gotanswer()) and refresh the stale RRset. This will trigger a response to the client, but will not detach the handle because a fetch will be created to refresh the RRset. (cherry picked from commit `92f7a67892`)	2021-04-02 13:27:52 +02:00
Matthijs Mekking	fcf8fb4f39	Keep track of allow client detach The stale-answer-client-timeout feature introduced a dependancy on when a client may be detached from the handle. The dboption DNS_DBFIND_STALEONLY was reused to track this attribute. This overloads the meaning of this database option, and actually introduced a bug because the option was checked in other places. In particular, in 'ns_query_done()' there is a check for 'RECURSING(qctx->client) && (!QUERY_STALEONLY(&qctx->client->query) \|\| ...' and the condition is satisfied because recursion has not completed yet and DNS_DBFIND_STALEONLY is already cleared by that time (in query_lookup()), because we found a useful answer and we should detach the client from the handle after sending the response. Add a new boolean to the client structure to keep track of client detach from handle is allowed or not. It is only disallowed if we are in a staleonly lookup and we didn't found a useful answer. (cherry picked from commit `fee164243f`)	2021-04-02 13:27:43 +02:00
Ondřej Surý	565a6a5679	Move the dummy shims to single ifndef GSSAPI block Previously, every function had it's own #ifdef GSSAPI #else #endif block that defined shim function in case GSSAPI was not being used. Now the dummy shim functions have be split out into a single #else #endif block at the end of the file. This makes the gssapictx.c similar to 9.17.x code, making the backports and reviews easier.	2021-04-01 10:42:32 +02:00
Mark Andrews	3fd30e1634	Add Heimdal compatibility support The Heimdal Kerberos library handles the OID sets in a different manner. Unify the handling of the OID sets between MIT and Heimdal implementations by dynamically creating the OID sets instead of using static predefined set. This is how upstream recommends to handle the OID sets.	2021-04-01 10:42:32 +02:00
Mark Andrews	a875dcc669	Remove custom ISC SPNEGO implementation The custom ISC SPNEGO mechanism implementation is no longer needed on the basis that all major Kerberos 5/GSSAPI (mit-krb5, heimdal and Windows) implementations support SPNEGO mechanism since 2006. This commit removes the custom ISC SPNEGO implementation, and removes the option from both autoconf and win32 Configure script. Unknown options are being ignored, so this doesn't require any special handling.	2021-04-01 10:42:32 +02:00
Ondřej Surý	ee7283b3ee	Merge branch 'bind-dyndb-ldap-v9.16.13' into 'main' Do not require config.h to use isc/util.h See merge request isc-projects/bind9!4840 (cherry picked from commit `19b69e9a3b`) `81eb3396` Do not require config.h to use isc/util.h	2021-03-26 18:48:06 +00:00
Matthijs Mekking	1f8c5786f8	Delete CDS/CDNSKEY records when zone is unsigned CDS/CDNSKEY DELETE records are only useful if they are signed, otherwise the parent cannot verify these RRsets anyway. So once the DS has been removed (and signaled to BIND), we can remove the DNSKEY and RRSIG records, and at this point we can also remove the CDS/CDNSKEY records. (cherry picked from commit `6f31f62d69`)	2021-03-22 13:57:10 +01:00
Matthijs Mekking	7882c7fbea	Allow CDS/CDNSKEY DELETE records in unsigned zone While not useful, having a CDS/CDNSKEY DELETE record in an unsigned zone is not an error and "named-checkzone" should not complain. (cherry picked from commit `f211c7c2a1`)	2021-03-22 13:31:02 +01:00
Matthijs Mekking	b81502f4ae	Fix keymgr key init bug The 'keymgr_key_init()' function initializes key states if they have not been set previously. It looks at the key timing metadata and determines using the given times whether a state should be set to RUMOURED or OMNIPRESENT. However, the DNSKEY and ZRRSIG states were mixed up: When looking at the Activate timing metadata we should set the ZRRSIG state, and when looking at the Published timing metadata we should set the DNSKEY state. (cherry picked from commit `27e7d5f698`)	2021-03-22 11:24:55 +01:00
Patrick McLean	c5c9c9b83f	Add isc_time_now_hires function to get current time with high resolution The current isc_time_now uses CLOCK_REALTIME_COARSE which only updates on a timer tick. This clock is generally fine for millisecond accuracy, but on servers with 100hz clocks, this clock is nowhere near accurate enough for microsecond accuracy. This commit adds a new isc_time_now_hires function that uses CLOCK_REALTIME, which gives the current time, though it is somewhat expensive to call. When microsecond accuracy is required, it may be required to use extra resources for higher accuracy. (cherry picked from commit `ebced74b19`)	2021-03-20 11:59:21 -07:00
Witold Kręcicki	a6c4702796	Fix a startup/shutdown crash in ns_clientmgr_create	2021-03-18 15:33:28 -03:00
Witold Kręcicki	dd564da286	Shutdown interface if we can't listen on it to avoid shutdown hang	2021-03-18 15:27:28 -03:00
Ondřej Surý	121641686c	Temporarily disable tlsdns_test until it gets refactored The tlsdns API is not yet used in the 9.16 branch and the tlsdns_test fails too often. Temporarily disable running the test until it is actually needed.	2021-03-18 15:42:03 +01:00
Ondřej Surý	db49ffca20	Change the isc_nm_(get\|set)timeouts() to work with milliseconds The RFC7828 specifies the keepalive interval to be 16-bit, specified in units of 100 milliseconds and the configuration options tcp-*-timeouts are following the suit. The units of 100 milliseconds are very unintuitive and while we can't change the configuration and presentation format, we should not follow this weird unit in the API. This commit changes the isc_nm_(get\|set)timeouts() functions to work with milliseconds and convert the values to milliseconds before passing them to the function, not just internally.	2021-03-18 15:16:13 +01:00
Ondřej Surý	5d0647e067	Merge the common parts between udp, tcpdns and tlsdns protocol The udp, tcpdns and tlsdns contained lot of cut&paste code or code that was very similar making the stack harder to maintain as any change to one would have to be copied to the the other protocols. In this commit, we merge the common parts into the common functions under isc__nm_<foo> namespace and just keep the little differences based on the socket type.	2021-03-18 15:16:13 +01:00
Ondřej Surý	a017ba2615	Fix TCPDNS and TLSDNS timers After the TCPDNS refactoring the initial and idle timers were broken and only the tcp-initial-timeout was always applied on the whole TCP connection. This broke any TCP connection that took longer than tcp-initial-timeout, most often this would affect large zone AXFRs. This commit changes the timeout logic in this way: * On TCP connection accept the tcp-initial-timeout is applied and the timer is started * When we are processing and/or sending any DNS message the timer is stopped * When we stop processing all DNS messages, the tcp-idle-timeout is applied and the timer is started again	2021-03-18 15:16:13 +01:00
Mark Andrews	3f9d7227b3	Stop using deprecated calls in lib/isc/tls.c from Rosen Penev @neheb (cherry picked from commit `a9f883cbc2`)	2021-03-18 10:15:44 +11:00
Matthijs Mekking	1a29d7ae7d	Fully sign a thawed zone When thawing a zone, we don't know what changes have been made. If we do DNSSEC maintenance on this zone, schedule a full sign. (cherry picked from commit `b90846f222`)	2021-03-17 11:12:39 +01:00
Matthijs Mekking	3ae16d0480	Fix "unable to thaw dynamic kasp zone" Dynamic zones with dnssec-policy could not be thawed because KASP zones were considered always dynamic. But a dynamic KASP zone should also check whether updates are disabled. (cherry picked from commit `b518ed9f46`)	2021-03-17 11:12:29 +01:00
Matthijs Mekking	96953fc293	Fix servestale fetchlimits crash When we query the resolver for a domain name that is in the same zone for which is already one or more fetches outstanding, we could potentially hit the fetch limits. If so, recursion fails immediately for the incoming query and if serve-stale is enabled, we may try to return a stale answer. If the resolver is also is authoritative for the parent zone (for example the root zone), first a delegation is found, but we first check the cache for a better response. Nothing is found in the cache, so we try to recurse to find the answer to the query. Because of fetch-limits 'dns_resolver_createfetch()' returns an error, which 'ns_query_recurse()' propagates to the caller, 'query_delegation_recurse()'. Because serve-stale is enabled, 'query_usestale()' is called, setting 'qctx->db' to the cache db, but leaving 'qctx->version' untouched. Now 'query_lookup()' is called to search for stale data in the cache database with a non-NULL 'qctx->version' (which is set to a zone db version), and thus we hit an assertion in rbtdb. This crash was introduced in 'v9_16' by commit `2afaff75ed`. (cherry picked from commit `87591de6f7`)	2021-03-11 13:47:20 +01:00
Evan Hunt	401642cbb4	fix automatic journal upgrade on windows - use a value less than 2^32 for DNS_ZONEFLG_FIXJOURNAL; a larger value could cause problems in some build environments. the zone flag DNS_ZONEFLG_DIFFONRELOAD, which was no longer in use, has now been deleted and its value reused for _FIXJOURNAL. (cherry picked from commit `990dd9dbff`)	2021-03-08 09:43:00 +01:00
Evan Hunt	2aed5c5954	add start and end serial numbers to 'named-printjournal -x' add the starting and ending serial number from the journal header to the output of dns_journal_print(). (cherry picked from commit `46c11726d4`)	2021-03-08 09:43:00 +01:00
Mark Andrews	898e9989f9	Silence CID 329159: Logically dead code in lib/dns/journal.c *** CID 329159: (DEADCODE) /lib/dns/journal.c: 1719 in dns_journal_print() 1713 } 1714 CHECK(dns_difftuple_create( 1715 diff.mctx, n_soa == 1 ? DNS_DIFFOP_DEL : DNS_DIFFOP_ADD, 1716 name, ttl, rdata, &tuple)); 1717 dns_diff_append(&diff, &tuple); 1718 >>> CID 329159: (DEADCODE) >>> Execution cannot reach the expression "printxhdr" inside this statement: "if (++n_put != 0U \|\| printx...". 1719 if (++n_put != 0 \|\| printxhdr) { 1720 result = dns_diff_print(&diff, file); 1721 dns_diff_clear(&diff); 1722 n_put = 0; 1723 if (result != ISC_R_SUCCESS) { 1724 break; /lib/dns/journal.c: 1734 in dns_journal_print() 1728 if (result == ISC_R_NOMORE) { 1729 result = ISC_R_SUCCESS; 1730 } 1731 CHECK(result); 1732 1733 if (!printxhdr && n_put != 0) { >>> CID 329159: (DEADCODE) >>> Execution cannot reach this statement: "result = dns_diff_print(&di...". 1734 result = dns_diff_print(&diff, file); 1735 dns_diff_clear(&diff); 1736 } 1737 goto cleanup; 1738 1739 failure: (cherry picked from commit `848e336db4`)	2021-03-08 16:44:43 +11:00
Mark Andrews	fb7e38cca2	Silence CID 329157: Dereference before null check in lib/dns/journal.c *** CID 329157: Null pointer dereferences (REVERSE_INULL) /lib/dns/journal.c: 754 in journal_open() 748 j->header.index_size * sizeof(journal_rawpos_t)); 749 } 750 if (j->index != NULL) { 751 isc_mem_put(j->mctx, j->index, 752 j->header.index_size * sizeof(journal_pos_t)); 753 } CID 329157: Null pointer dereferences (REVERSE_INULL) Null-checking "j->filename" suggests that it may be null, but it has already been dereferenced on all paths leading to the check. 754 if (j->filename != NULL) { 755 isc_mem_free(j->mctx, j->filename); 756 } 757 if (j->fp != NULL) { 758 (void)isc_stdio_close(j->fp); 759 } (cherry picked from commit `4054405909`)	2021-03-08 16:12:01 +11:00
Ondřej Surý	1c2e018962	Assigning uint64_t from buffer might be misaligned in netmgr tests Resolve possible 8-byte unaligned access when assigning the magic value from the received buffer. (cherry picked from commit `a55bdb28f9`)	2021-03-04 15:29:44 +01:00
Mark Andrews	985566fea5	Fixed librpz_dnsrpzd_path being a duplicate symbol librpz_dnsrpzd_path should have been declared extern in dns/librpz.h (cherry picked from commit `8f016dd57a`)	2021-03-04 11:28:49 +01:00
Evan Hunt	9f1f5abe0e	print journal index data and test for consistency 'named-journalprint -x' now prints the journal's index table and the offset of each transaction in the journal, so that index consistency can be confirmed. (cherry picked from commit `a4972324a6`)	2021-03-03 19:19:50 -08:00
Mark Andrews	5aea511e1b	extend named-journalprint to be able to force the journal version named-journalprint can now upgrade or downgrade a journal file in place; the '-u' option upgrades and the '-d' option downgrades. (cherry picked from commit `fb2d0e2897`)	2021-03-03 19:19:50 -08:00
Evan Hunt	47a274e9f1	allow dns_journal_rollforward() to read old journal files when the 'max-ixfr-ratio' option was added, journal transaction headers were revised to include a count of RR's in each transaction. this made it impossible to read old journal files after an upgrade. this branch restores the ability to read version 1 transaction headers. when rolling forward, printing journal contents, if the wrong transaction header format is found, we can switch. when dns_journal_rollforward() detects a version 1 transaction header, it returns DNS_R_RECOVERABLE. this triggers zone_postload() to force a rewrite of the journal file in the new format, and also to schedule a dump of the zone database with minimal delay. journal repair is done by dns_journal_compact(), which rewrites the entire journal, ignoring 'max-journal-size'. journal size is corrected later. newly created journal files now have "BIND LOG V9.2" in their headers instead of "BIND LOG V9". files with the new version string cannot be read using the old transaction header format. note that this means newly created journal files will be rejected by older versions of named. named-journalprint now takes a "-x" option, causing it to print transaction header information before each delta, including its format version. (cherry picked from commit `ee19966326`)	2021-03-03 19:19:50 -08:00
Mark Andrews	7a703244ed	Address theoretical buffer overrun in recent change The strlcat() call was wrong. *** CID 316608: Memory - corruptions (OVERRUN) /lib/dns/resolver.c: 5017 in fctx_create() 5011 * Make fctx->info point to a copy of a formatted string 5012 * "name/type". 5013 / 5014 dns_name_format(name, buf, sizeof(buf)); 5015 dns_rdatatype_format(type, typebuf, sizeof(typebuf)); 5016 p = strlcat(buf, "/", sizeof(buf)); >>> CID 316608: Memory - corruptions (OVERRUN) >>> Calling "strlcat" with "buf + p" and "1036UL" is suspicious because "buf" points into a buffer of 1036 bytes and the function call may access "(char )(buf + p) + 1035UL". [Note: The source code implementation of the function has been overridden by a builtin model.] 5017 strlcat(buf + p, typebuf, sizeof(buf)); 5018 fctx->info = isc_mem_strdup(mctx, buf); 5019 5020 FCTXTRACE("create"); 5021 dns_name_init(&fctx->name, NULL); 5022 dns_name_dup(name, mctx, &fctx->name); (cherry picked from commit `59bf6e71e2`)	2021-03-03 10:55:38 +01:00
Ondřej Surý	59d92c8af8	Call isc__initialize()/isc__shutdown() from win32 DllMain Call the libisc isc__initialize() constructor and isc__shutdown() destructor from DllMain instead of having duplicate code between those and DllMain() code. (cherry picked from commit `a50f5d0cf5`)	2021-03-02 08:50:29 +01:00
Ondřej Surý	86298fd112	Make linker include library ctor/dtor in statically linked libisc Under normal situation, the linker throws out all symbols from compilation unit when no symbols are used in the final binary, which is the case for lib/isc/lib.c. This commit adds empty function to lib.c that's being called from different CU (mem.c in this case) and that makes the linker to include all the symbols including the normally unreferenced isc__initialize() and isc__shutdown() in the final binary.	2021-02-26 21:19:33 +01:00
Ondřej Surý	f92b77ff0d	Change the isc_thread_self() return type to uintptr_t The pthread_self(), thrd_current() or GetCurrentThreadId() could actually be a pointer, so we should rather convert the value into uintptr_t instead of unsigned long. (cherry picked from commit `a0181056a8`)	2021-02-26 21:14:17 +01:00
Ondřej Surý	0e90858d99	Use globally assigned thread_id in the isc_hp API Convert the isc_hp API to use the globally available isc_tid_v instead of locally defined tid_v. This should solve most of the problems on machines with many number of cores / CPUs. (cherry picked from commit `bea333f7c9`)	2021-02-26 21:14:17 +01:00
Ondřej Surý	45c55b1e7e	Add isc_trampoline API to have simple accounting around threads The current isc_hp API uses internal tid_v variable that gets incremented for each new thread using hazard pointers. This tid_v variable is then used as a index to global shared table with hazard pointers state. Since the tid_v is only incremented and never decremented the table could overflow very quickly if we create set of threads for short period of time, they finish the work and cease to exist. Then we create identical set of threads and so on and so on. This is not a problem for a normal `named` operation as the set of threads is stable, but the problematic place are the unit tests where we test network manager or other APIs (task, timer) that create threads. This commits adds a thin wrapper around any function called from isc_thread_create() that adds unique-but-reusable small digit thread id that can be used as index to f.e. hazard pointer tables. The trampoline wrapper ensures that the thread ids will be reused, so the highest thread_id number doesn't grow indefinitely when threads are created and destroyed and then created again. This fixes the hazard pointer table overflow on machines with many cores. [GL #2396] (cherry picked from commit `cbbecfcc82`)	2021-02-26 21:14:17 +01:00
Ondřej Surý	0bd834856e	Enforce disabling internal memory allocator for AddressSanitizer Disable the internal memory allocator when AddressSanitizer is in use. The basic blocks in the internal memory allocator prevents AddressSanitizer from properly tracking the allocations and deallocations, so we need to ensure it has been disabled for any build that has AddressSanitizer enabled.	2021-02-26 10:13:56 -08:00
Ondřej Surý	1cd14c4dec	Add mempool get/put tracking with AddressSanitizer When AddressSanitizer is in use, disable the internal mempool implementation and redirect the isc_mempool_get to isc_mem_get (and similarly for isc_mempool_put). This is the method recommended by the AddressSanitizer authors for tracking allocations and deallocations instead of custom poison/unpoison code (see https://github.com/google/sanitizers/wiki/AddressSanitizerManualPoisoning).	2021-02-26 10:13:17 -08:00
Ondřej Surý	2011efa99d	Disable safe-guard assertion in DLL_THREAD_ATTACH/DLL_THREAD_DETACH The BIND 9 libraries on Windows define DllMain() optional entry point into a dynamic-link library (DLL). When the system starts or terminates a process or thread, it calls the entry-point function for each loaded DLL using the first thread of the process. When the DLL is being loaded into the virtual address space of the current process as a result of the process starting up, we make a call to DisableThreadLibraryCalls() which should disable the DLL_THREAD_ATTACH and DLL_THREAD_DETACH notifications for the specified dynamic-link library (DLL). This seems not be the case because we never check the return value of the DisableThreadLibraryCalls() call, and it could in fact fail. The DisableThreadLibraryCalls() function fails if the DLL specified by hModule has active static thread local storage, or if hModule is an invalid module handle. In this commit, we remove the safe-guard assertion put in place for the DLL_THREAD_ATTACH and DLL_THREAD_DETACH events and we just ignore them. BIND 9 doesn't create/destroy enough threads for it actually to make any difference, and in fact we do use static thread local storage in the code.	2021-02-26 17:18:06 +01:00

1 2 3 4 5 ...

12977 Commits