futriix

Author	SHA1	Message	Date
Ping Xie	2b5c7a0dbd	Fix a typo in the 8.0 release notes (#1036 ) Signed-off-by: Ping Xie <pingxie@google.com> 8.0.0	2024-09-15 13:08:04 -07:00
Ping Xie	7424b17dab	Add Valkey 8.0 GA Release Notes (#1034 ) Based on #1031 --------- Signed-off-by: Ping Xie <pingxie@google.com> Signed-off-by: Ping Xie <pingxie@outlook.com> Co-authored-by: Madelyn Olson <madelyneolson@gmail.com>	2024-09-15 12:53:12 -07:00
Ping Xie	76ec25f90a	Revert "Update version number to 8.0" This reverts commit 3179f2528db86582fb7fbf26d6d0e59555cd6b18. Signed-off-by: Ping Xie <pingxie@google.com>	2024-09-15 11:49:49 -07:00
Ping Xie	aff4d508a3	Update version number to 8.0 Signed-off-by: Ping Xie <pingxie@google.com>	2024-09-15 11:49:49 -07:00
Shivshankar	74113bd2c3	Update valkey-benchmark log output to reference 'server' instead of 'Redis' (#1029 ) Replaced "Could not connect to Redis" with "Could not connect to server" in the log output for connection errors in `getRedisContext` and `createClient`. Signed-off-by: Shivshankar-Reddy <shiva.sheri.github@gmail.com> Signed-off-by: Ping Xie <pingxie@google.com>	2024-09-15 11:49:49 -07:00
Binbin	6f9786b9b3	Replica flush the old data after RDB file is ok in disk-based replication (#926 ) Call emptyData right before rdbLoad to prevent errors in the middle and we drop the replication stream and leaving an empty database. The real changes is in disk-based part, the rest is just code movement. Signed-off-by: Binbin <binloveplay1314@qq.com> Signed-off-by: Ping Xie <pingxie@google.com>	2024-09-15 11:49:49 -07:00
Ping Xie	cf51462b5b	Improve code readability in dict.c (#943 ) This pull request improves code readability, as a follow up of #749. - Internal Naming Conventions: Removed the use of underscores (_) for internal static structures/functions. - Descriptive Function Names: Updated function names to be more descriptive, making their purpose clearer. For instance, `_dictExpand` is renamed to `dictExpandIfAutoResizeAllowed`. --------- Signed-off-by: Ping Xie <pingxie@google.com>	2024-09-15 11:49:49 -07:00
Binbin	bcd5c4746b	Fix replica unable trigger migration when it received CLUSTER SETSLOT in advance (#981 ) Fix timing issue in evaluating `cluster-allow-replica-migration` for replicas There is a timing bug where the primary and replica have different `cluster-allow-replica-migration` settings. In issue #970, we found that if the replica receives `CLUSTER SETSLOT` before the gossip update, it remains in the original shard. This happens because we only process the `cluster-allow-replica-migration` flag for primaries during `CLUSTER SETSLOT`. This commit fixes the issue by also evaluating this flag for replicas in the `CLUSTER SETSLOT` path, ensuring correct replica migration behavior. Closes #970 --------- Signed-off-by: Binbin <binloveplay1314@qq.com> Co-authored-by: Ping Xie <pingxie@outlook.com> Signed-off-by: Ping Xie <pingxie@google.com>	2024-09-15 11:49:49 -07:00
Binbin	09ed2fccca	Avoid false positive in election tests (#984 ) The node may not be able to initiate an election in time due to problems with cluster communication. If an election is initiated, make sure its offset is 0. Closes #967. Signed-off-by: Binbin <binloveplay1314@qq.com> Signed-off-by: Ping Xie <pingxie@google.com>	2024-09-15 11:49:49 -07:00
Binbin	d4e6546f7c	Trigger a save of the cluster configuration file before shutting down (#822 ) The cluster configuration file is the metadata "database" for the cluster. It is best to trigger a save when shutdown the server, to avoid inconsistent content that is not refreshed. We save the nodes.conf whenever something that affects the nodes.conf has changed. But we are saving nodes.conf in clusterBeforeSleep, and some events may save it without a fsync, there is a time gap. And shutdown has its own save seems good to me, it doesn't need to care about the others. At the same time, a comment is added in unlock nodes.conf to explain why we actively unlock when shutdown. Signed-off-by: Binbin <binloveplay1314@qq.com> Signed-off-by: Ping Xie <pingxie@google.com>	2024-09-15 11:49:49 -07:00
Ping Xie	9204cd04d1	Re-enable empty-shard slot migration tests (#1024 ) Related to #734 and #858 Signed-off-by: Ping Xie <pingxie@google.com>	2024-09-15 11:49:49 -07:00
xu0o0	7c3f9379bd	Make clang-format insert a newline at end of file if missing (#1023 ) clang generates warning if there is no newline at the end of the source file. Update .clang-format to handle the missing newline at eof. Signed-off-by: haoqixu <hq.xu0o0@gmail.com> Signed-off-by: Ping Xie <pingxie@google.com>	2024-09-15 11:49:49 -07:00
uriyage	420da15c0c	Fix wrong count for replica's tot-net-out (#1013 ) Fix duplicate calculation of replica's `net_output_bytes` - Remove redundant calculation leftover from previous refactor - Add test to prevent regression Signed-off-by: Uri Yagelnik <uriy@amazon.com> Signed-off-by: Binbin <binloveplay1314@qq.com> Co-authored-by: Binbin <binloveplay1314@qq.com> Signed-off-by: Ping Xie <pingxie@google.com>	2024-09-15 11:49:49 -07:00
Madelyn Olson	13f2b61fff	Optimize the per slot dictionary by checking for cluster mode earlier (#995 ) While doing some profiling, I noticed that getKeySlot() was a fairly large part (~0.7%) of samples doing perf with high pipeline during standalone. I think this is because we do a very late check for server.cluster_mode, we first call getKeySlot() and then call calculateKeySlot(). (calculateKeySlot was surprisingly not automatically inlined, we were doing a jump into it and then immediately returning zero). We then also do useless work in the form of caching zero in client->slot, which will further mess with cache lines. So, this PR tries to accomplish a few things things. 1) The usage of the `slot` name made a lot more sense before the introduction of the kvstore. Now with kvstore, we call this the database index, so all the references to slot in standalone are no longer really accurate. 2) Pull the cluster mode check all the way out of getKeySlot(), so hopefully a bit more performant. 3) Remove calculateKeySlot() as independent from getKeySlot(). calculateKeySlot used to have 3 call sites outside of db.c, which warranted it's own function. It's now only called in two places, pubsub.c and networking.c. I ran some profiling, and saw about ~0.3% improvement, but don't really trust it because you'll see a much higher (~2%) variance in test runs just by how the branch predictions will get changed with a new memory layout. Running perf again showed no samples in getKeySlot() and a reduction in samples in lookupKey(), so maybe this will help a little bit. --------- Signed-off-by: Madelyn Olson <madelyneolson@gmail.com> Signed-off-by: Ping Xie <pingxie@google.com>	2024-09-15 11:49:49 -07:00
Madelyn Olson	79561f0828	Improve stability of hostnames test (#1016 ) Maybe partially resolves https://github.com/valkey-io/valkey/issues/952. The hostnames test relies on an assumption that node zero and node six don't communicate with each other to test a bunch of behavior in the handshake stake. This was done by previously dropping all meet packets, however it seems like there was some case where node zero was sending a single pong message to node 6, which was partially initializing the state. I couldn't track down why this happened, but I adjusted the test to simply pause node zero which also correctly emulates the state we want to be in since we're just testing state on node 6, and removes the chance of errant messages. The test was failing about 5% of the time locally, and I wasn't able to reproduce a failure with this new configuration. --------- Signed-off-by: Madelyn Olson <madelyneolson@gmail.com> Signed-off-by: Ping Xie <pingxie@google.com>	2024-09-15 11:49:49 -07:00
Mikhail Koviazin	c90bd280e2	Added .git-blame-ignore-revs (#1010 ) This file enables developers to ignore the certain revisions in git-blame. This is quite handy considering there was a commit that reformatted the large amount of code in valkey. As a downside, one has to do a manual step for each clone of valkey to enable this feature. The instructions are available in the file itself. --------- Signed-off-by: Mikhail Koviazin <mikhail.koviazin@aiven.io> Signed-off-by: Ping Xie <pingxie@google.com>	2024-09-15 11:49:49 -07:00
Binbin	850a8e90bf	Fix module RdbLoad wrongly disable the AOF (#1001 ) In RdbLoad, we disable AOF before emptyData and rdbLoad to prevent copy-on-write issues. After rdbLoad completes, AOF should be re-enabled, but the code incorrectly checks server.aof_state, which has been reset to AOF_OFF in stopAppendOnly. This leads to AOF not being re-enabled after being disabled. --------- Signed-off-by: Binbin <binloveplay1314@qq.com> Signed-off-by: Ping Xie <pingxie@google.com>	2024-09-15 11:49:49 -07:00
Amit Nagler	bc6cca24c3	Dual Channel Replication - Verify Replica Local Buffer Limit Configuration (#989 ) Prior to comparing the replica buffer against the configured limit, we need to ensure that the limit configuration is enabled. If the limit is set to zero, it indicates that there is no limit, and we should skip the buffer limit check. --------- Signed-off-by: naglera <anagler123@gmail.com> Signed-off-by: Ping Xie <pingxie@google.com>	2024-09-15 11:49:49 -07:00
Lipeng Zhu	e495448dbe	Use hashtable as the default type of temp set object during sunion/sdiff (#996 ) This patch try to set the temp set object as default hash table type. And did a simple predication of the temp set object encoding when initialize `dstset` to reduce the unnecessary conversation. According to existing code logic, when did operation like `sunion` and `sdiff` , the temp set object could be `intset`, `listpack` and `hashtable`, for the `listpack`, the efficiency is low when did operation like `find` and `compare` , need to traverse all elements. When we exploring the hotspots, found the `lpFind` and `memcmp` has been the bottleneck when running workloads like below: - [memtier_benchmark-2keys-set-10-100-elements-sunion.yml](https://github.com/redis/redis-benchmarks-specification/blob/main/redis_benchmarks_specification/test-suites/memtier_benchmark-2keys-set-10-100-elements-sunion.yml) - [memtier_benchmark-2keys-set-10-100-elements-sdiff.yml](https://github.com/redis/redis-benchmarks-specification/blob/main/redis_benchmarks_specification/test-suites/memtier_benchmark-2keys-set-10-100-elements-sdiff.yml) ![image](https://github.com/user-attachments/assets/71dfc70b-2ad5-4832-a338-712deefca20e) This patch try to set the temp set object as default hash table type. And did a simple predication of the temp set object encoding when initialize `dstset` to reduce the unnecessary conversation. - OPERATING SYSTEM: Ubuntu 22.04.4 LTS - Kernel: 5.15.0-116-generic - PROCESSOR: Intel Xeon Platinum 8380 - Server and Client in same socket. ``` taskset -c 0-3 ~/valkey/src/valkey-server /tmp/valkey.conf port 9001 bind * -::* daemonize no protected-mode no save "" ``` \| Test Name\| Perf Boost\| \|-\|-\| \|[memtier_benchmark-2keys-set-10-100-elements-sunion.yml](https://github.com/redis/redis-benchmarks-specification/blob/main/redis_benchmarks_specification/test-suites/memtier_benchmark-2keys-set-10-100-elements-sunion.yml) \|41%\| \|[memtier_benchmark-2keys-set-10-100-elements-sdiff.yml](https://github.com/redis/redis-benchmarks-specification/blob/main/redis_benchmarks_specification/test-suites/memtier_benchmark-2keys-set-10-100-elements-sdiff.yml) \|27%\| With above test set which have total 110 elements in the 2 given sets. We also did some benchmarking by adjusting the total number of elements in all given sets. We can still observe the performance boost. ![image](https://github.com/user-attachments/assets/b2ab420c-43e5-45de-9715-7d943df229cb) --------- Signed-off-by: Lipeng Zhu <lipeng.zhu@intel.com> Co-authored-by: Wangyang Guo <wangyang.guo@intel.com> Signed-off-by: Ping Xie <pingxie@google.com>	2024-09-15 11:49:49 -07:00
uriyage	b0478078c5	Fix crash in async IO threads with TLS (#1011 ) Fix for https://github.com/valkey-io/valkey/issues/997 Root Cause Analysis: 1. Two different jobs (READ and WRITE) may be sent to the same IO thread. 2. When processing the read job in `processIOThreadsReadDone`, the IO thread may find that the write job has also been completed. 3. In this case, the IO thread calls `processClientIOWriteDone` to first process the completed write job and free the COBs `affbea5dc1/src/networking.c (L4666)` 4. If there are pending writes (resulting from pipeline commands), a new async IO write job is sent before processing the completed read job `affbea5dc1/src/networking.c (L2417)` When sending the write job, the `TLS_CONN_FLAG_POSTPONE_UPDATE_STATE` flag is set to prevent the IO thread from updating the event loop, which is not thread-safe. 5. Upon resuming the read job processing, the flag is cleared, `affbea5dc1/src/networking.c (L4685)` causing the IO thread to update the event loop. Fix: Prevent sending async write job for pending writes when a read job is about to be processed. Testing: The issue could not be reproduced due to its rare occurrence, which requires multiple specific conditions to align simultaneously. Signed-off-by: Uri Yagelnik <uriy@amazon.com> Signed-off-by: Ping Xie <pingxie@google.com>	2024-09-15 11:49:49 -07:00
bentotten	cce5c365d8	For MEETs, save the extensions support flag immediately during MEET processing (#778 ) For backwards compatibility reasons, a node will wait until it receives a cluster message with the extensions flag before sending its own extensions. This leads to a delay in shard ID propagation that can corrupt nodes.conf with inaccurate shard IDs if a node is restarted before this can stabilize. This fixes much of that delay by immediately triggering the extensions-supported flag during the MEET processing and attaching the node to the link, allowing the PONG reply to contain OSS extensions. Partially fixes #774 --------- Signed-off-by: Ben Totten <btotten@amazon.com> Co-authored-by: Ben Totten <btotten@amazon.com> Signed-off-by: Ping Xie <pingxie@google.com>	2024-09-15 11:49:49 -07:00
Binbin	3d92fb98ca	Add missing moduleapi getchannels test and fix tests (#1002 ) Signed-off-by: Binbin <binloveplay1314@qq.com> Signed-off-by: Ping Xie <pingxie@google.com>	2024-09-15 11:49:49 -07:00
zhaozhao.zz	cbb5a3be8b	add assertion for kvstore's dictType (#1004 ) Signed-off-by: zhaozhao.zz <zhaozhao.zz@alibaba-inc.com> Signed-off-by: Ping Xie <pingxie@google.com>	2024-09-15 11:49:49 -07:00
xu0o0	9e10a3d712	Migrate dict.c unit tests to new framework (#946 ) This PR migrates the tests related to dict into new test framework as part of #428. Signed-off-by: haoqixu <hq.xu0o0@gmail.com> Signed-off-by: Binbin <binloveplay1314@qq.com> Co-authored-by: Binbin <binloveplay1314@qq.com> Signed-off-by: Ping Xie <pingxie@google.com>	2024-09-15 11:49:49 -07:00
xu0o0	237ef5db80	Migrate listpack.c unit tests to new framework (#949 ) This PR migrates the tests related to listpack into new test framework as part of #428. Signed-off-by: haoqixu <hq.xu0o0@gmail.com> Signed-off-by: Binbin <binloveplay1314@qq.com> Co-authored-by: Binbin <binloveplay1314@qq.com> Signed-off-by: Ping Xie <pingxie@google.com>	2024-09-15 11:49:49 -07:00
Binbin	4dadcccb69	Add client info to SHUTDOWN / CLUSTER FAILOVER logs (#875 ) Print the full client info by using catClientInfoString, the info is useful when we want to identify the source of request. Signed-off-by: Binbin <binloveplay1314@qq.com> Signed-off-by: Ping Xie <pingxie@google.com>	2024-09-15 11:49:49 -07:00
Binbin	b5badeb785	Fix aof base suffix when modifying aof-use-rdb-preamble during rewrite (#886 ) If we modify aof-use-rdb-preamble in the middle of rewrite, we may get a wrong aof base suffix. This is because the suffix is concatenated by the main process afterwards, and it may be different from the beginning. We cache this value when we start the rewrite. Signed-off-by: Binbin <binloveplay1314@qq.com> Signed-off-by: Ping Xie <pingxie@google.com>	2024-09-15 11:49:49 -07:00
Binbin	ee1dcc0b72	Fix missing replication link re-connection when primary's IP/port is updated in `clusterProcessGossipSection` (#965 ) `clusterProcessGossipSection` currently doesn't trigger a check and call `replicationSetPrimary` when `myself`'s primary node’s IP/port is updated. This fix ensures that after every node address update, `replicationSetPrimary` is called if the updated node is `myself`'s primary. This prevents missed updates and ensures that replicas reconnect properly to maintain their replication link with the primary. Signed-off-by: Ping Xie <pingxie@google.com>	2024-09-15 11:49:49 -07:00
Binbin	1478c0de8d	Add newline to argv in crash report when doing redact (#993 ) Minor cleanup, introduced in #877. Signed-off-by: Binbin <binloveplay1314@qq.com> Signed-off-by: Ping Xie <pingxie@google.com>	2024-09-15 11:49:49 -07:00
Kyle Kim (kimkyle@)	92bc8f1be3	Add SLOT-STATS under CLUSTER HELP string. (#988 ) Add help wording for cluster SLOT-STATS. Signed-off-by: Kyle Kim <kimkyle@amazon.com> Signed-off-by: Ping Xie <pingxie@google.com>	2024-09-15 11:49:49 -07:00
Viktor Söderqvist	42fdf7f527	Rewrite lazyfree docs in valkey.conf to reflect that lazy is now default (#983 ) Before this doc update, the comments in valkey.conf said that DEL is a blocking command, and even refered to other synchronous freeing as "in a blocking way, like if DEL was called". This has now become confusing and incorrect, since DEL is now non-blocking by default. The comments also mentioned too much about the "old default" and only later explain that the "new default" is non-blocking. This doc update focuses on the current default and expresses it like "Starting from Valkey 8.0, lazy freeing is enabled by default", rather than using words like old and new. This is a follow-up to #913. --------- Signed-off-by: Viktor Söderqvist <viktor.soderqvist@est.tech> Signed-off-by: Ping Xie <pingxie@google.com>	2024-09-15 11:49:49 -07:00
NAM UK KIM	230ff3ae65	Fix typo in valkey-cli.c (#979 ) Change from replicsa to replicas in valkey-cli.c Signed-off-by: NAM UK KIM <namuk2004@naver.com> Signed-off-by: Ping Xie <pingxie@google.com>	2024-09-15 11:49:49 -07:00
Ping Xie	6f2b3d46bb	Improve type safety and refactor dict entry handling (#749 ) This pull request introduces several changes to improve the type safety of Valkey's dictionary implementation: - Getter/Setter Macros: Implemented macros `DICT_SET_VALUE` and `DICT_GET_VALUE` to centralize type casting within these macros. This change emulates the behavior of C++ templates in C, limiting type casting to specific low-level operations and preventing it from being spread across the codebase. - Reduced Assert Overhead: Removed unnecessary asserts from critical hot paths in the dictionary implementation. - Consistent Naming: Standardized the naming of dictionary entry types. For example, all dictionary entry types start their names with `dictEntry`. Fix #737 --------- Signed-off-by: Ping Xie <pingxie@google.com> Signed-off-by: Ping Xie <pingxie@outlook.com> Co-authored-by: Madelyn Olson <madelyneolson@gmail.com> Signed-off-by: Ping Xie <pingxie@google.com>	2024-09-15 11:49:49 -07:00
Madelyn Olson	85a58478df	Added release notes for RC2 Signed-off-by: Madelyn Olson <madelyneolson@gmail.com> 8.0.0-rc2	2024-09-03 09:00:45 -07:00
Madelyn Olson	509f026326	Initialize all the fields for the test kvstore (#982 ) Follow up to https://github.com/valkey-io/valkey/pull/966, which didn't update the kvstore tests. I'm not actually entirely clear why it fixes it, but the consistency prevents the crash very reliably so will merge it now and maybe see if Zhao has a better explanation. --------- Signed-off-by: Madelyn Olson <madelyneolson@gmail.com>	2024-09-03 09:00:45 -07:00
Amit Nagler	2c46f2047b	Add configuration hide-user-data-from-log to hide user data from server logs (#877 ) Implement data masking for user data in server logs and diagnostic output. This change prevents potential exposure of confidential information, such as PII, and enhances privacy protection. It masks all command arguments, client names, and client usernames. Added a new hide-user-data-from-log configuration item, default yes. --------- Signed-off-by: Amit Nagler <anagler123@gmail.com>	2024-09-03 09:00:45 -07:00
Binbin	def9ddf1e5	Fix set expire test due to the new lazyfree configs changes (#980 ) Test failed because these two PRs #865 and #913. Signed-off-by: Binbin <binloveplay1314@qq.com>	2024-09-03 09:00:45 -07:00
zhaozhao.zz	e7d46558fe	Use metadata to handle the reference relationship between kvstore and dict (#966 ) Feature `one-dict-per-slot` refactors the database, and part of it involved splitting the rehashing list from the global level back to the database level, or more specifically, the kvstore level. This change is fine, and it also simplifies the process of swapping databases, which is good. And it should not have a major impact on the efficiency of incremental rehashing. To implement the kvstore-level rehashing list, each `dict` under the `kvstore` needs to know which `kvstore` it belongs. However, kvstore did not insert the reference relationship into the `dict` itself, instead, it placed it in the `dictType`. In my view, this is a somewhat odd way. Theoretically, `dictType` is just a collection of function handles, a kind of virtual type that can be referenced globally, not an entity. But now the `dictType` is instantiated, with each `kvstore` owning an actual `dictType`, which in turn holds a reverse reference to the `kvstore`'s resource pointer. This design is somewhat uncomfortable for me. I think the `dictType` should not be instantiated. The references between actual resources (`kvstore` and `dict`) should occur between specific objects, rather than force materializing the `dictType`, which is supposed to be virtual. --------- Signed-off-by: zhaozhao.zz <zhaozhao.zz@alibaba-inc.com>	2024-09-03 09:00:45 -07:00
Binbin	1dcd3f23d8	Change all the lazyfree configurations to yes by default (#913 ) ## Set replica-lazy-flush and lazyfree-lazy-user-flush to yes by default. There are many problems with running flush synchronously. Even in single CPU environments, the thread managers should balance between the freeing and serving incoming requests. ## Set lazy eviction, expire, server-del, user-del to yes by default We now have a del and a lazyfree del, we also have these configuration items to control: lazyfree-lazy-eviction, lazyfree-lazy-expire, lazyfree-lazy-server-del, lazyfree-lazy-user-del. In most cases lazyfree is better since it reduces the risk of blocking the main thread, and because we have lazyfreeGetFreeEffort, on those with high effor (currently 64) will use lazyfree. Part of #653. --------- Signed-off-by: Binbin <binloveplay1314@qq.com>	2024-09-03 09:00:45 -07:00
Madelyn Olson	cb4d6f665f	Fix zipmap test null pointer (#975 ) The previous test does a strncmp on a NULL, which is not valid. It should be using an empty length string instead. Addresses https://github.com/valkey-io/valkey/actions/runs/10649272046/job/29519233939. Signed-off-by: Madelyn Olson <madelyneolson@gmail.com>	2024-09-03 09:00:45 -07:00
Binbin	56c61c6ee3	Fast path in SET if the expiration time is expired (#865 ) If the expiration time passed in SET is expired, for example, it has expired due to the machine time (DTS) or the expiration time passed in (wrong arg). In this case, we don't need to set the key and wait for the active expire scan before deleting the key. Compared with previous changes: 1. If the key does not exist, previously we would set the key and wait for the active expire to delete it, so it is a set + del from the perspective of propaganda. Now we will no set the key and return, so it a NOP. 2. If the key exists, previously we woule set the key and wait for the active expire to delete it, so it is a set + del From the perspective of propaganda. Now we will delete it and return, so it is a del. Adding a new deleteExpiredKeyFromOverwriteAndPropagate function to reduce the duplicate code. Signed-off-by: Binbin <binloveplay1314@qq.com> Co-authored-by: Madelyn Olson <madelyneolson@gmail.com>	2024-09-03 09:00:45 -07:00
Viktor Söderqvist	1758fbf27f	Delete unused parts of zipmap (#973 ) Deletes zipmapSet, zipmapGet, etc. Only keep iterator and validate integrity, what we use when loading an old RDB file. Adjust unit tests to not use zipmapSet, etc. Solves a build failure where when compiling with fortify source. --------- Signed-off-by: Viktor Söderqvist <viktor.soderqvist@est.tech>	2024-09-03 09:00:45 -07:00
Binbin	21c2ad7b4a	Fix timing issue in replica migration test (#968 ) The reason is the server 3 still have the server 7 as its replica due to a short wait, the wait is not enough, we should wait for server loss its replica. ``` *** [err]: valkey-cli make source node ignores NOREPLICAS error when doing the last CLUSTER SETSLOT Expected '{127.0.0.1 21497 267}' to be equal to '' (context: type eval line 34 cmd {assert_equal [lindex [R 3 role] 2] {}} proc ::test) ``` Signed-off-by: Binbin <binloveplay1314@qq.com>	2024-09-03 09:00:45 -07:00
zhaozhao.zz	1864c2f4b4	standalone -REDIRECT handles special case of MULTI context (#895 ) In standalone mode, when a `-REDIRECT` error occurs, special handling is required if the client is in the `MULTI` context. We have adopted the same handling method as the cluster mode: 1. If a command in the transaction encounters a `REDIRECT` at the time of queuing, the execution of `EXEC` will return an `EXECABORT` error (we expect the client to redirect and discard the transaction upon receiving a `REDIRECT`). That is: ``` MULTI ==> +OK SET x y ==> -REDIRECT EXEC ==> -EXECABORT ``` 2. If all commands are successfully queued (i.e., `QUEUED` results are received) but a redirect is detected during `EXEC` execution (such as a primary-replica switch), a `REDIRECT` is returned to instruct the client to perform a redirect. That is: ``` MULTI ==> +OK SET x y ==> +QUEUED failover EXEC ==> -REDIRECT ``` --------- Signed-off-by: zhaozhao.zz <zhaozhao.zz@alibaba-inc.com>	2024-09-03 09:00:45 -07:00
Shivshankar	5450d41fde	Migrate zipmap unit test to new framework (#474 ) Migrate zipmap unit test to new unit test framework, parent ticket #428 . --------- Signed-off-by: Shivshankar-Reddy <shiva.sheri.github@gmail.com> Signed-off-by: hwware <wen.hui.ware@gmail.com> Co-authored-by: hwware <wen.hui.ware@gmail.com>	2024-09-03 09:00:45 -07:00
Binbin	115fc0f912	Fix reconfiguring sub-replica causing data loss when myself change shard_id (#944 ) When reconfiguring sub-replica, there may a case that the sub-replica will use the old offset and win the election and cause the data loss if the old primary went down. In this case, sender is myself's primary, when executing updateShardId, not only the sender's shard_id is updated, but also the shard_id of myself is updated, casuing the subsequent areInSameShard check, that is, the full_sync_required check to fail. As part of the recent fix of #885, the sub-replica needs to decide whether a full sync is required or not when switching shards. This shard membership check is supposed to be done against sub-replica's current shard_id, which however was lost in this code path. This then leads to sub-replica joining the other shard with a completely different and incorrect replication history. This is the only place where replicaof state can be updated on this path so the most natural fix would be to pull the chain replication reduction logic into this code block and before the updateShardId call. This one follow #885 and closes #942. Signed-off-by: Binbin <binloveplay1314@qq.com> Co-authored-by: Ping Xie <pingxie@outlook.com>	2024-09-03 09:00:45 -07:00
zhaozhao.zz	bc1cc05a25	free client's multi state when it becomes dirty (#961 ) Release the client's MULTI state when the transaction becomes dirty to save memory. --------- Signed-off-by: zhaozhao.zz <zhaozhao.zz@alibaba-inc.com>	2024-09-03 09:00:45 -07:00
Ping Xie	0def936a88	Exclude '.' and ':' from `isValidAuxChar`'s banned charset (#963 ) Fix a bug in isValidAuxChar where valid characters '.' and ':' were incorrectly included in the banned charset. This issue affected the validation of auxiliary fields in the nodes.conf file used by Valkey in cluster mode, particularly when handling IPv4 and IPv6 addresses. The code now correctly allows '.' and ':' as valid characters, ensuring proper handling of these fields. Comments were added to clarify the use of the banned charset. Related to #736 --------- Signed-off-by: Ping Xie <pingxie@google.com>	2024-09-03 09:00:45 -07:00
Binbin	d2d93ea17c	Revert make KEYS to be an exact match if there is no pattern (#964 ) In #792, the time complexity became ambiguous, fluctuating between O(1) and O(n), which is a significant difference. And we agree uncertainty can potentially bring disaster to the business, the right thing to do is to persuade users to use EXISTS instead of KEYS in this case, to do the right thing the right way, rather than accommodating this incorrect usage. This reverts commit d66a06e8183818c035bb78706f46fd62645db07e. This reverts #792. Signed-off-by: Binbin <binloveplay1314@qq.com>	2024-09-03 09:00:45 -07:00
Viktor Söderqvist	be7f15c7b7	Delete TLS.md and update README.md about tests (#960 ) Most of the content of TLS.md has already been copied to README.md in #927. The description of how to run tests with TLS is moved to tests/README.md. Descriptions of the additional scripts runtest-cluster, runtest-sentinel and runtest-module are added in tests/README.md. Links to tests/README.md and src/unit/README.md are added in the top-level README.md along with a brief overview of the `make test-*` commands. Signed-off-by: Viktor Söderqvist <viktor.soderqvist@est.tech>	2024-09-03 09:00:45 -07:00

1 2 3 4 5 ...

12598 Commits