futriix/unit at ad242206819059937244eb519fa612936aee4143 - futriix - Gitea: Git with a cup of tea

gvsafronov/futriix

History

Binbin ad24220681

Automatic failover vote is not limited by two times the node timeout (#1356 )

This is a follow of #1305, we now decided to apply the same change
to automatic failover as well, that is, move forward with removing
it for both automatic and manual failovers.

Quote from Ping during the review:
Note that we already debounce transient primary failures with node
timeout, ensuring failover is only triggered after sustained outages.
Election timing is naturally staggered by replica spacing, making the
likelihood of simultaneous elections from replicas of the same shard
very low. The one-vote-per-epoch rule further throttles retries and
ensures orderly elections. On top of that, quorum-based primary failure
confirmation, cluster-state convergence, and slot ownership validation
are all built into the process.

Quote from Madelyn during the review:
It against the specific primary. It's to prevent double failovers.
If a primary just took over we don't want someone else to try to
take over and give the new primary some amount of time to take over.
I have not seen this issue though, it might have been over optimizing?
The double failure mode, where a node fails and then another node fails
within the nodetimeout also doesn't seem that common either though.

So the conclusion is that we all agreed to remove it completely,
it will make the code a lot simpler. And if there is other specific
edge cases we are missing, we will fix it in other way.

See discussion #1305 for more information.

Signed-off-by: Binbin <binloveplay1314@qq.com>

2024-12-15 12:09:53 +08:00

..

Automatic failover vote is not limited by two times the node timeout (#1356 )

2024-12-15 12:09:53 +08:00

Fix Module Update Args test when other modules are loaded (#1403 )

2024-12-07 10:25:40 +01:00

Replace dict with new hashtable for sets datatype (#1176 )

2024-12-14 20:53:48 +01:00

acl-v2.tcl

Minor cleanups in acl-v2 tests (#1166 )

2024-10-15 10:30:03 +08:00

acl.tcl

Remove 'Redis' in error replies (#206 )

2024-04-16 21:17:38 +02:00

aofrw.tcl

Speed up AOF rewrite test case (#1093 )

2024-09-30 19:55:23 +02:00

auth.tcl

Dual channel replication (#60 )

2024-07-17 13:59:33 -07:00

bitfield.tcl

Remove trademarked language in code comments (#223 )

2024-04-09 10:24:03 +02:00

bitops.tcl

Change BITCOUNT 'end' as optional like BITPOS (#118 )

2024-05-28 15:01:28 -04:00

client-eviction.tcl

Async IO threads (#758 )

2024-07-08 20:01:39 -07:00

dump.tcl

Async IO threads (#758 )

2024-07-08 20:01:39 -07:00

expire.tcl

Replace dict with hashtable for keys, expires and pubsub channels

2024-12-10 21:30:56 +01:00

functions.tcl

Fix FUNCTION KILL error message being displayed as SCRIPT KILL (#1171 )

2024-10-15 23:32:42 +08:00

geo.tcl

Rename redis to valkey in test suite logs and test names. (#366 )

2024-04-25 15:13:21 +08:00

hyperloglog.tcl

Optimize PFCOUNT, PFMERGE command by SIMD acceleration (#1293 )

2024-12-02 19:40:38 +01:00

info-command.tcl

Move printver test to info-command file (#1056 )

2024-09-20 10:18:19 +08:00

info.tcl

Replace dict with new hashtable for sets datatype (#1176 )

2024-12-14 20:53:48 +01:00

introspection-2.tcl

Rename redis to valkey in test suite logs and test names. (#366 )

2024-04-25 15:13:21 +08:00

introspection.tcl

Allow MEMORY MALLOC-STATS and MEMORY PURGE during loading phase (#1317 )

2024-12-08 20:30:07 +08:00

keyspace.tcl

Remove empty DB check branch in KEYS command (#1259 )

2024-11-06 10:32:00 +08:00

latency-monitor.tcl

Change all the lazyfree configurations to yes by default (#913 )

2024-09-02 07:07:17 -07:00

lazyfree.tcl

attempt to fix tracking test issue with external tests due to lazy free (#9722 )

2021-11-02 16:42:53 +02:00

limits.tcl

rename procedure redis_deferring_client to valkey_deferring_client (#270 )

2024-04-09 10:38:09 -04:00

maxmemory.tcl

Replace dict with hashtable for keys, expires and pubsub channels

2024-12-10 21:30:56 +01:00

memefficiency.tcl

Synchronously delete data during defrag tests (#1443 )

2024-12-14 19:14:01 +01:00

multi.tcl

Add 'WithDictIndex' expiry API and update RANDOMKEY command (#1155 )

2024-10-16 17:40:11 -07:00

networking.tcl

Improve multithreaded performance with memory prefetching (#861 )

2024-08-26 21:10:44 -07:00

obuf-limits.tcl

rename procedure redis_deferring_client to valkey_deferring_client (#270 )

2024-04-09 10:38:09 -04:00

oom-score-adj.tcl

Check user's oom_score_adj write permission for oom-score-adj test (#13111 )

2024-03-05 14:42:28 +02:00

other.tcl

Replace dict with hashtable for keys, expires and pubsub channels

2024-12-10 21:30:56 +01:00

pause.tcl

Fix primary crash when processing dirty slots during shutdown wait / failover wait / client pause (#1131 )

2024-11-15 16:47:15 +08:00

protocol.tcl

Rename redis to valkey in test suite logs and test names. (#366 )

2024-04-25 15:13:21 +08:00

pubsub.tcl

Revert "Decline unsubscribe related command in non-subscribed mode" (#1265 )

2024-11-07 20:05:16 -05:00

pubsubshard.tcl

Revert "Decline unsubscribe related command in non-subscribed mode" (#1265 )

2024-11-07 20:05:16 -05:00

querybuf.tcl

Async IO threads (#758 )

2024-07-08 20:01:39 -07:00

quit.tcl

flushSlavesOutputBuffers should not write to replicas scheduled to drop (#12242 )

2023-06-12 14:05:34 +03:00

replybufsize.tcl

Rename redis_client* procedure to valkey_client* in test environment (#276 )

2024-04-10 10:18:47 -04:00

scan.tcl

Finish postponed SCAN changes (#501 )

2024-05-17 13:35:31 +02:00

scripting.tcl

Fix aof race in shutdown nosave timedout script test (#1156 )

2024-10-13 22:06:28 +08:00

shutdown.tcl

rename procedure redis_deferring_client to valkey_deferring_client (#270 )

2024-04-09 10:38:09 -04:00

slowlog.tcl

Correctly recode client infomation to the slowlog when runing script (#805 )

2024-08-10 23:46:56 +08:00

sort.tcl

Fix SORT GET to ignore special pattern # in cluster slot check (#1182 )

2024-10-19 14:56:10 +08:00

tls.tcl

Replace "redis" with "valkey" test code (#287 )

2024-04-18 15:57:17 +02:00

tracking.tcl

Rename redis_client* procedure to valkey_client* in test environment (#276 )

2024-04-10 10:18:47 -04:00

violations.tcl

Run large-memory tests as solo. (#10626 )

2022-04-24 17:29:35 +03:00

wait.tcl

Fix the wrong woff when execute WAIT / WAITAOF in script (#776 )

2024-07-22 10:33:10 +02:00