futriix/unit at b9d224097a46dbe62ec0857cb91e7c67505a200e - futriix - Gitea: Git with a cup of tea

gvsafronov/futriix

History

Binbin b9d224097a

Brocast a PONG to all node in cluster when role changed (#1295 )

When a node role changes, we should brocast the change to notify other nodes.
For example, one primary and one replica, after a failover, the replica became
a new primary, the primary became a new replica.

And then we trigger a second cluster failover for the new replica, the
new replica will send a MFSTART to its primary, ie, the new primary.

But the new primary may reject the MFSTART due to this logic:
```
    } else if (type == CLUSTERMSG_TYPE_MFSTART) {
        if (!sender || sender->replicaof != myself) return 1;
```

In the new primary views, sender is still a primary, and sender->replicaof
is NULL, so we will return. Then the manual failover timedout.

Another possibility is that other primaries refuse to vote after receiving
the FAILOVER_AUTH_REQUEST, since in their's views, sender is still a primary,
so it refuse to vote, and then manual failover timedout.
```
void clusterSendFailoverAuthIfNeeded(clusterNode *node, clusterMsg *request) {
    ...
        if (clusterNodeIsPrimary(node)) {
            serverLog(LL_WARNING, "Failover auth denied to...
```

The reason is that, currently, we only update the node->replicaof information
when we receive a PING/PONG from the sender. For details, see clusterProcessPacket.
Therefore, in some scenarios, such as clusters with many nodes and a large
cluster-ping-interval (that is, cluster-node-timeout), the role change of the node
will be very delayed.

Added a DEBUG DISABLE-CLUSTER-RANDOM-PING command, send cluster ping
to a random node every second (see clusterCron).

Signed-off-by: Binbin <binloveplay1314@qq.com>

2024-11-23 00:22:04 +08:00

..

Brocast a PONG to all node in cluster when role changed (#1295 )

2024-11-23 00:22:04 +08:00

Change errno from EEXIST to EALREADY in serverFork if child process exists (#1258 )

2024-11-07 12:13:00 +08:00

Fix set expire test due to the new lazyfree configs changes (#980 )

2024-09-02 22:43:09 +08:00

acl-v2.tcl

Minor cleanups in acl-v2 tests (#1166 )

2024-10-15 10:30:03 +08:00

acl.tcl

Remove 'Redis' in error replies (#206 )

2024-04-16 21:17:38 +02:00

aofrw.tcl

Speed up AOF rewrite test case (#1093 )

2024-09-30 19:55:23 +02:00

auth.tcl

Dual channel replication (#60 )

2024-07-17 13:59:33 -07:00

bitfield.tcl

Remove trademarked language in code comments (#223 )

2024-04-09 10:24:03 +02:00

bitops.tcl

Change BITCOUNT 'end' as optional like BITPOS (#118 )

2024-05-28 15:01:28 -04:00

client-eviction.tcl

Async IO threads (#758 )

2024-07-08 20:01:39 -07:00

dump.tcl

Async IO threads (#758 )

2024-07-08 20:01:39 -07:00

expire.tcl

Import-mode: Avoid expiration and eviction during data syncing (#1185 )

2024-11-19 21:53:19 +01:00

functions.tcl

Fix FUNCTION KILL error message being displayed as SCRIPT KILL (#1171 )

2024-10-15 23:32:42 +08:00

geo.tcl

Rename redis to valkey in test suite logs and test names. (#366 )

2024-04-25 15:13:21 +08:00

hyperloglog.tcl

redisbenchmark to valkeybenchmark in test directory and some test name rename. (#347 )

2024-04-23 10:51:53 -07:00

info-command.tcl

Move printver test to info-command file (#1056 )

2024-09-20 10:18:19 +08:00

info.tcl

Revert "Decline unsubscribe related command in non-subscribed mode" (#1265 )

2024-11-07 20:05:16 -05:00

introspection-2.tcl

Rename redis to valkey in test suite logs and test names. (#366 )

2024-04-25 15:13:21 +08:00

introspection.tcl

Add io-threads-do-reads config to deprecated config table to have no effect. (#1138 )

2024-10-10 17:46:09 +02:00

keyspace.tcl

Remove empty DB check branch in KEYS command (#1259 )

2024-11-06 10:32:00 +08:00

latency-monitor.tcl

Change all the lazyfree configurations to yes by default (#913 )

2024-09-02 07:07:17 -07:00

lazyfree.tcl

attempt to fix tracking test issue with external tests due to lazy free (#9722 )

2021-11-02 16:42:53 +02:00

limits.tcl

rename procedure redis_deferring_client to valkey_deferring_client (#270 )

2024-04-09 10:38:09 -04:00

maxmemory.tcl

Import-mode: Avoid expiration and eviction during data syncing (#1185 )

2024-11-19 21:53:19 +01:00

memefficiency.tcl

Change all the lazyfree configurations to yes by default (#913 )

2024-09-02 07:07:17 -07:00

multi.tcl

Add 'WithDictIndex' expiry API and update RANDOMKEY command (#1155 )

2024-10-16 17:40:11 -07:00

networking.tcl

Improve multithreaded performance with memory prefetching (#861 )

2024-08-26 21:10:44 -07:00

obuf-limits.tcl

rename procedure redis_deferring_client to valkey_deferring_client (#270 )

2024-04-09 10:38:09 -04:00

oom-score-adj.tcl

Check user's oom_score_adj write permission for oom-score-adj test (#13111 )

2024-03-05 14:42:28 +02:00

other.tcl

Add support for setting the group on a unix domain socket (#901 )

2024-08-23 11:52:08 -07:00

pause.tcl

Fix primary crash when processing dirty slots during shutdown wait / failover wait / client pause (#1131 )

2024-11-15 16:47:15 +08:00

protocol.tcl

Rename redis to valkey in test suite logs and test names. (#366 )

2024-04-25 15:13:21 +08:00

pubsub.tcl

Revert "Decline unsubscribe related command in non-subscribed mode" (#1265 )

2024-11-07 20:05:16 -05:00

pubsubshard.tcl

Revert "Decline unsubscribe related command in non-subscribed mode" (#1265 )

2024-11-07 20:05:16 -05:00

querybuf.tcl

Async IO threads (#758 )

2024-07-08 20:01:39 -07:00

quit.tcl

flushSlavesOutputBuffers should not write to replicas scheduled to drop (#12242 )

2023-06-12 14:05:34 +03:00

replybufsize.tcl

Rename redis_client* procedure to valkey_client* in test environment (#276 )

2024-04-10 10:18:47 -04:00

scan.tcl

Finish postponed SCAN changes (#501 )

2024-05-17 13:35:31 +02:00

scripting.tcl

Fix aof race in shutdown nosave timedout script test (#1156 )

2024-10-13 22:06:28 +08:00

shutdown.tcl

rename procedure redis_deferring_client to valkey_deferring_client (#270 )

2024-04-09 10:38:09 -04:00

slowlog.tcl

Correctly recode client infomation to the slowlog when runing script (#805 )

2024-08-10 23:46:56 +08:00

sort.tcl

Fix SORT GET to ignore special pattern # in cluster slot check (#1182 )

2024-10-19 14:56:10 +08:00

tls.tcl

Replace "redis" with "valkey" test code (#287 )

2024-04-18 15:57:17 +02:00

tracking.tcl

Rename redis_client* procedure to valkey_client* in test environment (#276 )

2024-04-10 10:18:47 -04:00

violations.tcl

Run large-memory tests as solo. (#10626 )

2022-04-24 17:29:35 +03:00

wait.tcl

Fix the wrong woff when execute WAIT / WAITAOF in script (#776 )

2024-07-22 10:33:10 +02:00