Binbin e1b3629186 Fix data loss when a replica does a failover with an old historical repl offset (#885)
A replica can currently initiate a failover without restriction when
it detects that its primary node is offline. This is generally not a
problem, but consider the following scenarios:

1. During slot migration, a primary loses its last slot and then becomes
a replica. Before it has fully synchronized with the new primary, the
new primary goes down.

2. Via the CLUSTER REPLICATE command, a replica starts replicating from
another primary. Before it has fully synchronized with the new primary,
the new primary goes down.

In the above scenarios, case 1 may cause the empty ex-primary to be
elected as the new primary, resulting in the loss of all of the shard's
data. Case 2 may cause a replica that still holds its old, unrelated
data set to be elected as the new primary, resulting in data loss and
confusion.

The root cause is the cached primary logic used for psync. In the above
scenarios, when clusterSetPrimary is called, the node caches
server.primary in server.cached_primary for a later partial
resynchronization. replicationGetReplicaOffset then returns
server.cached_primary->reploff as the replica's offset, which is
gossiped and used for ranking. As a result, the replica initiates the
failover with the old historical offset, gets a good rank, starts the
election first, and ends up elected as the new primary.
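
For reference, the offset lookup has roughly this shape (a paraphrase
of the replication.c logic; treat the exact body as a sketch):

    long long replicationGetReplicaOffset(void) {
        long long offset = 0;

        if (server.primary_host != NULL) {
            if (server.primary) {
                /* Live link: use the current replication offset. */
                offset = server.primary->reploff;
            } else if (server.cached_primary) {
                /* Disconnected: fall back to the primary cached for
                 * psync -- this is where the stale offset comes from. */
                offset = server.cached_primary->reploff;
            }
        }
        /* The offset may be -1 when the replica never synced. */
        if (offset < 0) offset = 0;
        return offset;
    }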

The main problem is that, as long as the replica has not completed the
full sync, replicationGetReplicaOffset may return this historical offset.
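
To see why a stale offset matters, recall how the rank feeds the
election delay (a simplified sketch of the clusterGetReplicaRank logic;
the struct field names here are illustrative, the real ones live in the
cluster code):

    /* Rank = number of sibling replicas claiming a replication
     * offset larger than ours. */
    int clusterGetReplicaRank(void) {
        long long myoffset = replicationGetReplicaOffset();
        clusterNode *primary = myself->replicaof; /* illustrative */
        int rank = 0;

        if (primary == NULL) return 0;
        for (int j = 0; j < primary->num_replicas; j++) {
            if (primary->replicas[j] != myself &&
                primary->replicas[j]->repl_offset > myoffset)
                rank++;
        }
        return rank;
    }

The election delay grows with the rank, roughly
500ms + random() % 500 + rank * 1000ms, so a large stale offset means
rank 0 and the shortest delay: the unhealthy replica asks for votes first.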

The fix is to clear cached_primary in the places where a full sync is
clearly required, and let the replica participate in the election with
offset == 0. This way the unhealthy replica gets a worse rank and is
less likely to be elected.
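
The shape of the fix, assuming the cached-primary discard helper is
named replicationDiscardCachedPrimary() (the diff has the exact call
sites):

    /* In clusterSetPrimary() and similar full-sync paths: the cached
     * primary can never be reused for a psync with the new primary,
     * and its reploff poisons the election ranking, so drop it. */
    if (server.cached_primary) replicationDiscardCachedPrimary();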

Of course, it may still be elected even with offset == 0. In the
future, we may need to prohibit replicas with offset == 0 from
initiating elections at all.
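
A hypothetical guard for that future change could sit at the start of
the failover handler (sketch only, not part of this PR):

    /* offset == 0 means no replication history at all: do not
     * request votes, leave the election to healthier siblings. */
    if (replicationGetReplicaOffset() == 0) return;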

Another point worth mentioning, in the above cases:
1. In the ROLE command, the replica state will be handshake, and the
offset will be -1.
2. Before this PR, in the CLUSTER SHARDS command, the replica health
will be online, and the offset will be the old cached value (which is wrong).
3. After this PR, in CLUSTER SHARDS, the replica health will be loading,
and the offset will be 0.

Signed-off-by: Binbin <binloveplay1314@qq.com>
2024-08-21 13:11:21 +08:00