futriix

Author	SHA1	Message	Date
antirez	60f346889b	Cluster announce port: set port/bport for myself at startup.	2016-01-29 09:06:37 +01:00
antirez	427cf6170f	Cluster: persist bus port in nodes.conf.	2016-01-29 09:06:37 +01:00
antirez	e1c985760e	Cluster announce ip: take myself->ip always in sync.	2016-01-29 09:06:37 +01:00
antirez	51a16c9ca0	Cluster announce ip / port initial implementation.	2016-01-29 09:06:37 +01:00
antirez	be9dd0137d	Cluster: check packets length before accessing far fields.	2016-01-27 16:35:21 +01:00
antirez	14e38f6eff	Cluster: mismatch sender ID log put back at DEBUG level.	2016-01-26 14:21:18 +01:00
antirez	de6c8091e9	Cluster: fix missing ntohs() call to access gossip section port.	2016-01-26 14:18:13 +01:00
antirez	15130c50d2	Better address udpate strategy when processing gossip sections. The change covers the case where: 1. There is a node we can't reach (in fail or pfail state). 2. We see a different address for this node, in the gossip section sent to us by a node that, instead, is able to talk with the node we cannot talk to. In this case it's a good bet to switch to the address reported by this node, since there was an address switch and it is able to talk with the node and we are not. However previosuly this was done in a dangerous way, by initiating an handshake. The handshake, using the MEET packet, forces the receiver to join our cluster, and this is not a good idea. If the node in question really just switched address, but is the same node, it already knows about us, so we just need to perform an address update and a reconnection. So with this commit instead we just update the address of the node, release the node link if any, and attempt to reconnect in the next clusterCron() cycle. The commit also improves debugging messages printed by Cluster during address or ID switches.	2016-01-26 12:32:53 +01:00
antirez	7ef896f96c	Minor MIGRATE refactoring. Centralize cleanup of newargv in a single place. Add more comments to help a bit following a complex function. Related to issue #3016.	2016-01-19 09:53:04 +01:00
antirez	8f080412c7	More variadic MIGRATE fixes. Another leak was fixed in the case of syntax error by restructuring the allocation strategy for the two dynamic vectors. We also make sure to always close the cached socket on I/O errors so that all the I/O errors are handled the same, even if we had a previously queued error of a different kind from the destination server. Thanks to Kevin McGehee. Related to issue #3016.	2016-01-19 09:28:43 +01:00
antirez	164c7a6250	Various fixes to MIGRATE with multiple keys. In issue #3016 Kevin McGehee identified multiple very serious issues in the new implementation of MIGRATE. This commit attempts to restructure the code in oder to avoid mistakes, an analysis of the new implementation is in progress in order to check for possible edge cases.	2016-01-18 16:49:21 +01:00
antirez	4f2f067778	Cluster: fix setting nodes slaveof pointer to NULL on node release. With this commit we preserve the list of nodes that have .slaveof set to the node, even when the node is turned into a slave, and make sure to fix the .slaveof pointers to NULL when a node is freed from memory, regardless of the fact it's a slave or a master. Basically we try to remember the logical master in the current configuration even if the logical master advertised it as a slave already. However we still remember the associations, so that when a node is freed we can fix them. This should fix issue #3002.	2016-01-14 17:34:49 +01:00
antirez	32a1a089df	CLUSTER BUMPEPOCH initial implementation fixed.	2016-01-11 15:39:11 +01:00
antirez	865495465d	Cluster: CLUSTER BUMPEPOCH introduced to help redis-trib fix. Sometimes during "fixes" we have to setup a new configuration and assign slots to nodes. With BUMPEPOCH we can make sure the new configuration of the node will win if there are conflicting configurations (for example another node is also claiming the same slot because the cluster is totally messed up).	2016-01-11 15:01:14 +01:00
antirez	0c2ee17244	Cluster: don't allow CLUSTER SETSLOT with slaves.	2016-01-11 15:00:45 +01:00
antirez	85a9177ae0	Allow MIGRATE to always be called on local keys for open slots. Extend the MIGRATE extra freedom to be able to be called in the context of the local slot, anytime there is a slot open in one or the other direction (importing or migrating). This is useful for redis-trib to fix the cluster when it has in an odd state. Thix fix allows "redis-trib fix" to make its work in certain cases where previously an error was reported.	2016-01-08 15:04:16 +01:00
antirez	647b423fd4	Fix typos & grammar in clusterBumpConfigEpochWithoutConsensus() comment.	2016-01-08 12:07:54 +01:00
antirez	2b63007be9	Cluster: don't send -ASK to MIGRATE. For non existing keys, we don't want to send -ASK redirections to MIGRATE, since when moving slots from the migrating node to the importing node, we want just to ignore keys that are no longer there. They may be expired or deleted between the GETKEYSINSLOT call and the MIGRATE call. Otherwise this causes an error during migrations with redis-trib (or equivalent cluster management tools).	2016-01-06 12:14:49 +01:00
antirez	21c3376ef7	Suppress harmless warnings.	2015-12-16 12:36:32 +01:00
antirez	5e55c3929a	MIGRATE: Fix new argument rewriting refcount handling.	2015-12-11 14:26:41 +01:00
antirez	c7500f497c	MIGRATE: fix replies processing and argument rewriting. We need to process replies after errors in order to delete keys successfully transferred. Also argument rewriting was fixed since it was broken in several ways. Now a fresh argument vector is created and set if we are acknowledged of at least one key.	2015-12-11 14:04:47 +01:00
antirez	e3bb88e4f7	Pipelined multiple keys MIGRATE.	2015-12-11 13:38:26 +01:00
antirez	60daa127fd	Cluster: replica migration with delay. We wait a fixed amount of time (5 seconds currently) much greater than the usual Cluster node to node communication latency, before migrating. This way when a failover occurs, before detecting the new master as a target for migration, we give the time to its natural slaves (the slaves of the failed over master) to announce they switched to the new master, preventing an useless migration operation.	2015-12-11 09:19:06 +01:00
antirez	6aec0c37e2	Remove debugging message left there for error.	2015-12-10 08:56:33 +01:00
antirez	923098bbda	Fix replicas migration by adding a new flag. Some time ago I broken replicas migration (reported in #2924). The idea was to prevent masters without replicas from getting replicas because of replica migration, I remember it to create issues with tests, but there is no clue in the commit message about why it was so undesirable. However my patch as a side effect totally ruined the concept of replicas migration since we want it to work also for instances that, technically, never had slaves in the past: promoted slaves. So now instead the ability to be targeted by replicas migration, is a new flag "migrate-to". It only applies to masters, and is set in the following two cases: 1. When a master gets a slave, it is set. 2. When a slave turns into a master because of fail over, it is set. This way replicas migration targets are only masters that used to have slaves, and slaves of masters (that used to have slaves... obviously) and are promoted. The new flag is only internal, and is never exposed in the output nor persisted in the nodes configuration, since all the information to handle it are implicit in the cluster configuration we already have.	2015-12-09 23:03:18 +01:00
antirez	3c5a6bd0b2	Redis Cluster: hint about validity factor when slave can't failover.	2015-11-27 08:59:17 +01:00
antirez	cf03e071ea	Lazyfree: ability to free whole DBs in background.	2015-10-01 13:02:26 +02:00
antirez	c4175281f6	Lazyfree: Sorted sets convereted to plain SDS. (several commits squashed)	2015-10-01 13:02:24 +02:00
antirez	cf7b70e5d2	RDMF: use representClusterNodeFlags() generic name.	2015-07-27 15:08:58 +02:00
antirez	3064487047	RDMF: more names updated.	2015-07-27 15:03:10 +02:00
antirez	c15cac0d77	RDMF: More consistent define names.	2015-07-27 14:37:58 +02:00
antirez	8a893fa4cf	RDMF: REDIS_OK REDIS_ERR -> C_OK C_ERR.	2015-07-26 23:17:55 +02:00
antirez	58844a7bfe	RDMF: redisAssert -> serverAssert.	2015-07-26 15:29:53 +02:00
antirez	62b27ebc2a	RDMF: OBJ_ macros for object related stuff.	2015-07-26 15:28:00 +02:00
antirez	fa26d3dd63	RDMF: use client instead of redisClient, like Disque.	2015-07-26 15:20:52 +02:00
antirez	e2b858a580	RDMF: redisLog -> serverLog.	2015-07-26 15:17:43 +02:00
antirez	6a424b5e36	RDMF (Redis/Disque merge friendlyness) refactoring WIP 1.	2015-07-26 15:17:18 +02:00
Jan-Erik Rediger	2ca9748952	Do not attempt to lock on Solaris	2015-06-24 14:57:15 +02:00
antirez	57daae8758	Don't try to bind the source address for MIGRATE Related to issues #2609 and #2612.	2015-06-11 14:34:38 +02:00
antirez	64ae753eb0	Cluster: redirection refactoring + handling of blocked clients. There was a bug in Redis Cluster caused by clients blocked in a blocking list pop operation, for keys no longer handled by the instance, or in a condition where the cluster became down after the client blocked. A typical situation is: 1) BLPOP <somekey> 0 2) <somekey> hash slot is resharded to another master. The client will block forever int this case. A symmentrical non-cluster-specific bug happens when an instance is turned from master to slave. In that case it is more serious since this will desynchronize data between slaves and masters. This other bug was discovered as a side effect of thinking about the bug explained and fixed in this commit, but will be fixed in a separated commit.	2015-03-24 11:56:24 +01:00
antirez	6a44ada2ac	Two cluster.c comments improved.	2015-03-21 12:12:23 +01:00
antirez	f9090fccdd	Cluster: TAKEOVER option for manual failover.	2015-03-21 11:54:32 +01:00
antirez	2946fba29d	Cluster: non-conditional steps of slave failover refactored into a function.	2015-03-20 17:56:21 +01:00
antirez	dbe9d75c48	Cluster: separate unknown master check from the rest. In no case we should try to attempt to failover if myself->slaveof is NULL.	2015-03-20 16:56:59 +01:00
antirez	7a24091ef4	Cluster: refactoring around configEpoch handling. This commit moves the process of generating a new config epoch without consensus out of the clusterCommand() implementation, in order to make it reusable for other reasons (current target is to have a CLUSTER FAILOVER option forcing the failover when no master majority is reachable). Moreover the commit moves other functions which are similarly related to config epochs in a new logical section of the cluster.c file, just for clarity.	2015-03-20 16:42:52 +01:00
antirez	99e8cc230d	Cluster: better cluster state transiction handling. Before we relied on the global cluster state to make sure all the hash slots are linked to some node, when getNodeByQuery() is called. So finding the hash slot unbound was checked with an assertion. However this is fragile. The cluster state is often updated in the clusterBeforeSleep() function, and not ASAP on state change, so it may happen to process clients with a cluster state that is 'ok' but yet certain hash slots set to NULL. With this commit the condition is also checked in getNodeByQuery() and reported with a identical error code of -CLUSTERDOWN but slightly different error message so that we have more debugging clue in the future. Root cause of issue #2288.	2015-03-20 09:59:28 +01:00
antirez	ad7956ce89	Cluster: more robust slave check in CLUSTER REPLICATE. There are rare conditions where node->slaveof may be NULL even if the node is a slave. To check by flag is much more robust.	2015-03-18 12:10:14 +01:00
antirez	a0f82e4ee6	Cluster: fix CLUSTER NODES optimization error in 'j' increment.	2015-03-13 13:16:35 +01:00
antirez	7ba9877884	Cluster: CLUSTER NODES speedup.	2015-03-13 11:26:04 +01:00
Michel Martens	0480f01fc6	Add command CLUSTER MYID	2015-03-10 16:43:19 +00:00

1 2 3 4 5 ...

499 Commits